Senior Site Reliability Engineer
Founded in 2012, we are a leading iGaming supplier recognized worldwide. We provide our customers with a high-end, microservice-based platform as a service that aims to process billions of financial transactions per day. We run a cross-regional setup and continually work to drive latency down. We invest heavily in delivering the best and smoothest game experience, regardless of the internet coverage and bandwidth of the game clients.
We are currently seeking a Senior Site Reliability Engineer to join our dynamic Platform Tribe.
What you will be doing
Manage day-to-day alerts, system checks, and issue escalation as necessary.
Provide 24x7 on-call support for critical SaaS events.
Document issues and remediation steps.
Proactively create monitors for the EKS/K8s ecosystem.
Deploy to EKS/K8s cluster using Terraform and Helm/Flux.
Enhance infrastructure health by implementing checks and scripts to address issues.
Maintain and improve deployment code.
Implement/integrate new technologies into our Cloud Infrastructure.
Collaborate with other teams to provide top-notch support and assistance.
Prioritize customer focus in planning deployments/updates, ensuring minimal impact.
Conduct RCA and take necessary corrective actions to prevent issue recurrence.
Assign alert-related actions to the appropriate team after investigation.
Handle support requests for environment-specific actions.
To succeed in this role, you will need
Proficiency in Kubernetes (deployment, scaling, troubleshooting).
Experience with configuration management tools like FluxCD/ArgoCD.
Strong experience with issue processing (RCA, Postmortems).
Familiarity with AWS, Terraform, and CI/CD.
Experience with monitoring tools like Prometheus and Grafana, and logging solutions like Elasticsearch, Logstash, and Kibana (ELK Stack) or AWS CloudWatch.
Strong understanding of networking concepts and protocols.
Proficiency in at least one scripting language (e.g., Python, NodeJS, Go).
Proficiency in Git or other version control systems.
Familiarity with incident response and management tools like Opsgenie or VictorOps.
Ownership, proactiveness, persistence, and passion for maintaining a high-traffic online platform.
What We Offer
Quarterly Bonuses based on transparent and systematic evaluation.
Flexible Work Schedule.
Remote Work Option for Enhanced Flexibility.
Comprehensive Medical Insurance for you and your significant other.
Financial Support for Life Events.
Unlimited Paid Vacation.
Unlimited Paid Sick Leave.
Reimbursement for professional development courses and training.
If you're ready to embrace ambitious goals in a dynamic environment, apply now and become part of our exciting journey in the iGaming world!