[Remote] Sr Site Reliability Engineer
Note: This is a remote position open to candidates in the USA. The company is a global leader in pre-K–12 education technology, providing solutions that help educators enhance student learning experiences. It is seeking a Sr Site Reliability Engineer to join its Engineering Enablement group, focusing on improving application and infrastructure reliability while supporting its SaaS platform, which is used by millions.
Responsibilities
- Work with engineering & governance teams to improve the observability, reliability, resiliency, and auditability of our systems and to minimize or prevent downtime
- Contribute to infrastructure-as-code using Terraform & CloudFormation
- Support CI/CD pipelines that ensure the release of high-quality software
- Collaborate with cross-functional teams to resolve infrastructure issues
- Run Disaster Recovery exercises on our products
- Explore and integrate AI tooling into SRE workflows
- Be part of an on-call rotation & support off-hours incidents & deployments
- Demonstrate strong skills in giving constructive feedback through coaching, even without direct reports
Skills
- 5+ years of experience focused on SRE
- Experience in managing & monitoring containerized cloud environments in production, preferably AWS EKS
- Experience with IaC, configuration management, and orchestration tools such as Terraform/Ansible
- Hands-on experience with a programming or scripting language such as .NET/Java, Python, or JavaScript
- On-call experience & willingness to be on call during non-work hours and weekends
- Experience working in an agile environment
- BS in Information Systems or Computer Science, equivalent field experience, or both
- Managing Kubernetes clusters (EKS) at scale using Helm
- Setting up pipelines & workflows
- Experience setting up monitoring, logging, alerting & observability in tools such as New Relic, Grafana, and CloudWatch
- Experience with HashiCorp Boundary, etc.
- Experience with Redshift, OpenSearch/zero-ETL
- Experience running Disaster Recovery exercises
- Implementing service level objectives (SLOs/SLIs/SLAs) & error budgets
- Experience using Claude Code for agentic coding and agentic SDLC, and enabling/rolling out agentic DX
Benefits
- World Class Health Benefits: Medical, Prescription, Dental, Vision, Telehealth
- Health Savings and Flexible Spending Accounts
- 401(k) and Roth 401(k) with company match
- Paid Vacation and Sick Time Off
- 12 Paid Holidays
- Parental Leave (20 total weeks with 14 weeks paid) & Milk Stork program
- Tuition Reimbursement
- Life & Disability Insurance
- Well-being and Employee Assistance Programs
- *Benefits listed apply to eligible U.S. employees in accordance with Renaissance's benefits eligibility criteria. Contractor and other nonemployee roles are not eligible for Renaissance employee benefits.*
Company Overview