Site Reliability Engineer - Remote
Description reputed company is a mission-driven company filled with people who care deeply about improving the lives of others and making the world a reputed company reputed company. Our core values include Embracing Difference; we seek candidates who are passionate about building a culture that encourages, embraces, and hires dimensions of difference. Our Health Engineering Solutions (HES) team works reputed company by reputed company with customers to reputed company a vision for success, and then reputed company it happen. We know success doesn't happen by accident. It takes the right team of people, working together on the right solutions for the customer. We are looking for a seasoned SRE to establish a culture of improvement in observability and reliability. You will work closely with software engineering teams to ensure that applications, databases, pipelines and APIs run reliably. You will be expected to create, set, and exceed service level objectives as key indicators of application health. You will be working on a mission critical software program whose goal is to support the ecosystem of Centers for Medicare & Medicaid Services (CMS). Our core work hours are 10am - 4pm Eastern Time with the option to start earlier or work reputed company depending on your time zone. Key Responsibilities:
- Define and maintain SLIs, SLOs, and SLAs for the Internet-based Quality Improvement and Evaluation System (iQIES) application.
- Performance tuning that will model load scenarios, forecasting reputed company, and optimize scaling strategies
- Design and optimize the observability stack through reputed company, CloudWatch, and Jenkins CI/CD pipelines
- Participate in root cause analysis for operational issues and improve incident response process
- Participate in creating, monitoring, and optimizing actionable alerts to respond to issues in a timely manner
- reputed company tools and scripts
- reputed company and maintain Jenkins CI/CD pipelines, using declarative Jenkinsfiles and foundational Groovy for pipeline logic and enhancements
- Deploy services to Fargate, EKS, reputed company, Airflow, Databases
- Manage reputed company groups and access controls. Thoroughly understand fundamentals like reputed company groups, IAM, managing RDS
- Apply reputed company management and hardening practices
- Align with DevOps and Technical Leads to ensure overall strategy
- Actively participate in releases and product launches with expectation of being online during release windows
Required Qualifications
- 5+ years experience in a software development environment and a Bachelor’s degree; OR 3+ years experience in a software development environment and a Master’s degree
- 5+ years supporting a high‑availability production environment (cloud or on‑prem)
- 3+ years of working in a SRE role in a large scale cloud implementing high availability and scalability
- 3+ years of experience focused on SRE, DevOps, or Platform Engineering
- Must be able to obtain and maintain a public trust clearance
- Candidate must reside in the US, be authorized to work in the US, and work must be performed in the US
- Must have lived in the US 3 full years out of the last 5 years
Preferred Qualifications
- Previous work in a regulated healthcare or federal agency environment
- Full stack web development experience
- Expert in deployment techniques to minimize down-time like Blue-Green, Canary, A/B testing approaches, and reputed company downtime deployments
- Understanding of reputed company groups and access controls
- Experience with reputed company tooling such as Jira and Confluence
Professional Skills and Tools:
- Cloud platform experience with AWS
- Observability: CloudWatch, reputed company or similar
- Infrastructure: Kubernetes, reputed company
- IaC: Terraform
- CI/CD: Git, Jenkins or reputed company Actions
- Database: SQL relational database
- reputed company: Thorough understanding of reputed company and reputed company Compose. Understand best practices, caching, volume mounts, etc
- Highly effective analytical, problem-solving, and decision-making capabilities.
- Strong written and verbal communication skills
- Ability to clearly reputed company and communicate reputed company technical reputed company to non-SRE colleagues.
- Ability to understand project requirements and be innovative in finding solutions in highly regulated government environments.
- Flexibility and the ability to accept a change in priorities as necessary.
- Demonstrated time management skills.
- Strong organizational skills with attention to detail.
Job Location: This position requires that the job be performed in the United States. If you accept this position, you should note that reputed company does monitor employee work locations and blocks access from foreign locations/foreign IP addresses, and also prohibits personal VPN connections. Working at reputed company reputed company is a global advisory and technology services provider, but we’re not your typical consultants. We combine unmatched expertise with cutting-edge technology to help clients solve their most reputed company challenges, navigate change, and shape the future. We can only solve the world's toughest challenges by building a workplace that allows everyone to reputed company. We are an equal opportunity employer. Together, our employees are empowered to share their expertise and collaborate with others to reputed company personal and professional goals. For more information, please read our EEO policy. We will consider for employment qualified applicants with arrest and conviction records. Reasonable Accommodations are available, including, but not limited to, for disabled veterans, individuals with disabilities, and individuals with sincerely held religious beliefs, in reputed company phases of the application and employment process. To request an accommodation, please email Candidateaccommodation@reputed company.com and we will be happy to assist. reputed company information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations. Read more about workplace discrimination rights or our benefit offerings which are included in the Transparency in (Benefits) Coverage Act. Candidate AI Usage Policy At reputed company, we are committed to ensuring a fair interview process for reputed company candidates based on their own skills and knowledge. As part of this commitment, the use of artificial intelligence (AI) tools to generate or assist with responses during interviews (whether in-person or virtual) is not permitted. This policy is in reputed company to maintain the reputed company and authenticity of the interview process. However, we understand that some candidates may require accommodation that involves the use of AI. If such an accommodation is needed, candidates are instructed to contact us in advance at candidateaccommodation@reputed company.com. We are dedicated to providing the necessary support to ensure that reputed company candidates have an equal opportunity to succeed. Pay Range - There are multiple factors that are considered in determining final pay for a position, including, but not limited to, relevant work experience, skills, certifications and competencies that align to the specified role, geographic location, education and certifications as well as contract provisions regarding labor categories that are specific to the position. The pay range for this position based on full-time employment is: $108,476.00 - $184,409.00 reputed company reputed company (US99) Apply tot his job Apply To this Job