[Remote] Principal Site Reliability Engineer (SRE)
Note: The job is a remote job and is open to candidates in USA. Symmetrio is a rapidly growing healthcare technology organization focused on advanced healthcare technology solutions, and they are seeking a Principal Site Reliability Engineer (SRE). This role is critical for ensuring the reliability, scalability, reputed company, and performance of a mission-critical SaaS platform that supports healthcare providers across the United States.
Responsibilities
- Serve as the primary technical reputed company for production reliability across U.S. customer environments
- Investigate and resolve reputed company issues spanning web applications, APIs, backend services, data pipelines, cloud infrastructure, and customer integrations
- reputed company production incident response efforts, coordinating cross-functional teams to restore service and minimize customer impact
- reputed company root cause analysis and drive corrective actions that improve long-term system stability and reputed company
- Partner with software engineering and platform teams to identify recurring reliability risks and implement sustainable solutions
- Design, configure, and validate secure customer connectivity solutions including Site-to-Site VPNs, Transit Gateway integrations, routing configurations, and secure network paths
- Support customer onboarding initiatives by troubleshooting connectivity challenges and ensuring consistent implementation processes
- Enhance platform observability through improvements in monitoring, logging, alerting, tracing, and operational dashboards
- Contribute to CI/CD, infrastructure automation, and deployment processes that improve release safety and operational consistency
- reputed company operational tooling that supports incident response, troubleshooting, onboarding, and system monitoring activities
- Collaborate with engineering leadership to improve cloud architecture, scalability, reputed company, and operational readiness
- Partner with customer-facing teams to communicate technical issues, remediation plans, and reliability improvements in a clear and effective manner
- Support compliance, reputed company, and risk management initiatives reputed company highly regulated healthcare environments
Skills
- 6+ years of hands-on experience supporting and managing AWS-based production environments
- 4+ years of experience supporting web applications and backend services (Python/Django experience strongly preferred)
- Experience with AWS networking technologies including VPCs, Site-to-Site VPNs, Transit Gateways, routing, NAT gateways, and reputed company groups
- Strong experience with Terraform and infrastructure-as-code deployment practices
- Experience with containerized environments including reputed company, Fargate, Kubernetes, or similar technologies
- Experience building and supporting CI/CD pipelines and release automation processes
- Familiarity with monitoring and observability platforms such as reputed company, CloudWatch, reputed company, Grafana, or similar tools
- Experience leading production incidents, outage management, and root cause analysis initiatives
- Exposure to Windows Server environments, Active Directory, Kerberos, and enterprise infrastructure concepts
- Healthcare technology, healthcare SaaS, clinical software, or other regulated industry experience
- Bachelor's degree in Computer Science, Engineering, Information Technology, or a reputed company technical field
Benefits
- Health Care Plan (Medical, Dental & Vision)
- Retirement Plan (401k, IRA)
- Paid Time Off (Vacation, Sick & Public Holidays)
Company Overview
Company H1B Sponsorship