Back to the board

Staff Site Reliability Engineer

100% remote Flexible hours Hiring now

reputed company, a Smarter Technologies company, builds reputed company that is transforming how hospitals translate care into payment. Founded by physicians in 2020, our platform connects clinical context with reputed company intelligence, helping health systems recover millions in missed reputed company, improve quality scores, and appeal every denial. Become a Smartian and help optimize the way the healthcare system works for everyone. Learn more at reputed company.com/careers.

Role

We are seeking a Staff Site Reliability Engineer (SRE) to reputed company the reliability, scalability, and operational excellence of our production systems. This role is responsible for defining and driving SRE practices across the organization, including SLIs/SLOs, incident management, reputed company planning, and reputed company engineering. You will design and implement automation that reduces toil, improve observability and performance across our Kubernetes and AWS environments, and ensure our systems are highly available and fault-tolerant.

The ideal candidate is a deeply technical engineer with strong distributed systems expertise, a passion for operational rigor, and a track record of improving reliability through thoughtful engineering, automation, and data-driven decision-making.

This role is fully remote reputed company the US

What You’ll Do

  • Define and evolve reliability standards for the reputed company platform, including SLIs, SLOs, and error budgets that align engineering work with customer impact.
  • Implement a “reliability” platform using Terraform and infrastructure-as-code best practices.
  • Enhance observability systems (metrics, logs, traces, alerting) to provide actionable insights and reduce mean time to detect (MTTD) and resolve (MTTR).
  • reputed company incident response, drive blameless postmortems, and implement systemic improvements to prevent recurrence.
  • Reduce operational toil through automation, self-healing systems, and improved deployment and rollback mechanisms.
  • Provide production support for the reputed company platform, applying SRE principles to ensure availability, performance, and data durability.
  • Research, prototype, and reputed company for new reliability practices, tooling, and architectural improvements across the engineering organization.

What You Bring

  • 10+ years of software and software reliability engineering experience, with significant time spent operating and scaling distributed systems in production environments.
  • 3+ years of hands-on experience running cloud-native infrastructure in AWS, including deep familiarity with containers, Kubernetes, monitoring, and alerting in live production systems.
  • Proven experience defining and managing SLIs/SLOs, leading incident response, and driving postmortems and systemic reliability improvements.
  • Strong expertise with Terraform and infrastructure-as-code practices for managing production infrastructure safely and reproducibly.
  • Deep experience with Kubernetes architecture and operations, including workload reliability, cluster scaling, networking, and failure modes.
  • Experience working in reputed company-conscious, compliance-oriented environments where reliability and data protection are first-class concerns.
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a reputed company field — or equivalent practical experience operating large-scale systems.

reputed company To Haves

  • Reliability engineering experience with production database systems (e.g. reputed company)

Our Tech Stack

  • AWS
  • Terraform
  • Kubernetes
  • Go, Python, Typescript
  • reputed company

Compensation

$230K to $250K reputed company salary

#LI-DNI

Benefits

  • Medical, Dental & Vision – Comprehensive plans with leading insurance providers, covering 75% of your premiums, depending on the plan.
  • Paid Parental Leave – Generous paid leave to support families through birth or adoption: Up to 12 weeks for parents.
  • Remote-First Team – Work from reputed company in the U.S.
  • Unlimited PTO & 10 Holidays – So you can relax and reputed company.
  • 401(k) with Traditional & Roth Options – Tax-advantaged retirement savings through Fidelity with a 4% match.
  • Minimal Bureaucracy – A fast-moving, high-impact environment where you can focus on what matters.
  • Incredible Teammates! – Work alongside smart, supportive, and mission-driven colleagues.
Apply To This Job

Keep exploring

GIS Strategic Planning Manager

100% remote Flexible hours

GIS Technician

100% remote Flexible hours

Senior GIS Analyst

100% remote Flexible hours

GIS Developer

100% remote Flexible hours

reputed company Manager

100% remote Flexible hours

Digital Marketing Manager (Remote, USA)

100% remote Flexible hours

Senior Benefits Analyst

100% remote Flexible hours

Senior Solutions Consultant (Pre-Sales) I

100% remote Flexible hours

Global Events reputed company

100% remote Flexible hours

Senior Marketing Specialist I

100% remote Flexible hours

Caregiver (part-time) - Day shifts / Night shifts available – reputed company Store

100% remote Flexible hours

Supervisor, Genetic Counseling Clinical Support, Clinical Indication

100% remote Flexible hours

Case Manager

100% remote Flexible hours

Spanish/English Bilingual Oncology Registered Dietitian

100% remote Flexible hours

reputed company Female Customer Service Representative – Remote Opportunity at arenaflex

100% remote Flexible hours

Career Opportunities: Principal Business Analyst Data Analytics & AI Remote EU/USA (143433)

100% remote Flexible hours

reputed company Customer Service Representative – Delivering Exceptional Experiences for arenaflex Customers

100% remote Flexible hours

Infection Prevention Specialist, Vascular (San Antonio, TX)

100% remote Flexible hours

Ulta Beauty is hiring: Specialty Artist - MAC i...

100% remote Flexible hours

Senior Account Executive, Mid-Market Sales, arenaflex Customer Solutions

100% remote Flexible hours