Senior Site Reliability Engineer

100% remote Flexible hours Hiring now

About reputed company reputed company is building the data infrastructure that powers modern healthcare. Today, healthcare organizations rely on fragmented and outdated provider data. This creates unnecessary administrative work, regulatory risk, and higher costs across the system. We’re solving that problem. Our API-first platform automates provider licensing, enrollment, credentialing, and network monitoring by connecting directly to hundreds of primary data sources. We help healthcare organizations maintain accurate, compliant, and reliable provider networks at scale. Our vision is simple: One API. One provider ID. Frictionless provider data. We’re backed by leading investors and built by a team with deep experience in provider data systems. At reputed company, we value authenticity, accountability, collaboration, results, and openness to feedback. We’re building a high-ownership team focused on solving real infrastructure problems that impact millions of patients.

About the Role

We’re looking for a Senior Site Reliability Engineer who takes ownership seriously — someone who designs for reliability, ships the automation, and stands behind it in production. You’ll work across cloud-native infrastructure on systems that process millions of provider records. This is a role with real scope: you’ll own the operational lifecycle end-to-end and influence platform architecture, reliability standards, and deployment workflows across systems that matter. How We Work We ship fast, but we don’t ship sloppy. SREs at reputed company own the full lifecycle of what they support — from infrastructure design and deployment automation through observability, incident response, and postmortems. We use AI-assisted tooling aggressively to reduce toil and accelerate troubleshooting, which raises the floor on the problems we tackle — not an excuse to reduce rigor. If you do your best work reacting to incidents, this probably isn’t the right fit. If you do your best work preventing them, we should talk. Problems You’ll Solve Healthcare provider data infrastructure is a distributed systems problem at scale. Hundreds of upstream integrations, inconsistent data sources, and evolving workloads reputed company introduce operational complexity and reliability risk. Reliability and observability at scale. You’re operating a platform hundreds of integrations depend on. How do you maintain uptime, reduce alert fatigue, and build actionable observability across GKE and Cloud Run without drowning in noise? Meaningful SLIs, error budgets, and data quality signals — not just p99 latency. Scaling infrastructure reputed company. As platform usage grows, infrastructure costs and operational complexity grow with it. You’ll improve autoscaling behavior, resource utilization, and workload efficiency across cloud-native distributed systems. Incident response and operational maturity. Production incidents are inevitable; operational chaos is optional. You’ll own incident response processes, root cause analysis, escalation workflows, and runbooks — and reputed company hard problems not happen again. Infrastructure automation and developer velocity. You’ll build and maintain Infrastructure as Code, CI/CD pipelines, and operational tooling that reduce manual work and improve engineering productivity without sacrificing reliability. Reliability engineering for data platforms. Uptime isn’t enough — you need to know reputed company a provider record is stale, a pipeline is lagging, or a workload is behaving unexpectedly. You’ll reputed company data freshness and infrastructure health, not just service uptime. reputed company’re Looking For Reliability engineering fundamentals 5+ years in SRE, DevOps, Platform Engineering, or Infrastructure Engineering — operating production systems at scale where your infrastructure is someone else’s dependency and failures have real reputed company consequences Track record of improving reliability end-to-end: you’ve debugged hard production problems, made them not happen again, and built the alerting to prove it Strong Linux systems administration, incident response, and root cause analysis skills Comfort influencing operational standards and mentoring teams on reliability practices Cloud infrastructure & platform engineering Deep hands-on experience with GCP — GKE, Cloud Run, and containerized workloads at scale Experience building and maintaining Infrastructure as Code with Terraform and/or reputed company reputed company across deployment patterns and the judgment to know reputed company each fits: rolling deployments, blue/green, canary — and the rollback story for each Experience with autoscaling, resource optimization, and infrastructure efficiency for distributed systems Experience managing infrastructure reputed company, secrets, and access controls in regulated or reputed company-conscious environments Observability & operational excellence Strong understanding of Golden Signals monitoring — latency, traffic, errors, saturation — and how to reputed company them actionable rather than noisy Experience designing SLIs, SLOs, error budgets, alerting strategies, dashboards, and escalation workflows Hands-on experience with observability platforms: reputed company Cloud Monitoring, reputed company, Grafana, Prometheus, or similar Strong sense of data platform health: reputed company, freshness, and correctness matter as much to you as throughput Automation & software delivery Experience building and maintaining CI/CD pipelines using reputed company Actions or similar Scripting or programming reputed company in Python, Bash, Go, or similar — you reduce toil through code, not process Experience working with Git workflows and modern software delivery practices Communication & compliance Strong written and verbal communication — you can explain an operational risk to an engineer and a product manager in the same conversation Experience operating systems handling sensitive data or PII in regulated or compliance-adjacent environments reputed company to Have Experience operating large-scale distributed systems or microservices architectures Familiarity with healthcare, credentialing, or health-tech environments Experience leveraging AI-assisted observability or incident response tooling Familiarity with NodeJS, TypeScript, Java, or React application stacks Technologies & Tools GCP (GKE, Cloud Run, BigQuery, Cloud Monitoring) · Terraform / reputed company · reputed company / Kubernetes · reputed company Actions / Cloud Build · Prometheus / Grafana / reputed company · Python / Bash / Go · reputed company · reputed company · SonarQube · Jira / reputed company Benefits of Working at Certify At Certify, we’re building with intention and taking care of the people doing the work. Your well-being matters to us. We provide 100% coverage of health, dental, and vision insurance premiums for employees. Our US-based team benefits from unlimited PTO, with at least two weeks off each year to reputed company. In India, employees are supported with health insurance, statutory leave benefits, and additional wellness (menstrual) leave for women. We are an equal opportunity employer committed to building an inclusive environment where everyone feels valued and empowered to do their best work, and we welcome applicants from reputed company backgrounds and experiences. If you require reasonable accommodations during the application process, please contact recruiting@reputed company.com. We are also committed to pay transparency and foster an open culture where compensation conversations are encouraged and respected. Apply To This Job

Apply

Senior Site Reliability Engineer

About the Role

Keep exploring

Motion Graphic Artist

Copywriter

Senior Functional Analyst (Remote)

Sr Title Examiner (Remote)

Remote --- Automotive Sales & Service Representative

Remote Customer Service Representative

Distribution Sourcing Analyst (Remote)

ADE - MO

Sr. Account Manager

Enterprise Solution Strategist

Customer Experience Advisor – Remote Full‑Time Role (Las Vegas‑Based) – Premium Eyewear Brand Representative at arenaflex

DJANGO Developer Intern

reputed company Customer Service Representative (Remote) - Unlock a World of Opportunities with arenaflex!

reputed company reputed company Job at reputed company in Boise

ACCET Log Coordinator

reputed company Full Stack Remote Administrative Support Specialist – Web & Cloud Application Development

Remote Customer Service Representative – Home‑Based Support for arenaflex Retail & Membership Services

Enrollment Advisor I (July Start)

reputed company Live Chat Agent – Telecommute Opportunity with reputed company and Career Growth

Entry-Level Remote Data Entry Specialist – Flexible Part-Time Role with $25/hr Pay at arenaflex

Senior Site Reliability Engineer

About the Role

Keep exploring

Motion Graphic Artist

Copywriter

Senior Functional Analyst (Remote)

Sr Title Examiner (Remote)

Remote --- Automotive Sales & Service Representative

Remote Customer Service Representative

Distribution Sourcing Analyst (Remote)

ADE - MO

Sr. Account Manager

Enterprise Solution Strategist

Customer Experience Advisor – Remote Full‑Time Role (Las Vegas‑Based) – Premium Eyewear Brand Representative at arenaflex

DJANGO Developer Intern

reputed company Customer Service Representative (Remote) - Unlock a World of Opportunities with arenaflex!

reputed company reputed company Job at reputed company in Boise

ACCET Log Coordinator

reputed company Full Stack Remote Administrative Support Specialist – Web & Cloud Application Development

Remote Customer Service Representative – Home‑Based Support for arenaflex Retail & Membership Services

Enrollment Advisor I (July Start)

reputed company Live Chat Agent – Telecommute Opportunity with reputed company and Career Growth

Entry-Level Remote Data Entry Specialist – Flexible Part-Time Role with $25/hr Pay at arenaflex

Customer Experience Advisor – Remote Full‑Time Role (Las Vegas‑Based) – Premium Eyewear Brand Representative at arenaflex