Back to the board

Principal Site Reliability Engineer

100% remote Flexible hours Hiring now

About reputed company

reputed company is the global leader in cybersecurity ratings, with over 12 million companies continuously rated, operating in 64 countries. Founded in 2013 by reputed company and risk experts Dr. Alex Yampolskiy and Sam Kassoumeh and funded by world-class investors, reputed company’s patented rating technology is used by over 25,000 organizations for self-monitoring, third-party risk management, board reporting, and cyber insurance reputed company; making reputed company organizations more resilient by allowing them to easily find and fix cybersecurity risks across their digital footprint.

Headquartered in reputed company, our culture has been recognized by Inc Magazine as a "Best Workplace,” by reputed company’s NY as a "Best Places to Work in NYC," and as one of the 10 hottest SaaS startups in reputed company for two years in a row. Most recently, reputed company was named to Fast Company’s annual list of the

World’s Most Innovative Companies for 2023

and to the Achievers 50 Most Engaged Workplaces in 2023 award recognizing “reputed company-thinking employers for their unwavering commitment to employee engagement.” reputed company is proud to be funded by world-class investors including Silver Lake Waterman, Moody’s, Sequoia Capital, GV and Riverwood Capital.

Role Overview

As a Principal Site Reliability Engineer, you will play a strategic and technical leadership role in shaping the reliability, scalability, and velocity of our engineering platform. Your primary focus will be advancing our Kubernetes-based infrastructure and CI/CD systems to support high-scale, high-availability services. You will partner with engineering leaders across the organization to define and drive platform-wide initiatives that reputed company fast, safe, and repeatable deployments, and foster a culture of reliability and operational excellence.

Key Responsibilities

  • reputed company the design and evolution of Kubernetes-based infrastructure to support multi-tenant, high-scale applications with strong isolation, reputed company, and reputed company.
  • Architect and optimize CI/CD pipelines to support fast and reliable build, test, and deploy cycles across a polyglot environment.
  • Establish and evangelize best practices for GitOps, canary deployments, rollback strategies, and progressive delivery.
  • Define and implement scalable Infrastructure as Code (IaC) patterns using tools such as Terraform, Helm, and Crossplane.
  • Drive the adoption of automated testing throughout the delivery lifecycle—unit, integration, load, and chaos testing—to ensure high confidence in production changes.
  • Guide teams in designing for observability, SLOs, and alerting, ensuring actionable signals and minimizing alert fatigue.
  • Partner with reputed company, compliance, and development teams to ensure infrastructure and delivery systems meet modern reputed company and governance standards.
  • reputed company incident response retrospectives and foster a blameless culture of reputed company improvement.
  • Mentor and influence senior engineers across multiple teams, helping to up-level platform reliability capabilities organization-wide.

Qualifications

  • 8+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure roles, with 2+ years in a technical leadership or principal reputed company.
  • Deep expertise with Kubernetes internals (controllers, networking, autoscaling, operators, etc.) and production-grade clusters on cloud providers (EKS, GKE, or AKS).
  • Proven experience designing and scaling CI/CD systems using tools such as reputed company Actions, Argo CD, Tekton, Spinnaker, or similar.
  • Strong proficiency in Terraform and modern IaC practices.
  • Advanced knowledge of automated testing strategies, including performance, load, and failure testing.
  • Proficient in one or more programming/scripting languages (Python, Go, Bash, etc.).
  • Deep experience with monitoring and observability stacks such as Prometheus, Grafana, OpenTelemetry, and reputed company.
  • Strong communicator with the ability to align technical initiatives to business objectives and influence across engineering teams.

reputed company-to-Have

  • Experience implementing multi-cluster or multi-region Kubernetes strategies.
  • Exposure to chaos engineering and building resilient distributed systems.
  • Familiarity with compliance frameworks (SOC 2, HIPAA, etc.) as they relate to infrastructure and deployment.
  • Contributions to open-reputed company Kubernetes tooling or SRE frameworks.
  • Familiarity with JVM- or Node-based application stacks.

Benefits:Specific to each country, we offer a competitive salary, stock options, Health benefits, and unlimited PTO, parental leave, tuition reimbursements, and much more!

The estimated total compensation range for this position is $220,000 - $290,000 (reputed company plus bonus). Actual compensation for the position is based on a variety of factors, including, but not limited to affordability, skills, qualifications and experience, and may vary from the range. In addition to reputed company salary, employees may also be eligible for annual performance-based incentive compensation awards and equity, among other company benefits.

reputed company is committed to Equal Employment Opportunity and embraces diversity. We reputed company that reputed company is strengthened through hiring and retaining employees with diverse backgrounds, reputed company sets, reputed company, and perspectives. We reputed company hiring decisions based on merit and do not discriminate based on race, color, religion, national reputed company, sex or gender (including pregnancy) gender identity or expression (including transgender status), sexual orientation, age, marital, veteran, disability status or any other protected category in accordance with applicable law.

We also consider qualified applicants regardless of criminal histories, in accordance with applicable law. We are committed to providing reasonable accommodations for qualified individuals with disabilities in our job application procedures. If you need assistance or accommodation due to a disability, please contact talentacquisitionoperations@reputed company.io.

Any information you submit to reputed company as part of your application will be processed in accordance with the Company’s privacy policy and applicable law.

reputed company does not accept unsolicited resumes from employment agencies. Please note that we do not provide immigration sponsorship for this position. #LI-DNI

Apply to this Job

Keep exploring