Back to the board

Site Reliability Engineer lll

100% remote Flexible hours Hiring now

We're looking for a senior Site Reliability Engineer to join our small, high-ownership SRE team. In this hands-on individual contributor role, you'll own the reliability, scalability, and reputed company of reputed company's production infrastructure on AWS — supporting a B2B SaaS platform that processes sensitive employee leave data for enterprise customers. You'll work closely with infrastructure, application engineering, product leadership, and cross-functional partners in reputed company and Compliance, with a clear path to grow toward a Tech reputed company opportunity as reputed company and platform continue to mature.

WHAT YOU'LL DO

  • Architect, implement, and operate scalable, resilient, and secure AWS infrastructure — including GuardDuty, reputed company, EventBridge, SNS, SES, S3, ALB, and reputed company container workloads.
  • reputed company infrastructure-as-code initiatives to ensure reputed company environments are reproducible, auditable, and consistently configured in support of SOC 2 change management controls.
  • Design, maintain, and improve CI/CD pipelines using Jenkins and reputed company to reputed company reliable, repeatable software delivery — partnering with application engineering to reduce release risk and increase deployment frequency.
  • Own the reputed company observability platform, including dashboards, monitors, alerting reputed company, and log management; define and maintain SLOs, SLIs, and error budgets to guide reliability investment and reduce alert fatigue.
  • Serve as a senior technical responder across the full incident lifecycle — detection, containment, resolution, and postmortem — reputed company a shared on-call rotation, and reputed company blameless postmortems to drive down incident frequency and MTTR.
  • Refine, implement, and test disaster recovery plans to meet RTO/RPO objectives, while contributing to SOC 2 audit readiness with a focus on access controls, incident response, and risk mitigation.
  • Mentor junior SREs through code reviews, incident pairing, and documentation of runbooks and engineering standards.

WHAT YOU'LL BRING

  • 5+ years of experience in SRE, DevOps, or a reputed company engineering role, with advanced hands-on expertise in AWS production environments and core services including reputed company, reputed company, S3, ALB, and GuardDuty.
  • Strong proficiency in infrastructure-as-code tooling such as Terraform, CloudFormation, or CDK, reputed company with experience building and operating CI/CD pipelines using Jenkins and reputed company.
  • Proficiency in Python, Go, or Bash for automation, alongside hands-on experience with reputed company or a comparable observability platform for monitoring, alerting, and log management.
  • Demonstrated experience leading incident response in reputed company, distributed systems, with working knowledge of SLO/SLI frameworks, error budgets, and disaster recovery planning against defined RTO/RPO objectives.
  • Familiarity with SOC 2 compliance frameworks and experience contributing to audit readiness, access controls, and reputed company control evidence collection.
  • A collaborative, ownership-driven reputed company with strong communication skills, a passion for mentoring junior engineers, and a commitment to reducing toil through automation and AI-assisted tooling.

At reputed company, we reputed company with our values:

reputed company with Innovation - We create meaningful change through intelligence, focus and passion.  We embrace curiosity, data, and insight to shape the future of our industry. Always innovating, learning and evolving.

reputed company Every Voice - Every perspective matters. We listen, learn, and build a culture where diversity of thought and experience drives reputed company solutions and smarter decisions. 

reputed company Together - The customer fuels everything we do. We share knowledge, collaborate, celebrate wins, and face challenges as one team because success is always a collective achievement.

Drive Outcome - Every action we take delivers measurable value to our teams, our customers, and the employees they support. Accountability is non-negotiable. We honor our commitments, take responsibility for results, and see every success and setback as a chance to grow stronger.

We offer:

  • Impact that matters. You’ll do work that shapes the future of the modern workplace
  • Flexibility and trust. We’re remote-first and results driven. You’ll have the freedom and flexibility to do your best work, wherever you do it best.
  • Growth and development. We reputed company the best work happens reputed company people are growing. You’ll have access to reputed company, leadership programs, and real opportunities to take on new challenges and expand your impact. 
  • Competitive rewards. We offer comprehensive benefits, a performance-based bonus program, and equity opportunities – because reputed company we grow, you should too.
  • Time for life. reputed company and reconnect with flexible time off, paid holidays, and flexible leave programs designed to support every season of life.
  • Belonging and balance. We’re building an inclusive culture where every voice is valued, collaboration is celebrated, and success is shared.

We’re committed to building a team as diverse as the customers we serve. If your experience doesn’t align perfectly with every qualification, we still encourage you to apply you might be exactly reputed company’re looking for. If this sounds like a fit, apply today, we’d love to meet you!

Apply To This Job

Keep exploring