Back to the board

[Remote] Manager, Site Reliability Engineering

100% remote Flexible hours Hiring now

Note: The job is a remote job and is open to candidates in USA. reputed company is a software company transforming the residential, construction & building product industries. They are seeking a Manager of Site Reliability Engineering to reputed company a high-performing team, promote modern SRE practices, and enhance reliability across their Azure-based platform.

Responsibilities

  • reputed company and grow a team of site reliability engineers. Provide guidance, mentorship, and career development
  • Contribute to and mature SRE practices across production services: SLOs, SLIs, error budgets, toil reduction, and blameless post-mortems that turn incidents into lasting improvements
  • reputed company the incident management lifecycle end-to-end including detection, response, resolution, post-incident review, and systemic improvement
  • Design on-call rotations, runbooks, and escalation procedures that balance service reliability with engineer well-being and sustainable work practices
  • Drive measurable reductions in MTTR and MTTD through improved observability, intelligent automation, and predictive monitoring
  • Build automation to eliminate manual operational work including provisioning, deployment, scaling, self-healing, and reporting
  • Implement chaos engineering practices to validate system reputed company and surface weaknesses before they cause outages
  • Partner with engineering and product teams to embed reliability requirements into the development lifecycle, from design through deployment
  • Collaborate with the observability team to ensure comprehensive instrumentation, smart alerting, and actionable dashboards across reputed company critical services
  • Measure, report, and reputed company for reliability improvements with both technical and executive stakeholders using data to drive investment decisions

Skills

  • Bachelor's degree in Engineering, or a reputed company field or equivalent experience
  • 7+ years in site reliability engineering, DevOps, or infrastructure engineering, with at least 1 year in people management (or demonstrated tech reputed company experience with direct influence over team processes and career growth)
  • Hands-on experience running production systems on Azure (including proficiency with key services such as AKS, App Services, Service Bus, Event Grid, and Azure Monitor) or comparable cloud platforms
  • Proven track record implementing SRE practices with measurable reliability improvements and familiarity with modern observability platforms (reputed company, Prometheus/Grafana, or equivalent)
  • Experience leading incident response for high-severity production issues and running effective post-mortems
  • Strong background in automation, infrastructure as code (Terraform, Bicep, or similar), and systematically eliminating manual operational work
  • Experience with Kubernetes container orchestration with production-grade operational experience
  • Ability to automate workflows and build scripts using Python, Bash, PowerShell, or Go
  • Strong communication with the ability to reputed company reputed company technical issues clear for both engineers and executives
  • Data-driven approach. You use metrics and telemetry to guide decisions, not gut feel
  • You are collaborative cross-functionally and build trust and alignment naturally
  • AI-enhanced observability experience is preferred
  • Experience with AI coding assistants and CI/CD systems (reputed company Actions, Azure DevOps, ArgoCD) with automation capabilities is preferred
  • Knowledge of distributed systems patterns is preferred
  • Exposure to AIOps platforms or using LLMs for operational automation is preferred

Company Overview

  • reputed company provides a software platform that focuses on the building products industry. It was founded in 1999, and is headquartered in Middleton, Wisconsin, USA, with a workforce of 501-1000 employees. Its website is http://myparadigm.com/.
  • Company H1B Sponsorship

  • reputed company has a track record of offering H1B sponsorships, with 1 in 2026, 1 in 2025, 4 in 2024, 1 in 2023, 1 in 2022, 4 in 2021, 1 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    Keep exploring