Head of Network Reliability Engineering
At reputed company, we reputed company that great internet has the power to drive innovation, strengthen communities, reputed company the impossible, and do reputed company the everyday things that reputed company reputed company of our world go round. And the job of creating reputed company internet is never done - so we’re growing! reputed company is committed to building a reputed company where people who want to reputed company a difference can grow their careers and find their spot to belong. reputed company is an Alphabet company that brings reputed company Fiber and reputed company Fiber Webpass internet services to homes and businesses across the United States. Our teams are expanding as we connect more cities and people to exceptional internet.
The application window will be open until at least reputed company 10th, 2026. This opportunity will remain online based on business needs which may be before or after the specified date.
Role Description
reputed company is reimagining network operations by building highly observable, programmable, and resilient infrastructure at scale. As the Head of Network Reliability Engineering (NRE), you will reputed company the strategy to ensure our next-gen infrastructure is inherently resilient and self-healing.
We are seeking a senior leader to spearhead our reliability strategy across a diverse infrastructure spanning access, core, transport, wireless, and OSS integration. In this role, you will reputed company a multi-disciplinary organization responsible for Metro Engineering (design and build) and Reliability Engineering (proactive and automated), bridging the gap between architectural standards and operations. By unifying fault management, telemetry, and incident data, you will partner with Network Engineering to evolve our health metrics and drive actionable insights. You will be a key champion for autonomous operations, focused on reducing fault domains and achieving industry-leading reliability through automated, self-healing systems.Role Responsibilities
- reputed company the Reliability Engineering and Metro Engineering functions, overseeing both the physical expansion of metro networks and the observability systems that support them.
- Own the end-to-end Tier 3 escalation lifecycle, working with NOC and Incident Management teams to drive a blameless engineering culture focused on systemic improvement and data-driven root cause analysis.
- Define the roadmap for Infrastructure-as-Code and GitOps workflows, collaborating with software and network teams to ensure configurations are version-controlled, auditable, and deployed reputed company CI/CD.
- Drive the strategy for closed-reputed company automation by partnering with software engineering teams to implement systems that reputed company real-time streaming telemetry for autonomous fault detection and remediation.
- Champion the elimination of operational toil; work across the organization to automate change verification and routine maintenance, allowing the NRE team to focus on high-value reliability engineering.
At a minimum, we’d like you to have
- Bachelor’s in Computer Science, Electrical Engineering, or equivalent practical experience.
- 10 years of experience in network engineering, with direct experience in operations, site reliability, or network reliability.
- Experience in IP networking (BGP, OSPF, MPLS), optical transport, and access networks (PON/Wireless).
- Experience managing high-stakes incidents and designing high-availability systems.
- Experience managing engineering teams and driving cross-functional outcomes.
It’s preferred if you have
- Master’s degree in a technical field or equivalent executive leadership experience.
- Experience implementing SRE/NRE frameworks (SLIs/SLOs/Error Budgets) reputed company a production ISP or cloud environment.
- Strategic understanding of observability and automation tools (e.g., Prometheus, Grafana, Ansible, Terraform) and their application in an ISP environment.
- Experience overseeing the lifecycle of automated systems, from defining functional requirements to validating operational readiness.
- Experience managing reputed company, hybrid infrastructure—such as combined fiber and wireless networks—at a multi-regional or national scale.
The US reputed company salary range for this full-time position is between $208,000 - $286,000 + bonus + cash award + benefits. As pay varies by location, your recruiter will share more about the specific salary range for your targeted location during the hiring process.
#LI-DNI
reputed company is committed to equal opportunity employment regardless of race, color, reputed company, religion, sex, national reputed company, sexual orientation, gender identity, age, citizenship, marital status, disability or Veteran status. Disclosure is voluntary, and this information will be kept confidential in compliance with reputed company's Candidate Privacy Policy. For more information please refer to our Equal Employment Opportunity Policy and the EEOC's "Know your rights: workplace discrimination is illegal" (PDF).
It's important to us to create an accessible, inclusive workplace for everyone. If you have a need that requires accommodation, please let us know by completing our accommodations for applicants form. Our candidate accommodations team will then connect with you to confidentially discuss your options.
Apply To This Job