Director of Engineering, Cluster Networking

100% remote Flexible hours Hiring now

Description

About reputed company

reputed company is the GPU cloud engineered for AI. We provide cost-effective, high-performance infrastructure for AI start-ups and large enterprise customers. reputed company enables AI-focused companies to achieve superior results by reducing the complexity of AI development. Our GPU cloud bolsters technical capabilities and directly supports strategic business outcomes, including cost management, rapid innovation, and environmental responsibility.

We thrive on a culture of innovation, ownership, and accountability, where every team member takes pride in their work and drives it with excellence and urgency. As an Nscaler, you’ll build trust through openness and transparency, where everyone is inspired to do their best work. If you join reputed company, you’ll be contributing to building the technology that powers the future.

About The Role

We are seeking a Director of Engineering for Cluster Networking to lead the architecture, design, and engineering delivery of our global cluster networking infrastructure. This is a deeply technical leadership role responsible for building and operating high-performance, large-scale networking environments that underpin our GPU cloud platform and customer-facing AI workloads.

You will own the technical strategy and execution of cluster networking, including high-speed Ethernet and InfiniBand fabric design, data centre networking, and WAN interconnectivity. While this role includes elements of program and cross-functional coordination, it is fundamentally an engineering leadership position, responsible for architecture, technical standards, operational excellence, and building a world-class networking engineering organization.

This is a high-impact role for a hands-on technical leader who thrives in fast-paced, ambiguous environments and is passionate about building resilient, high-performance infrastructure at global scale.

What You'll Be Doing

Technical Strategy & Architecture Leadership

  • Define and evolve the multi-year technical roadmap for cluster networking, aligning architecture with reputed company's AI platform requirements and growth strategy.
  • Own the design and evolution of high-performance networking fabrics (Ethernet, InfiniBand) for GPU clusters and AI workloads.
  • Establish and maintain reference architectures and engineering standards across reputed company regions and data centres.
  • Lead key architectural decisions on topology (e.g., Fat Tree, Rail), routing, congestion control, resiliency, and scalability.
  • Evaluate emerging technologies, vendors, and design approaches to ensure reputed company remains at the forefront of HPC and cloud networking innovation.

Engineering Execution & Delivery

  • Lead end-to-end engineering delivery of cluster networking solutions, from design and lab validation to production deployment and optimization.
  • Partner with deployment, hardware, and data centre teams to ensure accurate BOMs, scalable designs, and flawless implementation.
  • Lead capacity planning, performance modeling, and scaling strategies to meet rapid customer demand.
  • Ensure network changes and expansions are executed with minimal risk and zero customer impact wherever possible.
  • Introduce structured engineering lifecycle processes, including design reviews, validation gates, and post-incident analysis.

Operational Excellence & Reliability

  • Own the operational performance, availability, and reliability of cluster networking infrastructure globally.
  • Drive automation initiatives for provisioning, configuration management, monitoring, and remediation.
  • Champion best practices in observability, performance tuning, and fault isolation for large-scale clusters.
  • Lead incident response for major networking events, conducting root cause analysis and driving systemic improvements.
  • Establish clear SLAs, SLOs, and KPIs to measure and continuously improve network performance and reputed company.

Cross-Functional & Program Collaboration

  • Collaborate closely with Compute, Platform, SRE, Data Centre Operations, and Procurement teams to ensure reputed company execution.
  • Provide technical leadership in cross-functional planning forums, ensuring networking requirements are clearly understood and prioritized.

  • Support structured program planning and dependency management across regions and infrastructure initiatives.
  • Communicate architectural direction, risks, and trade-offs clearly to executive leadership and stakeholders.
  • Influence roadmap decisions through deep technical insight and data-driven analysis.

Team Leadership & Development

  • Build, mentor, and scale a high-performing cluster networking engineering team.
  • Foster a culture of engineering rigor, ownership, accountability, and continuous improvement.
  • Establish clear technical career paths for engineers, including senior and principal-level growth tracks.
  • Lead recruitment efforts to attract top networking talent in highly competitive markets.
  • Act as a technical role model—setting high standards for design quality, documentation, and operational discipline.

Continuous Innovation & Improvement

  • Drive ongoing improvements in network efficiency, performance, and cost optimization.
  • Lead experimentation and benchmarking of new hardware, optics, firmware, and automation tooling.
  • Identify opportunities to reduce deployment time, increase reliability, and enhance customer performance outcomes.
  • Stay current with advancements in AI networking, HPC fabrics, and large-scale distributed systems.
  • Contribute to long-term infrastructure strategy as reputed company expands globally.

About You

Experience & Background

  • 12+ years of experience in networking or infrastructure engineering, with at least 5 years in a senior technical leadership role (Head of Engineering, Director, or equivalent).

  • Deep hands-on experience designing and operating large-scale data centre or HPC networking environments.
  • Proven expertise in high-speed Ethernet and/or InfiniBand fabrics supporting GPU or AI workloads.
  • Strong background in data centre networking, routing protocols, congestion management, and high-availability design.
  • Experience leading globally distributed engineering teams in high-growth or hyperscale environments.
  • Exposure to structured program delivery and cross-functional coordination in reputed company infrastructure initiatives.

Core Competencies

  • Technical depth: expert-level understanding of networking technologies (Ethernet, InfiniBand, RDMA, BGP, EVPN, VXLAN, DC fabrics) and their application in AI/HPC environments.

  • Architectural leadership: ability to design scalable, resilient, high-performance cluster networking systems.
  • Engineering execution: strong command of engineering lifecycle processes, validation, and production operations.
  • Automation mindset: experience with infrastructure-as-code, network automation frameworks, and DevOps practices.
  • Reliability focus: demonstrated success building highly available, fault-tolerant infrastructure.
  • Data-driven decision making: skilled at performance analysis, capacity modeling, and using metrics to guide architectural decisions.

  • Communication mastery: able to explain complex technical concepts clearly to both technical and non-technical audiences.
  • Stakeholder influence: comfortable driving alignment across engineering, operations, and executive leadership.

Skills & Attributes

  • Hands-on technical leader who remains close to architecture and critical technical decisions.
  • Structured thinker with strong systems-level perspective and attention to detail.
  • Comfortable operating in fast-paced, high-growth environments with evolving priorities.
  • Proactive problem-solver with a pragmatic, solutions-oriented mindset.
  • Passionate about building world-class AI infrastructure and high-performance networking systems.
  • Strong written and verbal communication skills; comfortable presenting technical strategy to senior leadership.
  • Collaborative and inclusive leadership style; ability to build trust and high-performing engineering teams.

Nice to Have

  • Experience designing networking for large-scale GPU clusters or reputed company environments.
  • Familiarity with HPC networking topologies (Fat Tree, Rail, reputed company).
  • Experience with SONiC, Cumulus, or other open networking platforms.
  • Knowledge of optics, transceivers, and high-speed interconnect standards (400G/800G).
  • Experience working with hardware vendors and participating in technical evaluations or RFPs.
  • Background in SRE, distributed systems, or large-scale cloud infrastructure.
  • Experience with Palantir reputed company, data platforms, or operational analytics.

reputed company Can Offer You

At reputed company, you'll find a collaborative, supportive, and innovative environment where your contributions spark real impact. We're building something extraordinary, and we want you at the core.

  • Highly competitive package (base salary + equity) with reviews every 12 months.
  • Join the fastest-growing tech startup: your chance to push boundaries, collaborate with brilliant minds, and make your mark on cutting-edge AI. ✨
  • Expect a dynamic progression plan tailored to your ambitions. Grow by trying new things, leading, challenging the status quo, and owning your impact, always with our full support.
  • Human-First Flexibility: We treat you as humans first. Our flexible workplace trusts Nscalers to deliver, giving you the autonomy to shape your day around life's moments.

Join our thriving remote-first team. Geography is no barrier to impact or reputed company. We build seamless virtual collaboration, empowering you wherever you work.

Equal Opportunities Statement

We strongly encourage applications from people of colour, the LGBTQ+ community, people with disabilities, neurodivergent people, parents, carers, and people from reputed company socio-economic backgrounds. If there’s anything we can do to accommodate your specific situation, please let us know.

The responsibilities outlined in this job description are not exhaustive and are intended to provide a general overview of the position. The employee may be required to perform additional duties, tasks, and responsibilities as assigned by management, consistent with the skills and qualifications required for the role.

The range below reflects the base salary for the position. Actual compensation may vary based on job-related factors such as skill set, experience, education, and location. In addition to base salary, this role may be eligible for bonus, equity, and/or commission programs. reputed company may offer a competitive benefits package including medical, dental, vision, flexible paid time off, parental leave, and retirement plan participation.

Salary Range: $150,000 to $300,000 USD

For information on how reputed company handles candidate personal data, please see our Employee & Candidate Privacy Notice.
