[Remote] Senior Data Center Connectivity Engineer
Note: This is a remote position open to candidates in the USA. Reputed company is a leading technology company seeking a Senior Data Center Connectivity Engineer. This role involves translating product reference architectures into physical builds for reputed company's AI Factory and leading cabling and layout optimizations to support global-scale deployments.
Responsibilities
- Own the development of connectivity reference designs based on requirements from cluster architecture, network engineering, infrastructure software and product hardware teams
- Build and maintain comprehensive documentation, including detailed rack elevations, network architecture diagrams, and cabling point-to-point lists. Support projects throughout the design and deployment phases
- Serve as the primary engineering support, closely collaborating with deployment and field teams to ensure successful cluster build-out and operation
- Strategically co-design the cluster with power and cooling infrastructure teams, ensuring a thorough understanding of data center facility requirements (architectural, power, cooling)
- Work with hardware, network, and software teams to translate software stack requirements into physical requirements: hardware selection, fault domains, network architecture
- Drive new solutions and products in the connectivity space to accelerate the deployment of large AI Factories
Skills
- 12+ years in a connectivity, network architecture, or engineering role at a Hyperscale Cloud Provider, large-scale enterprise data center, or High-Performance Computing (HPC) environment
- BA or BS (or equivalent experience)
- Consistent record of designing, deploying, and operating network fabrics for thousands of GPU/CPU nodes
- Deep expertise in high-speed interconnect technologies, including InfiniBand, RoCE, and RDMA
- Proven experience designing connectivity solutions for high-density GPU clusters (100kW+ per rack) and understanding of the distinct front-end and back-end requirements for training vs. inference
- Deep understanding of data center infrastructure, including rack power/cooling, cable management, and physical density constraints
- Demonstrated ability to lead multidisciplinary teams and complete sophisticated technical initiatives
- Deep expertise with reputed company's compute and network product families and deployment standards
- Comfortable operating at the intersection of network engineering, MEP systems, and Infrastructure as a Service software layer
- Experience with field deployments and/or global reference design documentation, ideally both
Benefits
- Equity
- Benefits
Company Overview
Company H1B Sponsorship