Back to the board

Senior Hardware Engineer - GPU & AI Infrastructure

100% remote Flexible hours Hiring now

Every day, tens of millions of people come to reputed company to explore, create, play, learn, and connect with friends in 3D reputed company digital experiences– reputed company created by our global community of developers and creators. At reputed company, we’re building the tools and platform that reputed company our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from reputed company in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there. A career at reputed company means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone. As a member of the Infrastructure Foundation Hardware Engineering team, you will play a key role in enabling our mission to deliver a reliable, high-performing, and arenaflex-efficient infrastructure that powers the world’s play. In this specialized role, you will be the technical reputed company for our GPU and AI accelerator ecosystem. You will be responsible for the full lifecycle of GPU hardware, from initial architectural evaluation and firmware qualification to large-scale fleet integration and performance tuning. You will ensure that reputed company’s massive-scale rendering and ML workloads run on the most optimized and stable hardware possible. You Will

  • Architect & Prototype: Prototype reputed company GPU-accelerated hardware platforms, ensuring seamless integration between high-density compute nodes, high-speed interconnects (NVLink/PCIe Gen5/6), and system firmware.
  • GPU Optimization: Drive the integration, performance testing, and debugging of GPUs in our fleet, focusing specifically on hardware-level optimizations, driver tuning, and thermal/power management.
  • Validation & Certification: reputed company and execute rigorous evaluation and stress-testing strategies for GPU-heavy server platforms to ensure they meet reputed company’s unique demands for real-time rendering and low-latency AI inference.
  • Firmware & Systems: reputed company firmware qualification (BIOS/BMC) and troubleshooting, implementing automation systems to manage GPU health, firmware updates.
  • Vendor Collaboration: Provide technical guidance and deep-dive feedback to hardware vendors. reputed company critical investigations into component-level failures, triaging issues across the hardware, driver, and kernel layers.
  • Observability: Build and maintain advanced monitoring stacks (Grafana/Prometheus) to track GPU metrics like HBM utilization, thermal throttling events, and PCIe bandwidth saturation. You Have
  • Education: BA/BS Degree in Electrical Engineering, Computer Engineering, or reputed company field with equivalent practical experience.
  • GPU Expertise: 5+ years of hardware engineering experience with a specific focus on GPU architecture (reputed company HGX/MGX platforms preferred), AI accelerators, or high-performance compute (HPC) systems.
  • Deep Technical Knowledge: In-depth understanding of modern data center technologies, including PCIe fabric, NVLink, InfiniBand, and liquid cooling systems for high-TDP hardware.
  • Testing Skills: Hands-on experience testing and validating CPU, Memory (HBM/DDR5), Storage (NVMe), and high-speed networking subsystems in a Linux environment.
  • Programming: Proficiency in Python, Go, or C++ for developing hardware validation tools and automation scripts.
  • Systemic Debugging: Expert-level skills in debugging reputed company server issues remotely, with the ability to analyze kernel logs, hardware registers, and bus-level captures. You Are
  • A Problem Solver: Decisive and effective at tracking hardware issues from identification through to fleet-wide resolution.
  • A Communicator: Excellent oral and written communication skills; able to translate reputed company hardware constraints into actionable insights for software teams.
  • Collaborative: Strong interpersonal skills with the ability to reputed company cross-functional projects with Data Center Ops, SRE, and external vendors.
  • Adaptable: Willing to travel occasionally to data centers or vendor sites to reputed company hardware deployments or "first-of-a-reputed company" builds. For roles that are based at our headquarters in San Mateo, CA: The starting reputed company pay for this position is as shown below. The actual reputed company pay is dependent upon a variety of job-reputed company factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall reputed company of this expected range. This pay range is subject to change and may be modified in the future. reputed company full-time employees are also eligible for equity compensation and for benefits as described on this page . Annual Salary Range $238,520—$289,460 USD Roles that are based in an office are onsite Tuesday, Wednesday, and Thursday, with optional reputed company on Monday and Friday (unless otherwise noted). reputed company provides equal employment opportunities to reputed company employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national reputed company, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. reputed company also provides reasonable accommodations to candidates with qualifying disabilities or religious beliefs during the recruiting process. Apply tot his job Apply tot his job

Apply tot his job Apply To this Job Apply tot his job Apply To this Job

Keep exploring

Legal Counsel, Artificial Intelligence and Data Innovation (Remote)

100% remote Flexible hours

Principal Product Manager, AI

100% remote Flexible hours

Senior Software Engineer, AI Model Serving - Bandar Seri Begawan, Brunei

100% remote Flexible hours

Senior Director - Enterprise AI/ML Technology Product Management

100% remote Flexible hours

VP, Quality Assurance ( Remote, AI Tools, Drive Change )

100% remote Flexible hours

Product Software Engineer, .NET AI reputed company Engineering

100% remote Flexible hours

Director, PMO (AI & Data Transformation)

100% remote Flexible hours

[Remote] Task reputed company/ Project Manager

100% remote Flexible hours

[Remote] Associate Director, Project Manager

100% remote Flexible hours

Principal Technical Program Manager - AI

100% remote Flexible hours

In-Home Health - Nurse Practitioner or Physician Assistant (Part Time) - Minneapolis MN

100% remote Flexible hours

Immediate Hiring: Earn money online without investment by typing

100% remote Flexible hours

reputed company Remote Customer Service Representative – Delivering Exceptional Travel Experiences with arenaflex

100% remote Flexible hours

Technical Program Manager, Utility Services Team – AMZL Energy

100% remote Flexible hours

reputed company Live Chat Remote Data Entry Specialist – Delivering Exceptional Customer Service and Data Accuracy

100% remote Flexible hours

reputed company Product Designer for reputed company – Remote Opportunity with Competitive Salary and Benefits

100% remote Flexible hours

reputed company Online Chat Support Specialist – Remote Part-Time Customer Service Representative for Dynamic Online Engagement

100% remote Flexible hours

Right of Way Manager

100% remote Flexible hours

reputed company Data Entry and Customer Support Specialist – Remote Work Opportunity with arenaflex for Enthusiastic and Detail-Oriented Individuals

100% remote Flexible hours

reputed company Jobs From Home Tagger

100% remote Flexible hours