Member of Technical Staff, Inference (Bay Area, Remote)

100% remote Flexible hours Hiring now

What You’ll Do

- Build low-latency inference pipelines for on-device deployment, enabling real-time next-token and diffusion-based control loops in robotics
- Design and optimize distributed inference systems on GPU clusters, pushing throughput with large-batch serving and efficient resource utilization
- Implement efficient low-level code (CUDA, Triton, custom kernels) and integrate it seamlessly into high-level frameworks
- Optimize workloads for both throughput (batching, scheduling, quantization) and latency (caching, memory management, graph compilation)
- Develop monitoring and debugging tools to guarantee reliability, determinism, and rapid diagnosis of regressions across both stacks

What You’ll Bring

- Deep experience in distributed systems, ML infrastructure, or high-performance serving (8+ years)
- Production-grade expertise in Python, with strong background in systems languages (C++/Rust/Go)
- Low-level performance mastery: CUDA, Triton, kernel optimization, quantization, memory and compute scheduling
- Proven track record scaling inference workloads in both throughput-oriented cluster environments and latency-critical on-device deployments
- System-level mindset with a history of tuning hardware–software interactions for maximum efficiency, throughput, and responsiveness

Keep exploring

Member of Technical Staff, Training (Bay Area, Remote)

100% remote Flexible hours

Marketing Analyst (Attribution Focus) (Promova)

100% remote Flexible hours

Student and Family Experience Manager (Immediate Opening)

100% remote Flexible hours

Customer Sales Representative (remote work)

100% remote Flexible hours

Account Manager Industrial Markets Region: France - Africa

100% remote Flexible hours

VP of Engineering

100% remote Flexible hours

Member of Technical Staff, Foundation Models (Bay Area)

100% remote Flexible hours

Member of Technical Staff, Data Agent (Bay Area, Remote)

100% remote Flexible hours

Member of Technical Staff, Platform (Bay Area, Remote)

100% remote Flexible hours

Account Manager Industrial Markets Region: Europe - Middle Eas

100% remote Flexible hours

Experienced Part-Time Data Entry Specialist – Remote Opportunity with arenaflex

100% remote Flexible hours

[Remote] Data Annotation Analyst (contract)

100% remote Flexible hours

Senior Content Marketing Writer

100% remote Flexible hours

Working Student, Business Administration Learning Platform – EdTech Scale-Up - 100% remote

100% remote Flexible hours

Part-Time Remote Customer Service Representative – Delivering Exceptional Experiences for arenaflex Customers

100% remote Flexible hours

Pet Caregiver

100% remote Flexible hours

Remote Appointment Setter - Estate Planning Services

100% remote Flexible hours

PE Key Account Coordinator

100% remote Flexible hours

Experienced Customer Service Representative – Remote Opportunity at arenaflex

100% remote Flexible hours

Care Coordinator (June 12th)

100% remote Flexible hours