Back to the board

[Remote] Machine Learning Research Engineer, Agents - Enterprise GenAI

100% remote Flexible hours Hiring now

Note: The job is a remote job and is open to candidates in USA. Scale AI is a leading AI data foundry focused on accelerating the development of AI applications. The Machine Learning Research Engineer will work on applying agent reinforcement learning algorithms to enterprise datasets, creating best-in-class agents that achieve state-of-the-art results.

Responsibilities

  • Train state of the art models, developed both internally and from the community, to deploy to our enterprise customers
  • Research cutting edge algorithms to integrate directly into our training stack
  • Build agents that leverage our proprietary agent-building algorithms to automatically hill climb datasets – including defining highly performant tools, multi-agent systems, and complex rewards

Skills

  • 1-3 years of building with LLMs in a production environment
  • Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc
  • Publications in top conferences such as NEURIPS, ICLR, or ICML within the last two years
  • PhD or Masters in Computer Science or a related field

Benefits

  • Equity based compensation, subject to Board of Director approval
  • Comprehensive health, dental and vision coverage
  • Retirement benefits
  • A learning and development stipend
  • Generous PTO
  • A commuter stipend

Company Overview

  • Scale’s mission is to develop reliable AI systems for the world’s most important decisions. It was founded in 2016, and is headquartered in San Francisco, California, USA, with a workforce of 501-1000 employees. Its website is https://scale.com.
  • Company H1B Sponsorship

  • Scale AI has a track record of offering H1B sponsorships, with 11 in 2026, 81 in 2025, 54 in 2024, 29 in 2023, 17 in 2022, 10 in 2021, 10 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    Keep exploring

    [Remote] Solutions Architect, Data Processing - New College Grad 2026

    100% remote Flexible hours

    Engineer

    100% remote Flexible hours

    Research Engineer/Research Scientist – Model Transparency

    100% remote Flexible hours

    Research Engineer/Research Scientist – Model Transparency

    100% remote Flexible hours

    AWS Cloud Engineer

    100% remote Flexible hours

    [Remote] MTS SDET, Test Infrastructure

    100% remote Flexible hours

    Associate Data Engineer

    100% remote Flexible hours

    Programmer Analyst, Associate

    100% remote Flexible hours

    Software Engineer II, Backend (PMI Integrations)

    100% remote Flexible hours

    [Remote] Software Developer

    100% remote Flexible hours

    Experienced Customer Support Representative – Delivering Exceptional Service at arenaflex

    100% remote Flexible hours

    Senior Security Engineer, AI Model and Application

    100% remote Flexible hours

    [Remote] IT Systems Analyst IV (IT Workday Procure To Pay Support Analyst)

    100% remote Flexible hours

    Creative Retail Graphic Designer - Industrial Color Extended

    100% remote Flexible hours

    Steuerfachkraft (m/w/d) in Heroldsberg mindestens 52.000€ - 100% Remote möglich

    100% remote Flexible hours

    Epic Systems Analyst - Willow Ambulatory

    100% remote Flexible hours

    Appraisal Underwriter

    100% remote Flexible hours

    Experienced Email and Chat Support Professionals Wanted – Remote Opportunities for Career Growth and Flexibility

    100% remote Flexible hours

    Equipment Finance Sales Exec

    100% remote Flexible hours

    Copywriter Needed for Growth Accounting & Advisory Firm (Conversion-Focused)

    100% remote Flexible hours