Back to the board

Data Engineer

100% remote Flexible hours Hiring now

Summary

We’re a small, cross-functional team focused on building AI systems that reason and code. We care deeply about understanding how people interact with these systems and how we can use data to make them safer, smarter, and more useful .

We're looking for a Data Engineer to build and own the pipelines and data infrastructure that power our product and research efforts. Your work will directly support model training, evaluation, product analytics, and safety systems. You’ll partner closely with team members building our coding agents to make sure we’re capturing the right signals and using them well.

If you’re excited about turning messy product data into actionable insights, and building systems that can scale with our research, we’d love to get connected!

Example Projects

• Combine synthetic data generation with human annotation platforms to produce high quality data that advances our product and research roadmap.

• Design and build resilient, scalable pipelines (ETL and ELT) for batch and streaming data.

• Develop and maintain infrastructure to support self-serve analytics, experimentation, and dataset generation. Prototype, evaluate, and make “build vs buy” decisions.

• Help define and improve data modeling practices across the company, including instrumentation standards, dimensional modeling for analytics and feature stores for machine learning (ML).

• Build integrations with ML infrastructure to support training pipelines, inference logging, and model monitoring (MLOps).

• Debug pipeline failures, automate deployment processes, and improve data quality and reusability.

You are

• A strong software engineer with 5+ years of experience, ideally working with large-scale data systems.

• Experienced in designing and maintaining data pipelines and infrastructure, especially for analytics, experimentation, and ML.

• Comfortable with tools for data orchestration (Airflow, Prefect), batch or streaming processing (Spark, Ray, Flink), and event tracking and analytics (Amplitude, PostHog).

• Experienced with cloud-based infrastructure and storage (e.g., S3, GCP, Snowflake, or Redshift), and thoughtful about cost-performance tradeoffs.

• Exposure to MLOps, model serving infrastructure, or ML workflows.

• Pragmatic and principled! You know when to optimize and when to ship.

Compensation and Benefits

Work directly on creating software with human-like intelligence.

Generous compensation, equity, and benefits.

• B

udget for self-improvement: coaching, courses, conferences, etc.

Actively co-create and participate in a positive, intentional team culture.

Spend time learning, reading papers, and deeply understanding prior work.

Frequent team events, dinners, off-sites, and hanging out.

Compensation packages are highly variable based on a variety of factors. If your salary requirements fall outside of the stated range, we still encourage you to apply. The range for this role is $170,000–$350,000 cash, $10,000–$2,000,000 in equity.

How to apply

All submissions are reviewed by a person, so we encourage you to include notes on why you're interested in working with us. If you have any other work that you can showcase (open source code, side projects, etc.), certainly include it! We know that talent comes from many backgrounds, and we aim to build a team with diverse skillsets that spike strongly in different areas.

About us

Imbue builds AI systems that reason and code, enabling AI agents to accomplish larger goals and safely work in the real world. We train our own foundation models optimized for reasoning and prototype agents on top of these models. By using these agents extensively, we gain insights into improving both the capabilities of the underlying models and the interaction design for agents.

We aim to rekindle the dream of the *personal* computer, where computers become truly intelligent tools that empower us, giving us freedom, dignity, and agency to pursue the things we love.

Apply to this Job

Keep exploring

Senior Solutions Engineer

100% remote Flexible hours

Future IDMWORKS Career Opportunities (India)

100% remote Flexible hours

Future IDMWORKS Career Opportunities (Canada)

100% remote Flexible hours

Salesforce for Nonprofits: Solution Architect

100% remote Flexible hours

Sales Development Representative

100% remote Flexible hours

Associate Creative Director (Full-Time, 6-Month Contract)

100% remote Flexible hours

Salesforce For Nonprofits: Business Analyst

100% remote Flexible hours

Sales Manager

100% remote Flexible hours

Senior Solutions Architect (Public Sector)

100% remote Flexible hours

Outbound Sales Director

100% remote Flexible hours

Immediately Require Xfinity Retail Sales Professional - $22.00/hr ($15.00/hr Base Pay plus Targeted Commission) $2,000 Signing Bonus, Uncapped Commission, Day One Benefits, Tuition Reimbursement Courtesy Internet & TV in Manitowoc, WI

100% remote Flexible hours

Remote Data Entry Clerk – Flexible Part-Time Work-From-Home Position | Earn Income From Home

100% remote Flexible hours

Part-Time Remote Data Entry Analyst – Paid Media Analytics & Business Intelligence (arenaflex)

100% remote Flexible hours

Experienced Part-Time Remote Data Entry Clerk and Customer Service Representative - Flexible Work-from-Home Opportunity with Lucrative Side Gigs and Growth Potential

100% remote Flexible hours

Experienced Online Chat Representative – Remote Customer Service Professional

100% remote Flexible hours

Bakery Production Assistant

100% remote Flexible hours

Entry-Level Virtual Assistant and Data Entry Junior Position for Ambitious Individuals with No Prior Experience Required, Offering Remote Work Flexibility, Competitive Salary, and Opportunities for Career Growth and Professional Development

100% remote Flexible hours

Full-stack Developer

100% remote Flexible hours

Educators - Life After Teaching / Remote Marketing Opportunity

100% remote Flexible hours

[Hiring] Customer Service Representative @Conduent State & Local Solutions, Inc

100% remote Flexible hours