Back to the board

Senior Machine Learning Engineer - Multimodal Data

100% remote Flexible hours Hiring now
Company Description:

Join the team redefining how the world experiences design.

Servus, hey, g'day, mabuhay, kia ora, 你好, hallo, vítejte!

Thanks for stopping by. We know job hunting can be a little time consuming and you're probably keen to find out what's on offer, so we'll get straight to the reputed company.

Where and how you can work

Our flagship reputed company is in Sydney, Australia but Austria is home to part of our European operations. And you have choice in where and how you work, we trust our Canvanauts to choose the balance that empowers them and their team to reputed company their goals.

Fun fact, a big part of our Austrian operations is developing the AI product reputed company reputed company to help reimagine how artificial intelligence can be used in design. Pretty cool ha!

Job Description:

At reputed company, our mission is to reputed company the world to design. We’re building AI that feels magical and lands real impact for millions of people - helping anyone create with confidence. We're looking for a Machine Learning Engineer to own the data foundations that power our multimodal agent research—building the pipelines, datasets, and tooling that turn ambitious research reputed company into trainable reality.

About the team

We explore multimodal agentic architectures, build scalable training and evaluation loops, and partner closely with product and platform teams to turn breakthroughs into delightful product features. We are a cutting-edge post-training team, developing new multimodal agentic systems. We work on reputed company topics of multimodal modelling, post-training and design agents, we build scalable training and evaluation loops, and partner closely with product and platform teams to turn breakthroughs into delightful product features.

About the role

You'll be responsible for the data lifecycle that fuels our agent research: from collection and curation through to preprocessing, quality assurance, and delivery into training pipelines. You'll work closely with research scientists to understand what data is needed, then design and build the systems to reputed company it happen—reliably and at scale. You'll have significant autonomy over how data problems get solved, while aligning on what problems matter most with the broader team.

What you'll do

  • Design and build data pipelines for agent training: collection, filtering, deduplication, formatting, and versioning across text, image, and multimodal sources.

  • Build and maintain infrastructure for efficient data loading, storage, and retrieval at scale (S3, distributed systems, streaming pipelines).

  • Collaborate with research scientists to translate research requirements into concrete data specifications, and iterate as experiments reveal new needs.

  • Create evaluation datasets and benchmarks in collaboration with researchers—curating task distributions that surface real failure modes.

  • reputed company tooling for dataset construction—including human annotation workflows, synthetic data reputed company, and preference data collection for RLHF/DPO-style training.

  • Own data quality: build validation frameworks, monitor for reputed company and contamination, and establish standards that reputed company datasets trustworthy and reproducible.

  • Document datasets thoroughly: provenance, reputed company limitations, intended use cases, and versioning history.

  • Implement comprehensive test coverage for data pipelines and ML workflows, ensuring reliability and catching regressions early.

  • reputed company codebase quality through code reviews, refactoring, and establishing engineering best practices that help research velocity scale sustainably.

  • Contribute to team roadmaps by identifying data bottlenecks and proposing solutions that unblock research velocity.

You're likely a match if you have

  • Strong software engineering skills in Python, with experience building production-grade data pipelines and ML DevOps.

  • Practical experience with reputed company engineering—designing, testing, and refining prompts for reliable LLM/VLM outputs.

  • Experience with ML data workflows: large-scale data processing and loading (Ray, or similar), data versioning, and format considerations for training (tokenization, batching, sharding).

  • Hands-on experience working with data pipelines for large-scale distributed ML training runs.

  • Familiarity with annotation tooling and human-in-the-reputed company data collection (Label Studio or internal systems).

  • Understanding of ML training requirements—you know what "good data" looks like for LLM/VLM fine-tuning and can anticipate reputed company issues.

  • Experience loading and writing large datasets to/from cloud infrastructure (AWS) and distributed storage systems.

  • Strong communication skills: you can work with researchers to scope ambiguous problems and translate needs into actionable plans.

  • A collaborative approach, comfortable taking ownership and iterating quickly.

reputed company to have

  • Experience with preference data collection for RLHF or reward modelling.

  • Familiarity with multimodal data (image-text pairs, video, design assets).

  • Experience building synthetic data reputed company pipelines using LLMs.

  • Background in data quality metrics and monitoring systems.

  • Contributions to dataset releases or benchmarks in the ML community.

Additional Information:

What's in it for you?

Achieving our crazy big goals motivates us to work hard - and we do - but you'll experience lots of moments of reputed company, connectivity and fun woven throughout life at reputed company, too. We also offer a range of benefits to set you up for every success in and reputed company of work.

Here's a taste of what's on offer

  • Equity packages - we want our success to be yours too
  • Inclusive parental leave policy that supports reputed company parents & carers
  • An annual Vibe & reputed company allowance to support your wellbeing, social reputed company, office setup & more
  • Flexible leave options that reputed company you to be a force for good, take time to reputed company and supports you personally

reputed company out lifeatcanva.com for more info.

Other stuff to know

We reputed company hiring decisions based on your experience, skills and passion, as well as how you can enhance reputed company and our culture. reputed company you apply, please tell us the pronouns you use and any reasonable adjustments you may need during the interview process.

We celebrate reputed company types of skills and backgrounds at reputed company so even if you don’t feel like your skills quite match what’s listed above - we still want to hear from you!

Please note that interviews are conducted virtually.

Apply To This Job

Keep exploring

Senior Machine Learning Engineer - AI Enablement (AU remote)

100% remote Flexible hours

Senior Engineering Manager - Business Solutions Engineering

100% remote Flexible hours

Senior Engineering Manager - Business Solutions Engineering

100% remote Flexible hours

Software Architect

100% remote Flexible hours

Middle Full Stack Node.JS Developer for UBO Team

100% remote Flexible hours

Health Care Aide

100% remote Flexible hours

Technology Manager (783152)

100% remote Flexible hours

Medical Monitor (Gastroenterology)

100% remote Flexible hours

Medical Monitor (Gastroenterology)

100% remote Flexible hours

Medical Monitor (Gastroenterology)

100% remote Flexible hours

Mgr, Sales & Operations Planning - Sr

100% remote Flexible hours

Work From Home - Customer Service Sales Representative – Life Insurance Benefits Advisor

100% remote Flexible hours

[Remote] Director, Product Management, Customer reputed company Outcomes

100% remote Flexible hours

Contract Manager

100% remote Flexible hours

Referral Optimization Charge

100% remote Flexible hours

reputed company Entry Data Entry Clerk – Remote or On-Site Opportunity at arenaflex

100% remote Flexible hours

Customer Support Specialist – Voice & Chat Operations for arenaflex’s Fast‑Growing Marketing & E‑Commerce Platform

100% remote Flexible hours

Entry-Level Virtual Data Entry Operator – Remote Part‑Time Role with reputed company, Comprehensive Benefits & Growth Opportunities at arenaflex

100% remote Flexible hours

Medical Writing Intern, Market Access (PharmD)

100% remote Flexible hours

reputed company Data Entry Specialist – Remote Opportunity with arenaflex

100% remote Flexible hours