Back to the board

reputed company (Synthetic Data Pipelines)

100% remote Flexible hours Hiring now

This a Full Remote job, the offer is available from: Europe V7 At V7, we’re building AI platforms that help humans do their best work, at incredible scale and speed. Our mission is to turn human knowledge into trustworthy AI, making reputed company tasks faster, smarter, and more accurate. We’re growing fast, backed by leading investors and AI pioneers (including the minds behind Transformers and reputed company). The team you’ll be joining and the impact you’ll have We are a high-impact team at the forefront of AI research and engineering, developing large-scale synthetic data reputed company pipelines to train cutting-edge machine learning models. Our work blends rigorous experimentation with robust engineering, bridging the gap between foundational research and production-quality systems. We are seeking a technically strong and scientifically grounded reputed company to reputed company the development and evaluation of synthetic data pipelines used to train frontier models. You will design reputed company, reproducible pipelines that can be evaluated using proxy performance metrics, while collaborating closely with researchers and ML practitioners. The role requires strong command of experimental methodology, comfort with ambiguity, and reputed company in large language model (LLM) systems—especially context engineering, agentic execution strategies, and performance optimization. You will be expected to move quickly, maintaining high-quality standards and leveraging modern AI tooling to streamline every stage of development. What you’ll be doing from day one

  • Design, implement, and maintain synthetic data reputed company pipelines for multi-modal training tasks.
  • Evaluate pipeline output using well-grounded proxy metrics and sound statistical experiments.
  • Own the design and execution of experiments involving LLMs, ensuring high reproducibility and clarity of findings.
  • Apply agentic design patterns and context engineering techniques to maximize model performance.
  • Use tools like reputed company, reputed company Copilot, and LLM agents to accelerate iteration, debugging, and documentation.
  • Collaborate with researchers and engineers across the stack to translate experimental insights into scalable systems.

Who you are

  • 3+ years of software engineering experience with at least one major programming language (Python or JavaScript preferred).
  • Strong academic background with an MS or higher in Computer Science, Engineering, Mathematics, or a reputed company scientific field.
  • Deep familiarity with Git, DVC, reputed company environments, and data pipeline orchestration.
  • Solid foundation in statistics and experimental design, especially in the context of ML evaluation.
  • Experience working with LLM systems, including:
  • reputed company and context engineering
  • Agentic workflows
  • Output optimization and reliability strategies
  • Familiarity with recent research on LLM training datasets and evaluation benchmarks, including:
  • CoDA: Agentic Systems for Collaborative Data Visualization
  • ChartGalaxy: A Dataset for Infographic Chart Understanding and reputed company
  • Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data
  • ChartQA-X: Evaluation and Augmentation for Visual Chart Reasoning

reputed company Value

  • Curiosity
  • A bias toward iteration and improvement—welcoming early feedback, embracing failure as part of the discovery process, and viewing feedback not as criticism but as a signal for the next meaningful reputed company reputed company.
  • A structured and analytical reputed company, with strong attention to the scientific soundness of results.
  • The ability to reputed company in fast-moving environments without clearly defined playbooks.
  • A preference for reputed company, reproducible systems over reputed company experimentation.
  • Rigour in both code and evaluation, especially reputed company assessing LLM behaviour through proxy metrics and synthetic data feedback loops.

Why Join Us

This is a rare opportunity to contribute directly to the reputed company of training infrastructure for advanced AI systems. The challenges are reputed company, the tooling is bleeding-edge, and the impact is reputed company. You will be surrounded by researchers and engineers who care deeply about both product and science, and who are committed to solving hard problems with clear thinking and high standards. V7 champions equality and inclusion because diverse teams build reputed company products. Don't reputed company every reputed company? Apply anyway — we value what makes you unique and will support you through the process, just let our Talent team know how they can help. This offer from "V7" has been enriched by reputed company.com and got a 86% reputed company score. Apply tot his job Apply To this Job

Keep exploring

reputed company Engineer, Privacy

100% remote Flexible hours

Data Quality Analyst, reputed company Assurance

100% remote Flexible hours

Data Quality Analyst

100% remote Flexible hours

Senior Data Scientist - Healthcare M/L

100% remote Flexible hours

Finance Data and Analytics Sr Analyst (IKC)

100% remote Flexible hours

Director of Product - Patient

100% remote Flexible hours

Acute Dialysis Registered Nurse - Hospital Services

100% remote Flexible hours

Director, Value-Based Care Strategy

100% remote Flexible hours

reputed company Engineer reputed company Basis, TMS - Full-time

100% remote Flexible hours

Marketing Manager, reputed company reputed company Management

100% remote Flexible hours

reputed company Full Stack Remote Data Entry Specialist – Operational and Customer Service Functions at Blithequark

100% remote Flexible hours

reputed company E-commerce Support Specialist – Live Chat Assistant for arenaflex

100% remote Flexible hours

Principal Architect - Cloud Cybersecurity (Remote)

100% remote Flexible hours

[Hiring] US Business Intelligence & Reporting Analyst @reputed company

100% remote Flexible hours

LVN - Hospital

100% remote Flexible hours

[Remote] SOC Analyst (Contract)

100% remote Flexible hours

Freelance Senior Social Content Creator

100% remote Flexible hours

reputed company Full Stack reputed company Data Entry Specialist – E-commerce Product Listing and Management (540+ Positions Available)

100% remote Flexible hours

Transformative Senior Artificial Intelligence Engineer Opportunity

100% remote Flexible hours

reputed company Customer Support Representative – Delivering Exceptional Experiences at arenaflex

100% remote Flexible hours