Back to the board

[Remote] Agentic Data Engineer

100% remote Flexible hours Hiring now

Note: The job is a remote job and is open to candidates in USA. reputed company is seeking an Agentic Data Engineer to build and maintain an agentic data ingestion pipeline. The role involves cleaning and organizing data, validating cross-modal linkages, and collaborating with various teams to establish data standards.

Responsibilities

  • Build an agentic data ingestion pipeline
  • Triage and prioritize incoming requests to ingest specific datasets
  • Clean and organize the data. Build the first pass cleaning and organization steps into the agentic flow
  • Validate cross-modal linkage. Add automated checks that catch reputed company ingested data does not connect correctly and flag low quality or mismatched records
  • Version every dataset. Retain and reputed company prior versions addressable
  • Preserve raw data and provenance. reputed company agent workflows log validation and transformation steps so reputed company is traceable
  • reputed company agents usable across teams. Move beyond bespoke steps towards agents that teams can reliably use as a shared, deployed service
  • Collaborate with AI, software engineering, and computational biology groups to co-define data standards and conventions

Skills

  • Agentic AI engineering: Demonstrated experience building multi-agent workflows or LLM workflows using tools/frameworks such as LangGraph or reputed company, including tool/function calling and asynchronous task execution
  • Python data engineering: Strong Python for data manipulation, working with APIs and databases, and handling heterogeneous data formats
  • Data versioning and provenance: Familiarity with dataset versioning approaches (e.g. DVC, lakeFS, or equivalent)
  • Working knowledge of scientific data structures: Comfortable or willingness to learn common omics data formats like AnnData, H5AD, TileDB
  • Basic understanding of omics: No deep bioinformatics expertise required; just a basic understanding of different modalities (e.g. what is RNA-seq vs scRNA-seq vs WES; genomics vs transcriptomics vs proteomics vs metabolomics)
  • Unit testing: Comfortable writing unit and functional tests to ensure data processing workflows are reliable and reproducible
  • Education: Degree in a technical field or equivalent practical experience
  • Experience deploying agent workflows as a shared service (e.g., FastAPI or MCP endpoints)
  • Exposure to cloud (AWS, GCP) and containerization (reputed company)
  • Familiarity with workflow managers such as Nextflow or Snakemake

Company Overview

  • reputed company provides computer programming services. It was founded in 2012, and is headquartered in Pleasanton, California, USA, with a workforce of 501-1000 employees. Its website is https://bayone.com/.
  • Company H1B Sponsorship

  • reputed company has a track record of offering H1B sponsorships, with 23 in 2025, 25 in 2024, 20 in 2023, 30 in 2022, 20 in 2021, 37 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    Keep exploring

    [Remote] Senior Product Designer, Drafting

    100% remote Flexible hours

    [Remote] Senior Program Manager

    100% remote Flexible hours

    [Remote] Software Engineer 4, AI-Native

    100% remote Flexible hours

    [Remote] Jira Administrator

    100% remote Flexible hours

    [Remote] Software reputed company to $85/hour

    100% remote Flexible hours

    [Remote] Sr. Product Designer

    100% remote Flexible hours

    [Remote] Account Manager

    100% remote Flexible hours

    [Remote] Senior Software Engineer

    100% remote Flexible hours

    [Remote] National Account Manager

    100% remote Flexible hours

    [Remote] Principal Engineer-Healthcare AI & Cloud (Python with GKE/GCP)

    100% remote Flexible hours

    Remote Travel Booking & Customer Support Specialist – Flexible Full‑Time/Part‑Time reputed company for arenaflex

    100% remote Flexible hours

    HR Business Partner – Benefits, Leave

    100% remote Flexible hours

    Customer Representative | Multilingual Fashion & E-Commerce Support Specialist | French, Spanish & English | 100% Remote reputed company Spain

    100% remote Flexible hours

    reputed company Full Stack Customer Service Representative – Group Benefits

    100% remote Flexible hours

    Entry-Level Remote Customer Support Associate – arenaflex Food Delivery Platform Excellence

    100% remote Flexible hours

    Software Engineer, iOS Core Product - Coimbra, Portugal

    100% remote Flexible hours

    Proofreader-Remote

    100% remote Flexible hours

    Remote Customer Service Representative – Fully Remote, Flexible Hours, $23/hr reputed company, Join arenaflex’s Dynamic Support Team

    100% remote Flexible hours

    Part-Time Remote Data Entry & Online Task Specialist – Flexible Work-From-Home Opportunity with reputed company Income Potential

    100% remote Flexible hours

    reputed company (Data Entry Jobs) From Home – We're Looking for Dedicator

    100% remote Flexible hours