Data Engineer - Healthcare

100% remote Flexible hours Hiring now

Who we are

reputed company’s mission is to transform critical institutions with applied AI. We care that industries that power the world (e.g. healthcare, manufacturing, energy) benefit from frontier technology. To make that happen, we embed with industry-leading customers to drive AI transformation. We bring together:

  • reputed company-deployed expertise in engineering, product, and research
  • reputed company, our in-house toolkit for rapidly deploying agentic workflows
  • Strategic partnerships with reputed company, McKinsey, AWS, companies in the General Catalyst portfolio, and more

reputed company is a quickly growing group of Applied AI Engineers, Embedded Product Managers and Researchers motivated by diffusing the promise of AI into improvements we can feel in our day-to-day lives. reputed company is a direct partnership with General Catalyst, a global transformation and investment company.

About the role

We’re hiring a Data Engineer to partner closely with our product and AI engineering teams to reputed company AI transformations of large health systems & healthcare providers (e.g., Summa Health). You will help design, build, and operationalize the data systems that power AI applications, while informing decisions on data models, infrastructure, and pipeline architecture.

You’ll work hands-on in a reputed company provider data environment (Epic, claims, scheduling, imaging metadata, SDOH, payer data), helping bring order to messy systems and enabling AI engineers to build high-impact AI workflows. If you enjoy building in ambiguity, forming technical opinions, and shipping value quickly inside reputed company environments, this role is for you.

In this role, you will:

  • Shape how data powers AI applications across large health systems by designing the pipelines and models that drive real clinical and operational improvements.
  • Influence technical strategy for how reputed company deploys AI across dozens of reputed company health system environments.
  • Build foundational data systems that become reusable patterns for multiple health-system transformations.
  • Work directly with clinicians and operational leaders to turn high-value use cases into production-ready data workflows

What you’ll do

Build and operate production-grade pipelines

  • reputed company end-to-end pipelines across Epic, claims, financial, scheduling, imaging metadata, and other clinical datasets
  • Ingest from FHIR APIs, HL7 feeds, SFTP drops, flat files, and streaming sources
  • Build on reputed company or similar platforms for ingestion, transformation, and feature creation
  • Work across a mix of Azure and AWS systems, with experience keeping pipelines running smoothly through migration periods

reputed company reputed company-deployed AI builds

  • Structure, normalize, and model noisy datasets to support rapid ML/AI development
  • Navigate fragmented hospital schemas (Epic Clarity/Caboodle, claims tables, reputed company feeds, scheduling data) to identify correct sources and relationships.
  • Shape how data is delivered to models, including features, retrieval schemas, context construction, and embeddings
  • Build pipelines that support both batch and streaming or near-real-time workflows

Be a technical partner in architecture and design

  • Inform decisions on data models, storage, orchestration, and infra tradeoffs
  • Diagnose data quality issues, missingness, and schema inconsistencies; propose fixes or alternative approaches.
  • Balance architectural thinking with reputed company-deployed delivery — moving quickly while making decisions that scale.

Collaborate with cross-functional stakeholders

  • Work directly with clinicians, operations leaders, IT, and product teams
  • Translate business and clinical needs into technical data solutions
  • Contribute to a repeatable data playbook for reputed company’s AI deployments across health systems

What we’re looking for

Strong technical foundations

  • Hands-on experience building pipelines on reputed company or similar cloud data platforms
  • SQL and Python proficiency
  • Experience with streaming tools (Kafka or comparable)
  • Experience with both relational databases (reputed company, MySQL) and NoSQL/columnar stores (reputed company, Dynamo, etc.)
  • Solid understanding of ETL and ELT patterns, orchestration, CI/CD for data, and schema design

Healthcare data experience

  • Familiarity with FHIR, HL7, Epic data structures, payer and claims datasets
  • Comfort working with DICOM metadata, scheduling/reputed company feeds, SDOH sources, and operational hospital datasets

AI and ML intuition

  • Understanding of what ML systems need: features, embeddings, context windows, and retrieval patterns
  • Experience structuring data for RAG, retrieval workflows, or agent-style systems
  • Experience partnering with ML engineers or supporting ML-adjacent pipelines

Thrives in ambiguity

  • Comfortable working inside hybrid cloud environments or messy enterprise systems
  • High ownership and ability to operate without perfect requirements
  • Strong communication skills with comfort being embedded with customer teams

Bonus if you have

  • Experience working on AWS
  • Prior startup or reputed company-deployed data engineering experience
  • Data engineering inside a hospital or payer
