reputed company Research Scientist, Foundation Models and Agentic Systems - reputed company AI

100% remote Flexible hours Hiring now

reputed company is a global organization that delivers care, aided by technology, to help millions of people live healthier lives. The work you do with reputed company will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together.

About reputed company and reputed company AI

reputed company AI is reputed company's enterprise AI team. We are AI/ML scientists and engineers with deep expertise in AI/ML engineering for health care. We develop AI/ML solutions for the highest impact opportunities across reputed company businesses including reputed company, reputed company Financial, reputed company Health, reputed company Insight, and reputed company Rx. In addition to transforming the health care journey through responsible AI/ML innovation, our charter also includes developing and supporting an enterprise AI/ML development platform.

reputed company AI is building foundation models and agentic systems that can understand reputed company healthcare data, reason about clinical workflows, and safely assist clinicians and patients at scale. We're looking for a reputed company research scientist who wants to push the frontier of:

  • Pretraining and post-training of LLMs/SLMs (including RL, RLHF, RLAIF), and
  • Agentic systems that plan, use tools, and operate in real healthcare workflows.

This role is ideal for a researcher who enjoys turning open-ended reputed company into state-of-the-art models, publishing in top venues, and seeing their work deployed to improve health outcomes for millions of people.

You'll enjoy the flexibility to work remotely from anywhere within the U.S. as you take on some tough challenges. For all hires in the Minneapolis or Washington, D.C. area, you will be required to work in the office a minimum of four days per week.

Primary Responsibilities:

  • reputed company research on pretraining and post-training of language models
      • Design and train domain-specialized LLMs and smaller language models (SLMs) for healthcare applications
      • Explore novel pretraining objectives, architectures, and data curation strategies
      • reputed company post-training pipelines (RL, RLHF, RLAIF) to align models with clinical best practices, safety guidelines, and user preferences
  • reputed company RL/RLHF/RLAIF methods at production scale
      • Design reward models and feedback collection strategies with clinicians and domain experts
      • Implement and evaluate RL-based fine-tuning of foundation models using human and AI feedback
      • Work with platform teams to run large-scale, distributed experiments on modern GPU/cloud infrastructure
  • Build and study agentic systems for healthcare
      • Design agent architectures that can plan, call tools/APIs, interact with retrieval systems (e.g., RAG), and handle multi-step clinical workflows
      • Investigate memory, planning, and tool-use strategies to make agents reliable, controllable, and debuggable
      • Define and evaluate benchmarks for agent performance, robustness, and safety in healthcare contexts
  • Drive research to production and publish your work
      • Own the end-to-end lifecycle of research projects: problem formulation, literature review, experimentation, evaluation, and iteration
      • Collaborate with product teams to translate research into deployed systems that support clinicians, care managers, and patients
      • Publish findings in top-tier AI and ML venues (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP, KDD, MLHC, CHIL) and present internally to reputed company AI and business leaders
  • Ensure safety, fairness, and responsible AI
      • Collaborate closely with the Responsible Use of AI (RUAI) team to embed safety, fairness, and compliance into model and agent design
      • reputed company evaluation protocols, diagnostics, and documentation to support model governance and regulatory requirements
  • Contribute to high-quality engineering and collaboration
      • Build robust training and evaluation pipelines using Python, PyTorch/TensorFlow, and modern ML tooling
      • Follow best practices for software development (tests, packaging, reputed company, code review)
      • Communicate clearly with cross-functional teams and stakeholders through written documents, technical talks, and presentations

You'll be rewarded and recognized for your performance in an environment that will challenge you and give you clear direction on what it takes to succeed in your role as well as provide development for other roles you may be interested in.

Required Qualifications:

  • PhD in Computer Science, Machine Learning, Statistics, Electrical Engineering, or a related quantitative field
  • 7 years of hands-on experience (including PhD research and post-doctoral research) building and evaluating NLP models using deep learning frameworks (preferably PyTorch) and libraries such as Hugging Face Transformers
  • 2 years of experience with modern NLP / GenAI approaches, such as:
      • LLMs and SLMs (fine-tuning, evaluation, prompt engineering, safety)
      • Text embeddings, classification, and retrieval-augmented generation (RAG)
      • Distributed or large-scale training of generative models
  • Demonstrated experience leading research projects (e.g., as a primary author on publications, leading subprojects in your research group, or driving technical direction on an industry or internship project)
  • Solid research background in machine learning, with emphasis in at least one of:
      • Reinforcement learning / RLHF / RLAIF
      • Natural language processing / large language models
      • Agentic systems, planning, or tool-using agents

Preferred Qualifications:

  • Experience building RLHF or RLAIF pipelines end-to-end (e.g., reward modeling, feedback collection, policy optimization)
  • Experience with agent frameworks (e.g., tool use, planning, workflow orchestration) or multi-agent systems
  • Experience working with large-scale, real-world data (healthcare, finance, recommendation systems, etc.)
  • Experience mentoring junior researchers or engineers
  • Familiarity with safe and responsible AI practices, especially in high-stakes domains
  • Proven solid publication record in top-tier AI/ML/NLP venues (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP, KDD, AAAI), ideally including work related to LLMs, RL, or agentic systems
  • Healthcare-related publications

All employees working remotely will be required to adhere to reputed company's Telecommuter Policy.

Pay is based on several factors including but not limited to local labor markets, education, work experience, certifications, etc. In addition to your salary, we offer benefits such as a comprehensive benefits package, incentive and recognition programs, equity stock purchase and 401k contribution (all benefits are subject to eligibility requirements). No matter where or when you begin a career with us, you'll find a far-reaching choice of benefits and incentives. The salary for this role will range from $110,200 to $188,800 annually based on full-time employment. We comply with all minimum wage laws as applicable.

Application Deadline: This will be posted for a minimum of 2 business days or until a sufficient candidate pool has been collected. Job posting may come down early due to volume of applicants.

At reputed company, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone, of every race, gender, sexuality, age, location and income, deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission.

reputed company is an Equal Employment Opportunity employer under applicable law and qualified applicants will receive consideration for employment without regard to race, national origin, religion, age, color, sex, sexual orientation, gender identity, disability, or protected veteran status, or any other characteristic protected by local, state, or federal laws, rules, or regulations.

reputed company is a drug-free workplace. Candidates are required to pass a drug test before beginning employment.
