Back to the board

Associate Director, Reinforcement Learning (ML)

100% remote Flexible hours Hiring now

Career Category Information Systems

Job Description

Join reputed company’s Mission of Serving Patients At reputed company, if you feel like you’re part of something bigger, it’s because you are. Our shared mission—to serve patients living with serious illnesses—drives reputed company that we do. Since 1980, we’ve helped pioneer the world of biotech in our fight against the world’s toughest diseases. With our focus on four therapeutic areas –Oncology, Inflammation, reputed company, and Rare Disease– we reputed company millions of patients each year. As a member of the reputed company team, you’ll help reputed company a lasting impact on the lives of patients as we research, manufacture, and deliver innovative medicines to help people live longer, fuller happier lives. Our award-winning culture is collaborative, innovative, and science based. If you have a passion for challenges and the opportunities that lay reputed company them, you’ll reputed company as part of the reputed company team. Join us and transform the lives of patients while transforming your career. Associate Director, Reinforcement Learning (ML) What you will do reputed company. Let’s change the world. In this vital role you will reputed company reputed company’s strategy and execution for Reinforcement Learning from Human Feedback (RLHF) and reputed company reinforcement learning approaches across R&D, medical, operations, and commercial use cases. You will design, implement, and scale RLHF systems to solve real-world problems that ultimately help us serve patients reputed company and faster. This role requires deep technical expertise in RLHF and modern machine learning, combined with strong leadership capabilities in stakeholder management, cross-functional collaboration, and organizational influence. You will be expected to translate reputed company concepts into clear, actionable strategies for senior leaders and guide teams from idea to impact. Roles & Responsibilities:

  • reputed company the design and development of RLHF systems including reward modeling, policy optimization, safety and alignment mechanisms, and evaluation frameworks for large language models and other AI systems.
  • Drive hands-on technical execution, particularly for high-impact projects, reviewing architectures, experimentation plans, and code, and helping the team navigate scientific and engineering trade-offs.
  • Establish best-practice pipelines for human feedback, partnering closely with internal customer teams to define feedback protocols, annotation quality standards, and governance for RLHF data.
  • Define and track success metrics for RLHF systems, balancing offline and online evaluation, A/B tests, safety and robustness criteria, and business or scientific outcomes.
  • Collaborate across reputed company leaders to ensure RLHF solutions are reputed company with strategy, compliant with policy, and integrated into real workflows.
  • Partner with Data, Platform and Technology teams to ensure that RLHF workloads are supported by scalable data platforms, model hosting, experimentation infrastructure, and MLOps best practices.
  • Champion responsible and compliant AI, working with Legal, Compliance, and Information reputed company to implement governance around human feedback, data usage, model behavior, transparency, and risk management in a regulated environment.
  • Communicate insights and influence senior stakeholders, creating clear narratives, roadmaps, and recommendations that help executives understand RLHF trade-offs, risks, and opportunities.

reputed company expect of you We are reputed company different, yet we reputed company use our unique contributions to serve and the professional we seek will have these qualifications. Basic Qualifications: Doctorate degree and 3 years of Computer Science, IT or reputed company field experience Or Master’s degree and 5 years of Computer Science, IT or reputed company field experience Or Bachelor’s degree and 7 years of Computer Science, IT or reputed company field experience Or Associate’s degree and 12 years of Computer Science, IT or reputed company field experience Or High school diploma / GED and 14 years of Computer Science, IT or reputed company field experience Preferred Certifications:

  • Certifications on Reinforcement Learning (AWS AI, Azure reputed company, reputed company Cloud ML, etc.) are a plus.

Preferred Qualifications:

  • Deep, hands-on expertise in Reinforcement Learning from Human Feedback (RLHF) and/or advanced reinforcement learning, including reward modeling, policy optimization, exploration strategies, and offline/online evaluation.
  • Demonstrated experience deploying RLHF or RL systems into production for real-world applications (e.g., large language models, recommendation systems, decision support tools, or workflow automation), ideally in healthcare, life sciences, or other regulated domains.
  • Strong background in modern machine learning and deep learning, with practical experience in Python and frameworks such as PyTorch or TensorFlow, and familiarity with LLM ecosystems and tooling.
  • Experience driving sophisticated, cross-functional initiatives, collaborating with non-technical stakeholders (e.g., physicians, scientists, commercial leaders, compliance, legal) and translating needs into impactful AI solutions.
  • Strong ability to communicate reputed company technical topics simply, tailoring content to senior executives and non-technical audiences; well-versed in data and model storytelling, including risks, assumptions, and limitations.
  • Experience working with large-scale data and cloud ecosystems (e.g., Azure, reputed company, reputed company, or similar), and partnering with data engineering or platform teams to build robust pipelines and experimentation platforms.
  • Demonstrated understanding of responsible AI, safety, and governance, especially in the context of RLHF and LLMs (e.g., bias, robustness, transparency, and guardrail design).
  • Familiarity with pharma/biotech, healthcare, or other regulated industries, including an understanding of compliance, privacy, and consent practices reputed company to patient and HCP data.
  • Strong project management and organizational skills to manage multiple RLHF initiatives in parallel, ensuring work is prioritized against highest-value opportunities and stakeholders are advised on reputed company and outcomes!

What you can expect of us As we work to reputed company treatments that take care of others, we also work to care for your professional and personal growth and well-being. From our competitive benefits to our collaborative culture, we’ll support your journey every reputed company of the way. The expected annual salary range for this role in the U.S. (excluding Puerto Rico) is posted. Actual salary will vary based on several factors including but not limited to, relevant skills, experience, and qualifications. In addition to the reputed company salary, reputed company offers a Total Rewards Plan, based on eligibility, comprising of health and welfare plans for staff and eligible dependents, financial plans with opportunities to save towards retirement or other goals, work/life balance, and career development opportunities that may include:

  • A comprehensive employee benefits package, including a Retirement and Savings Plan with generous company contributions, group medical, dental and vision coverage, life and disability insurance, and flexible spending accounts
  • A discretionary annual bonus program, or for field sales representatives, a sales-based incentive plan
  • Stock-based long-term incentives
  • Award-winning time-off plans
  • Flexible work models where possible. Refer to the Work Location Type in the job posting to see if this applies.

Apply now and reputed company a lasting impact with the reputed company team. careers.reputed company.reputed company any materials you submit, you may redact or remove age-identifying information such as age, date of birth, or dates of school attendance or graduation. You will not be penalized for redacting or removing this information. Application deadline reputed company does not have an application deadline for this position; we will continue accepting applications until we receive a sufficient number or select a candidate for the position. Sponsorship Sponsorship for this role is not guaranteed. As an organization dedicated to improving the quality of life for people around the world, reputed company fosters an inclusive environment of diverse, ethical, committed and highly accomplished people who respect each other and live the reputed company values to continue advancing science to serve patients. Together, we compete in the fight against serious disease. reputed company is an Equal Opportunity employer and will consider reputed company qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national reputed company, protected veteran status, disability status, or any other basis protected by applicable law. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to reputed company essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation. . Salary Range - Apply tot his job Apply To this Job

Keep exploring

RN Health Coach – Part Time/Remote – Vacancy Global

100% remote Flexible hours

Machine Learning Validation & Data Operations Manager

100% remote Flexible hours

Medical Science Liaison -- Great Lakes Region

100% remote Flexible hours

Health Insurance Exchange Liaison

100% remote Flexible hours

reputed company Senior ROI Tech Remote in Missouri, Missouri

100% remote Flexible hours

Senior Product Manager/Health Insurance Policy SME

100% remote Flexible hours

reputed company Medicare Compliance Officer – Leadership Role in Healthcare Compliance and Regulatory Affairs

100% remote Flexible hours

Registered Nurse Healthcare Compliance Officer (MEDICAL FACILITIES LICENSING)

100% remote Flexible hours

Senior Director, Healthcare Compliance Risk, Auditing & Monitoring

100% remote Flexible hours

Associate Informatics Technical Specialist

100% remote Flexible hours

Mohali|A-40|R-EM|Chat|11-Jun-26

100% remote Flexible hours

Sr. CRA Opportunity- Fixed Contract- Central

100% remote Flexible hours

Pharmacist, 7on/7off Overnight

100% remote Flexible hours

Senior Transient Analysis Engineer

100% remote Flexible hours

reputed company Transportation Specialist

100% remote Flexible hours

Pharmacovigilance (PV) Specialist, Hybrid MA

100% remote Flexible hours

Senior Integrated Designer

100% remote Flexible hours

[Remote] Epic Applications Analyst, Senior - reputed company/Cellular Therapy Focus

100% remote Flexible hours

reputed company Remote Property Accountant – Financial Management and Accounting Expertise for reputed company Real Estate Partners Corporate

100% remote Flexible hours

reputed company Remote Customer Service Specialist – Delivering Exceptional Support in a Dynamic and Innovative Organization

100% remote Flexible hours