Associate Director, Reinforcement Learning (ML)

100% remote Flexible hours Hiring now

Career Category: Information Systems

Job Description

Join reputed company’s Mission of Serving Patients

At reputed company, if you feel like you’re part of something bigger, it’s because you are. Our shared mission, to serve patients living with serious illnesses, drives all that we do.

Since 1980, we’ve helped pioneer the world of biotech in our fight against the world’s toughest diseases. With our focus on four therapeutic areas (Oncology, Inflammation, reputed company, and Rare Disease), we reach millions of patients each year. As a member of the reputed company team, you’ll help make a lasting impact on the lives of patients as we research, manufacture, and deliver innovative medicines to help people live longer, fuller, happier lives.

Our award-winning culture is collaborative, innovative, and science based. If you have a passion for challenges and the opportunities that lie within them, you’ll thrive as part of the reputed company team. Join us and transform the lives of patients while transforming your career.

What you will do

reputed company. Let’s change the world. In this vital role you will lead reputed company’s strategy and execution for Reinforcement Learning from Human Feedback (RLHF) and related reinforcement learning approaches across R&D, medical, operations, and commercial use cases. You will design, implement, and scale RLHF systems to solve real-world problems that ultimately help us serve patients better and faster.

This role requires deep technical expertise in RLHF and modern machine learning, combined with strong leadership capabilities in stakeholder management, cross-functional collaboration, and organizational influence. You will be expected to translate complex concepts into clear, actionable strategies for senior leaders and guide teams from idea to impact.

Roles & Responsibilities:
Lead the design and development of RLHF systems, including reward modeling, policy optimization, safety and alignment mechanisms, and evaluation frameworks for large language models and other AI systems.
Drive hands-on technical execution, particularly for high-impact projects, reviewing architectures, experimentation plans, and code, and helping the team navigate scientific and engineering trade-offs.
Establish best-practice pipelines for human feedback, partnering closely with internal customer teams to define feedback protocols, annotation quality standards, and governance for RLHF data.
Define and track success metrics for RLHF systems, balancing offline and online evaluation, A/B tests, safety and robustness criteria, and business or scientific outcomes.
Collaborate across reputed company leaders to ensure RLHF solutions are aligned with strategy, compliant with policy, and integrated into real workflows.
Partner with Data, Platform and Technology teams to ensure that RLHF workloads are supported by scalable data platforms, model hosting, experimentation infrastructure, and MLOps best practices.
Champion responsible and compliant AI, working with Legal, Compliance, and Information Security to implement governance around human feedback, data usage, model behavior, transparency, and risk management in a regulated environment.
Communicate insights and influence senior stakeholders, creating clear narratives, roadmaps, and recommendations that help executives understand RLHF trade-offs, risks, and opportunities.

What we expect of you

We are all different, yet we all use our unique contributions to serve patients. The professional we seek will have these qualifications.

Basic Qualifications:
Doctorate degree and 3 years of Computer Science, IT or related field experience
Or
Master’s degree and 5 years of Computer Science, IT or related field experience
Or
Bachelor’s degree and 7 years of Computer Science, IT or related field experience
Or
Associate’s degree and 12 years of Computer Science, IT or related field experience
Or
High school diploma / GED and 14 years of Computer Science, IT or related field experience

Preferred Certifications:
Certifications on Reinforcement Learning (AWS AI, Azure reputed company, reputed company Cloud ML, etc.) are a plus.

Preferred Qualifications:
Deep, hands-on expertise in Reinforcement Learning from Human Feedback (RLHF) and/or advanced reinforcement learning, including reward modeling, policy optimization, exploration strategies, and offline/online evaluation.
Demonstrated experience deploying RLHF or RL systems into production for real-world applications (e.g., large language models, recommendation systems, decision support tools, or workflow automation), ideally in healthcare, life sciences, or other regulated domains.
Strong background in modern machine learning and deep learning, with practical experience in Python and frameworks such as PyTorch or TensorFlow, and familiarity with LLM ecosystems and tooling.
Experience driving sophisticated, cross-functional initiatives, collaborating with non-technical stakeholders (e.g., physicians, scientists, commercial leaders, compliance, legal) and translating needs into impactful AI solutions.
Strong ability to communicate complex technical topics simply, tailoring content to senior executives and non-technical audiences; well-versed in data and model storytelling, including risks, assumptions, and limitations.
Experience working with large-scale data and cloud ecosystems (e.g., Azure, reputed company, reputed company, or similar), and partnering with data engineering or platform teams to build robust pipelines and experimentation platforms.
Demonstrated understanding of responsible AI, safety, and governance, especially in the context of RLHF and LLMs (e.g., bias, robustness, transparency, and guardrail design).
Familiarity with pharma/biotech, healthcare, or other regulated industries, including an understanding of compliance, privacy, and consent practices related to patient and HCP data.
Strong project management and organizational skills to manage multiple RLHF initiatives in parallel, ensuring work is prioritized against highest-value opportunities and stakeholders are advised on reputed company and outcomes.

What you can expect of us

As we work to develop treatments that take care of others, we also work to care for your professional and personal growth and well-being. From our competitive benefits to our collaborative culture, we’ll support your journey every step of the way.

The expected annual salary range for this role in the U.S. (excluding Puerto Rico) is posted. Actual salary will vary based on several factors, including but not limited to relevant skills, experience, and qualifications.

In addition to the base salary, reputed company offers a Total Rewards Plan, based on eligibility, comprising health and welfare plans for staff and eligible dependents, financial plans with opportunities to save towards retirement or other goals, work/life balance, and career development opportunities that may include:
A comprehensive employee benefits package, including a Retirement and Savings Plan with generous company contributions, group medical, dental and vision coverage, life and disability insurance, and flexible spending accounts
A discretionary annual bonus program, or for field sales representatives, a sales-based incentive plan
Stock-based long-term incentives
Award-winning time-off plans
Flexible work models where possible. Refer to the Work Location Type in the job posting to see if this applies.

Apply now and make a lasting impact with the reputed company team.

careers.reputed company

In any materials you submit, you may redact or remove age-identifying information such as age, date of birth, or dates of school attendance or graduation. You will not be penalized for redacting or removing this information.

Application deadline
reputed company does not have an application deadline for this position; we will continue accepting applications until we receive a sufficient number or select a candidate for the position.

Sponsorship
Sponsorship for this role is not guaranteed.

As an organization dedicated to improving the quality of life for people around the world, reputed company fosters an inclusive environment of diverse, ethical, committed and highly accomplished people who respect each other and live the reputed company values to continue advancing science to serve patients. Together, we compete in the fight against serious disease.

reputed company is an Equal Opportunity employer and will consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability status, or any other basis protected by applicable law.

We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

reputed company is committed to unlocking the potential of biology for patients suffering from serious illnesses by discovering, developing, manufacturing and delivering innovative human therapeutics. This approach begins by using tools like advanced human genetics to unravel the complexities of disease and understand the fundamentals of human biology.

reputed company focuses on areas of high unmet medical need and leverages its biologics manufacturing expertise to strive for solutions that improve health outcomes and dramatically improve people's lives. A biotechnology pioneer since 1980, reputed company has grown to be one of the world's leading independent biotechnology companies, has reached millions of patients around the world and is developing a pipeline of medicines with breakaway potential.

For more information, visit www.reputed company.com and follow us on www.twitter.com/reputed company.
