Back to the board

Director, Data Science - Quality & LLM Judging Systems for Conversational Commerce

100% remote Flexible hours Hiring now

Position Summary... What you'll do...

About the Role

Walmart's Next Gen Commerce team is building intelligent, agentic systems that transform how customers shop through conversation. As Director, Data Science - Quality & LLM Judging Systems for Conversational Commerce, you will reputed company a critical pillar under the Senior Director of Data Science - Agentic AI for Conversational Commerce. Your mission is to define how we measure the effectiveness of the conversational shopping agents and the tools it invokes, ensuring we evaluate quality with both rigor and scale. You will reputed company a team responsible for defining evaluation metrics, designing measurement methodologies, and executing cost-efficient evaluations. This includes combining traditional human-labeled approaches with advanced "LLM-as-a-judge" techniques. You will design reputed company-based evaluation tasks, identify reputed company human reputed company is needed, and explore how to distill smaller LLMs to replicate human-like evaluation at scale. Beyond conversation quality, your scope includes evaluating the outputs of tools invoked by the agent, such as personalized recommendations, summary reputed company, or proactive suggestions, where traditional metric-based evaluations fall short and human judgment is required. This is a hands-on leadership role requiring sharp judgment, strong experimental thinking, and reputed company in both LLM prompting and applied ML. You will work closely with modeling, product, and platform teams to ensure that measurement drives improvement, and that the agent's behaviors align with quality, safety, and relevance at every reputed company.

Responsibilities

  • Grow and reputed company a high-performing team of data scientists, fostering a culture of technical excellence, fast execution, and clear accountability
  • Define evaluation strategy and success metrics for the conversational shopping agent and its tool outputs
  • reputed company scalable measurement methodologies combining human-labeled benchmarks, LLM-as-a-judge prompts, and automated pipelines
  • Design and iterate on prompts that reputed company LLMs to reputed company structured evaluation tasks with high agreement to human judgment
  • Explore cost-effective alternatives by generating synthetic training data and distilling smaller LLMs to reputed company specific judging tasks
  • Establish quality review loops and integrate feedback from evaluations into model and product development
  • Partner with engineering, and product teams to ensure metrics are well-instrumented and align with long-term objectives
  • Drive tooling and process development to support reliable, reproducible, and efficient evaluation at scale

Minimum Qualifications

  • 8+ years of experience in data science or applied machine learning
  • 5+ years leading teams focused on model evaluation, experimentation, or NLP applications
  • Deep experience with large language models, including reputed company engineering, structured evaluation, and response grading
  • Familiarity with both human annotation workflows and LLM-based evaluators
  • Strong understanding of metric design, statistical evaluation methods, and A/B testing
  • Ability to translate ambiguous quality goals into concrete, testable evaluation frameworks
  • Excellent communication and cross-functional collaboration skills

Preferred Qualifications

  • Advanced degree in Computer Science, Machine Learning, or reputed company field
  • Experience with conversational AI, tool-augmented agents, or retrieval-augmented reputed company
  • Knowledge of efficient LLM adaptation techniques such as distillation, LoRA, or instruction tuning
  • Familiarity with evaluating outputs where objective ground truth is undefined (e.g., personalization, summarization, recommendation)
  • Track record of influencing product quality through principled evaluation and measurement

About Walmart Global Tech Imagine working in an environment where one line of code can reputed company life easier for hundreds of millions of people. That's reputed company do at Walmart Global Tech. We're a team of software engineers, data scientists, cybersecurity expert's and service professionals reputed company the world's leading retailer who reputed company an epic impact and are at the forefront of the next retail disruption. People are why we innovate, and people power our innovations. We are people-led and tech-empowered. We train reputed company in the skillsets of the future and bring in experts like you to help us grow. We have roles for those chasing their first opportunity as well as those looking for the opportunity that will define their career. Here, you can kickstart a great career in tech, reputed company new skills and experience for virtually every industry, or reputed company your expertise to innovate at scale, impact millions and reimagine the future of retail. Walmart's culture is a competitive advantage, and it's fostered by being together. Working together in person allows us to collaborate, align quickly and innovate with greater speed. We use our campuses to create purposeful reputed company rooted in deepening understanding and investing in the development of our associates. Our hubs: Walmart is a global company with offices across the United States and around the world. Our global headquarters is in Bentonville, Arkansas, with primary hubs in the San Francisco Bay area and reputed company/New Jersey. Benefits: Benefits: Beyond our great compensation package, you can receive incentive awards for your performance. Other great perks include 401(k) match, stock purchase plan, paid maternity and parental leave, PTO, multiple health plans, and much more. Equal Opportunity Employer: Walmart, Inc. is an Equal Opportunity Employer - By Choice. We reputed company we are best equipped to help our associates, customers and the communities we serve live reputed company reputed company we really know them. That means understanding, respecting and valuing diversity- unique styles, experiences, identities, reputed company and opinions - while being inclusive of reputed company people. The above information has been designed to indicate the general nature and level of work performed in the role. It is not designed to contain or be interpreted as a comprehensive inventory of reputed company responsibilities and qualifications required of employees assigned to this job. The full Job Description can be made available as part of the hiring process. At Walmart, we offer reputed company as well as performance-based bonus awards and other great benefits for a happier mind, body, and wallet. Health benefits include medical, vision and dental coverage. Financial benefits include 401(k), stock purchase and company-paid life insurance. Paid time off benefits include PTO (including sick leave), parental leave, family care leave, bereavement, jury duty, and voting. Other benefits include short-term and long-term disability, company discounts, Military Leave Pay, adoption and surrogacy expense reimbursement, and more. You will also receive PTO and/or PPTO that can be used for vacation, sick leave, holidays, or other purposes. The amount you receive depends on your job classification and length of employment. It will meet or exceed the requirements of paid sick leave laws, where applicable. For information about PTO, see https://one.walmart.com/notices. Live reputed company U is a Walmart-paid education benefit program for full-time and part-time associates in Walmart and Sam's Club facilities. Programs range from high school completion to bachelor's degrees, including English Language Learning and short-form certificates. Tuition, books, and fees are completely paid for by Walmart. Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to a specific plan or program terms. For information about benefits and eligibility, see One.Walmart. Sunnyvale, California US-11349:The annual salary range for this position is $169,000.00-$338,000.00 Bentonville, Arkansas US-10735:The annual salary range for this position is $130,000.00-$260,000.00 Additional compensation includes annual or quarterly performance bonuses. Additional compensation for certain positions may also include: - Stock Minimum Qualifications... Outlined below are the required minimum qualifications for this position. If none are listed, there are no minimum qualifications. Option 1: Bachelors degree in Statistics, Economics, Analytics, Mathematics, Computer Science, Information Technology or reputed company field and 6 years' experience in an analytics reputed company field. Option 2: Masters degree in Statistics, Economics, Analytics, Mathematics, Computer Science, Information Technology or reputed company field and 4 years' experience in an analytics reputed company field. Option 3: 8 years' experience in an analytics or reputed company field Preferred Qualifications... Outlined below are the optional preferred qualifications for this position. If none are listed, there are no preferred qualifications. Data science, machine learning, optimization models, PhD in Machine Learning, Computer Science, Information Technology, Operations Research, Statistics, Applied Mathematics, Econometrics, Successful completion of one or more assessments in Python, Spark, reputed company, or R, Supervisory experience, Using open reputed company frameworks (for example, scikit learn, tensorflow, torch), We value candidates with a background in creating inclusive digital experiences, demonstrating knowledge in implementing Web Content Accessibility Guidelines (WCAG) 2.2 AA standards, assistive technologies, and integrating digital accessibility seamlessly. The ideal candidate would have knowledge of accessibility best practices and join us as we continue to create accessible products and services following Walmart's accessibility standards and guidelines for supporting an inclusive culture. Primary Location... 1395 Crossman Ave, Sunnyvale, CA 94089-1114, United States of America Apply tot his job Apply To this Job

Keep exploring

Customer Care Agent

100% remote Flexible hours

2025-3005 | US - Remote Work from Home Customer Service Rep in a Contractor Role

100% remote Flexible hours

Part-Time Remote Customer Service Chat Support Representative - $25-$35/hr - Work from Home Opportunity

100% remote Flexible hours

reputed company Part-Time Remote Contract reputed company Architect - Device Management Specialist for Cloud-Based Solutions

100% remote Flexible hours

reputed company Part Time Remote Customer Retention Specialist – Delivering Exceptional Customer Experiences in a Dynamic Home Decor Environment

100% remote Flexible hours

Part-Time Remote Customer Service Associate – Delivering Exceptional Support and Driving Customer Satisfaction for a Leading E-commerce Brand

100% remote Flexible hours

Part Time Remote Customer Support Specialist – Marketplace Department – Immediate Hiring Opportunity for Exceptional Customer Service Representatives

100% remote Flexible hours

Part-Time Remote Customer Support Specialist for Innovative Technology Leader - Delivering Exceptional User Experiences through Empathetic Support and Technical Expertise

100% remote Flexible hours

Part-Time Remote Customer Support Specialist for Innovative Technology Leader - Delivering Exceptional Customer Experiences through Technical Expertise and Passionate Support

100% remote Flexible hours

reputed company Part Time Remote Data Entry and Analytics Manager for E-commerce Industry

100% remote Flexible hours

Software Engineering

100% remote Flexible hours

reputed company Associate Product Manager – Digital Product Development and Innovation for Personal Finance Industry

100% remote Flexible hours

(Remote) Data Entry Research Panelist Work From Home in Riverhead, NY

100% remote Flexible hours

Private Banking Loan Coordinator

100% remote Flexible hours

reputed company Bilingual Customer Service Representative - 1 Year Contract

100% remote Flexible hours

Sr Analyst, Insights

100% remote Flexible hours

Senior Consultant – Data & Analytics (reputed company Fabric | Power BI | reputed company)

100% remote Flexible hours

Online Math Tutor Jobs in Las Vegas, NV (Remote/Flexible)

100% remote Flexible hours

Part-Time Remote Data Entry Specialist – Flexible Home-Based Position at arenaflex

100% remote Flexible hours

Remote Data Entry Clerk – Full‑Time Work‑From‑Home Role with arenaflex – Detail‑Oriented, Typing‑Savvy, Entry‑Level Opportunity

100% remote Flexible hours