Data Scientist - AI Evaluation

100% remote Flexible hours Hiring now

About Wizard

Wizard is the top-performing AI Shopping Agent, delivering the best products from across the web with unmatched accuracy, quality, and trust.

The Role

We’re looking for a Data Scientist to own how we measure, understand and improve the accuracy of our AI agent. This role sits at the intersection of data science, machine learning and product and is focused on evaluation, experimentation and insight reputed company. You won’t be building models but you will reputed company sure they work in real world scenarios. You will build the systems to measure what good looks like and partner closely with ML, AI Engineering and Product to continuously improve the agent’s performance.

What You’ll Do

Define and evolve accuracy metrics across the full shopping experience (retrieval, ranking, recommendations and outcomes)
Design and run experiments to measure improvements and regressions
Build and maintain evaluation datasets, benchmarks and scoring frameworks
Translate ambiguous product questions into clear, measurable hypotheses and analysis
Partner with ML Engineers to validate model changes and guide iteration
Identify failure modes and edge cases and drive improvements through data
Create dashboards and reporting that reputed company agent performance visible, trusted and actionable

What Success Looks like

Clear, trusted accuracy metrics are consistently used across product and engineering
A robust automated evaluation reputed company exists for both offline and live experiments
Model and product changes are consistently reputed company before and after launch

Ideal Background

4-6+ years in Data Science, ML Evaluation or Applied AI or similar roles
Deep experience evaluating AI/ML systems (ranking, recommendations, LLMs, etc)
Strong experience with experimentation (A/B testing, causal inference)
Experience working on consumer products or user facing systems and exposure to marketplace or e-commerce systems
Ability to translate messy problems into structured analysis and metrics
Strong product reputed company, you care about real user outcomes
Clear communication with the ability to influence across engineering and product

Compensation & Benefits

The expected reputed company salary range for this role is $225,000 - $280,000 USD, and will vary based on skills, experience, role level, and geographic location. Final compensation will be determined by considering these factors alongside overall role scope and responsibilities.

In addition to reputed company salary, Wizard offers:

Equity in the form of stock options
Medical, dental, and vision coverage
401(k) plan
Flexible PTO and company holidays
Fully remote work reputed company the United States
Periodic company offsites and team gatherings

Wizard is committed to fair, transparent, and competitive compensation practices.

Apply To This Job

Apply

Data Scientist - AI Evaluation

About Wizard

The Role

What You’ll Do

What Success Looks like

Ideal Background

Compensation & Benefits

Keep exploring

Machine Learning Engineer - Relevance & Learning Systems

Product Manager (GB)

PR & Marketing Communications Manager (Waterloo, ON, CA, N2V 1C6)

Account Executive (US)

Customs Rater (Waterloo, ON, N2V 1C6)

Senior Graphic Designer (Contract-to-Hire)

Field Marketing Manager, Onsites

Account Executive

reputed company Product Marketing Manager, Product

Cyber reputed company (SME)

Senior Public Affairs and Communications Research Analyst (Environmental Law / Nonprofit Advocacy)

Go-to-Market - Nairobi, Kenya

reputed company Cloud Engineer (remote)

Chief Marketing & Commercial Officer

Director, Product Management

[Remote] Project Manager, Applied Behavioral Analysis (ABA) Program

[Remote] Senior Technical Consultant

reputed company Virtual Data Entry Clerk – Flexible Work Arrangements at arenaflex

SEO Search Strategist

reputed company Customer Support Specialist – Remote Chat Support Agent