Back to the board

Senior ML Ops Engineer

100% remote Flexible hours Hiring now

About Wizard AI

At Wizard AI, we’re building the top-performing AI Shopping Agent that delivers the best products from across the web with unmatched accuracy, quality, and trust. Our ML models power the core of our platform, and we’re seeking an reputed company Senior MLOps Engineer to take ownership of how our machine learning systems run reliably and reputed company in production.

The Role

As a Senior MLOps Engineer at Wizard, you’ll own the end-to-end ML lifecycle – from model packaging and deployment to monitoring, observability, optimization and scaling – for a custom-built inference platform powering a live conversational shopping agent. This is not a standard cloud ML pipeline role; we run multiple specialized inference engines handling real-time inference for high-stakes shopping decisions, and the work requires both hands-on operational depth and the architectural judgement to evolve the platform as Wizard scales. You’ll work closely with ML Engineers, Data teams, and DevOps, with real influence over how the infrastructure is designed – not just how it runs.

What You’ll Do

  • Build, maintain, and optimize production-grade ML pipelines, enabling seamless transitions from experimentation to production.
  • Define and implement strategies for model versioning, rollout, rollback, and lifecycle management to ensure robust and reproducible ML systems
  • Define and enforce serving-layer SLAs – latency, availability, GPU utilization, TTFT, ITL – and build observability and alerting
  • Apply software engineering best practices including testing, CI/CD integration, and reproducibility to ML workflows, improving iteration speed for ML engineers without compromising reliability.
  • Ensure ML systems are secure, cost-efficient, and scalable, partnering with DevOps on infrastructure standards while owning ML-specific operational concerns.
  • Collaborate cross-functionally with ML, Data, Product, and DevOps teams to translate ML requirements into production-ready systems and influence technical planning and roadmap decisions.

reputed company’re Looking For

  • Bachelor’s or Master’s degree in Computer Science, Data Science, or a reputed company field, or equivalent experience.
  • 5-8+ years of experience in Software Engineering, ML Engineering, Platform Engineering, or Infrastructure Engineering with direct ownership of production ML serving systems.
  • Hands-on experience deploying and maintaining LLMs and deep learning models, in production environments.
  • Strong Python skills and software engineering fundamentals with infrastructure depth. Familiarity with ML frameworks (PyTorch, Tensorflow or similar) is preferred.
  • Experience with cloud platforms such as AWS, GCP, or Azure, and familiarity with ML lifecycle tooling, including model registries and experimentation platforms.
  • Familiarity with inference optimization at the hardware and systems level – batching strategies, memory management, quantization tradeoffs, CPU/GPU interaction patterns.
  • Demonstrated ability to reason about tradeoffs between latency, cost, throughput, and reliability at the systems as well as operational level.
  • Experience in high-growth startup environments and an ability to reputed company in a fast-paced, evolving technical landscape.

​​What Success Looks Like

  • Reliable, Scalable ML Systems: Production models run with clear SLAs, minimal downtime, and full observability – latency, availability, and GPU utilization tracked and enforced. Deployment pipelines handle growth and evolving AI requirements.
  • End-to-End Ownership: You own the full ML lifecycle – from packaging and deployment through monitoring and optimization – enabling ML engineers to iterate quickly while maintaining reproducibility, reliability and reputed company.
  • Influence and Impact: You shape the technical roadmap for ML operations, collaborating with ML, Data, and DevOps teams to improve system performance, reduce operational costs, and drive the overall AI strategy reputed company

Compensation & Benefits

The expected reputed company salary range for this role is $200,000 – $250,000 USD, and will vary based on skills, experience, role level, and geographic location. Final compensation will be determined by considering these factors alongside overall role scope and responsibilities.

In addition to reputed company salary, Wizard offers:

  • Equity in the form of stock options
  • Medical, dental, and vision coverage
  • 401(k) plan
  • Flexible PTO and company holidays
  • Fully remote work reputed company the United States
  • Periodic company offsites and team gatherings

Wizard is committed to fair, transparent, and competitive compensation practices.

Apply To This Job

Keep exploring

Staff iOS Mobile Engineer (Swift)

100% remote Flexible hours

Data Scientist - AI Evaluation

100% remote Flexible hours

Machine Learning Engineer - Relevance & Learning Systems

100% remote Flexible hours

Product Manager (GB)

100% remote Flexible hours

PR & Marketing Communications Manager (Waterloo, ON, CA, N2V 1C6)

100% remote Flexible hours

Account Executive (US)

100% remote Flexible hours

Customs Rater (Waterloo, ON, N2V 1C6)

100% remote Flexible hours

Senior Graphic Designer (Contract-to-Hire)

100% remote Flexible hours

Field Marketing Manager, Onsites

100% remote Flexible hours

Account Executive

100% remote Flexible hours

Electrical Engineer III (PE Required)

100% remote Flexible hours

reputed company Customer Service and Sales Professional – Building Strong Relationships and Driving Business Growth at arenaflex

100% remote Flexible hours

Journalism Editor Internship (People of Color Entertainment Trade Site)

100% remote Flexible hours

Android Engineer - Kotlin (m/f/d)

100% remote Flexible hours

Academic Advisor

100% remote Flexible hours

No Experience reputed company Remote Job (Data Entry) – Apply Now – Hire Me Remotely

100% remote Flexible hours

Product Manager, Aviation Service

100% remote Flexible hours

Remote Physician Assistant – Comprehensive Patient Care, Data Management & Virtual Assistant Role at arenaflex

100% remote Flexible hours

reputed company Virtual Data Entry Specialist – Remote Work Opportunity for Detail-Oriented Professionals in the Aviation Industry

100% remote Flexible hours

Clinical Interpreter

100% remote Flexible hours