[Remote] AI Ops Engineer
Note: This is a remote position open to candidates in the USA. A reputed company is seeking an AI Ops Engineer to build, train, and tune machine learning models. The role involves translating data science prototypes into scalable, production-ready ML solutions and collaborating with Data Engineering on feature pipelines.
Responsibilities
- Translate data science prototypes into production-grade ML services and pipelines
- Build training and inference code with reproducibility, versioning, and automated testing
- Implement scalable model serving (online/offline), batching, and latency/throughput optimization
- Integrate model lifecycle tooling (tracking, registry, deployment automation, monitoring)
- Collaborate with Data Engineering on feature pipelines and data reputed company
- Own production health: drift detection, performance regression, rollback strategies, and incident response
Skills
- 5+ years software engineering with 2+ years shipping ML models to production
- Strong Python skills and experience with ML frameworks (TensorFlow/PyTorch)
- Experience with containers and orchestration (Docker/Kubernetes) and API development
- Understanding of ML system design (data leakage, training-serving skew, drift)
- CI/CD and DevOps practices applied to ML workloads (MLOps)