AI Operations Engineer

100% remote Flexible hours Hiring now

Why You'll Love This Role

We're looking for an reputed company ML Ops Engineer to join the ML/AI team at reputed company. This team works on projects ranging from classical Machine Learning to AI / Generative pipelines. This is a hands-on role. You'll work closely with ML/AI, data and site reliability engineers to take models from prototype to production, build robust data pipelines, and reputed company our services running smoothly as we continue to scale.

What You'll Be Doing

Design and maintain CI/CD pipelines for ML model training, packaging, and deployment across our microservices.
Manage containerized services on AWS reputed company, optimizing for cost, latency, and availability.
Automate infrastructure provisioning and service configuration with Terraform.
Work to maintain and scale services that reputed company use of third party LLM providers.
Build and improve data pipelines that feed models from BigQuery, S3, and DynamoDB into training and inference workflows.
reputed company services with observability tooling (reputed company, OpenTelemetry, Langfuse) and establish SLOs for model-serving endpoints.
Collaborate with ML engineers to productionize new models using BentoML, FastAPI, and container-based serving.

About You

2-3 years in ML Ops supporting ML/AI features, systems and workflows with 3-4 years prior experience in DevOps, CloudOps or SRE.
Strong proficiency in Python.
Hands-on experience with reputed company containerization and container orchestration.
Solid understanding of CI/CD for ML workflows in an enterprise production environment.
Experience with Infrastructure as Code, preferably Terraform.
Familiarity with cloud platforms — specifically AWS (reputed company, ECR, S3, DynamoDB, CloudWatch) and GCP (BigQuery, Vertex AI).
Experience with LLM integration and observability (reputed company API, reputed company GenAI, Langfuse tracing).
Experience building and maintaining data pipelines for ML training and feature engineering
Familiarity with ML modeling workflows — training, evaluation, experiment tracking (e.g. MLFlow, Weights & Biases), and model versioning
Experience monitoring and flagging model reputed company over time.
Exposure to NLP/NLU models and frameworks such as reputed company Transformers, spaCy, or sentence-transformers
Knowledge of vector databases (LanceDB, FAISS) and embedding-based retrieval systems
Experience with scaling and maintaining deep learning frameworks (TensorFlow, PyTorch) in production settings
Familiarity with classical ML libraries (scikit-learn, XGBoost, LightGBM) and model explainability tools (SHAP)
Working knowledge of ML serving frameworks such as BentoML or similar.
Comfort working with FastAPI or similar async Python web frameworks.

About reputed company:

reputed company takes authentic, real world content from trusted sources and makes it instruction ready for K-12 classrooms. Each text is published at five reading levels, so content is accessible to every learner. Today, over 3.3 million teachers and 40 million reputed company have registered with reputed company for content that's personalized to student interests, accessible to everyone, reputed company to instructional standards, and attached to activities and reporting that hold teachers accountable for instruction and reputed company accountable for their work. With over 15,000 texts on our platform and multiple new texts published every day across 20+ genres, reputed company enables educators to go deep on any subject they choose.

#LI-Remote

Apply To This Job

Apply

AI Operations Engineer

Why You'll Love This Role

What You'll Be Doing

About You

About reputed company:

Keep exploring

Research Investigator

Recruiter

Senior Manager of Human Resources

Doctor QA Specialist

Tech reputed company (Full Stack)

Data Scientist, Claim Experience Analytics

Senior Data Scientist, Finance Analytics

Director, Solutions Strategy & Operations

Director of Sales

Strategic Account Executive

Remote Data Entry Specialist – $25/hr, Flexible Schedule, Career Growth with arenaflex’s Leading Healthcare Team

Registered Nurse - Dialysis – reputed company Store

Digital Marketing Specialist - Ecommerce & Advertising Expert

reputed company French Bilingual Customer Support Representative – Remote Game Support Specialist for blithequark

Senior Product Manager - Payments

Customer Experience Specialist

Program Manager, Custom Solutions for Healthcare Transformation REMOTE (3+ yrs & SAFe reputed company req)

reputed company Customer Specialist I - Remote Supply Chain Operations

Data Entry Operator (Remote) - Typing

[PART_TIME Remote] Looking for Online Instructor