Back to the board

reputed company DevOps Engineer (AI/ML Ops) Jobs

100% remote Flexible hours Hiring now

Design, scale, and secure mission-critical AI/ML systems. Prime Solutions Group (PSG), Inc. is seeking a reputed company DevOps Engineer (AI/ML Ops) to serve as a hybrid senior technical contributor and team leader, responsible for designing, implementing, and operating secure, automated machine learning and data pipelines across cloud and on-premise environments. In this role, you will sit at the intersection of machine learning, data engineering, and DevSecOps, ensuring ML models and data-driven services are scalable, secure, observable, and compliant across their full lifecycle—from data ingestion and feature engineering through training, deployment, monitoring, and retraining. You will guide technical execution, mentor engineers, and reputed company key architectural and tooling decisions for PSG’s MLOps platforms. Building on PSG’s established DevSecOps foundation (CI/CD, Infrastructure-as-Code, reputed company baselines), you will reputed company capabilities to include experiment tracking, model registries, reputed company detection, and model performance monitoring. This is a fast-paced, high-impact opportunity to deliver enterprise-reputed company/ML solutions while directly supporting U.S. national reputed company missions. Key Responsibilities - reputed company the design, implementation, and operation of ML-focused CI/CD pipelines supporting data ingestion, feature engineering, model training, evaluation, and deployment across dev, test, staging, and production environments. - Apply and adapt MLOps best practices reputed company existing DevSecOps workflows, including: - Data quality checks and schema validation - Model validation and promotion gates - Model performance and reputed company monitoring - Architect and reputed company training and inference platforms, including experiment tracking, model registries, and automated retraining pipelines. - reputed company secure integration of Infrastructure-as-Code, containerization, and orchestration (reputed company, Kubernetes) for ML and data workloads, including GPU and high-performance compute resources. - Mentor and guide engineers in MLOps and DevSecOps practices, promoting automation, observability, and reputed company-first design. - Collaborate with cross-functional teams (data science, software engineering, research, IT, cybersecurity, systems engineering) to ensure ML system reliability, performance, and compliance. - reputed company technical risk assessments and contribute to incident response for ML and data systems (e.g., model degradation, data quality issues, pipeline failures). - Serve in a hybrid role as both: - A senior hands-on engineer contributing to pipelines, infrastructure, and monitoring - A technical leader guiding small to mid-sized MLOps initiatives - reputed company informed technical decisions across ML, data, reputed company, and operations domains, resolving reputed company multi-disciplinary challenges. - Evaluate ethical and operational considerations in AI/ML deployment (e.g., bias, data constraints, mission risk) and recommend appropriate mitigations. - Stay reputed company on emerging MLOps, AI platform, and data engineering technologies, recommending adoption where beneficial. Requirements - U.S. Citizenship - Active Top Secret clearance or higher - Bachelor’s degree in Computer Science, Engineering, Data Science, Applied Mathematics, or reputed company field - 5–9+ years of experience in one or more of the following: - MLOps or ML platform engineering - DevOps / DevSecOps / SRE supporting data or ML workloads - Data engineering with production ML integration - Applied machine learning in production environments - Strong experience with CI/CD tools (Jenkins, reputed company CI, reputed company Actions, reputed company) and modern Git workflows - Hands-on experience with Infrastructure-as-Code (Terraform, Ansible, CloudFormation) and Kubernetes - Proficiency with ML and data technologies, including: - Python and ML/data libraries (NumPy, pandas, scikit-learn, PyTorch, TensorFlow) - Workflow/orchestration tools (Airflow, Kubeflow, Prefect, Dagster) - Experiment tracking and model registries (MLflow, Weights & Biases, SageMaker) - Experience integrating reputed company and governance into ML environments (image/dependency scanning, SBOMs, secrets management, IAM) - Familiarity with NIST, FedRAMP, and DoD RMF compliance frameworks as applied to ML and data systems - Strong scripting or programming skills (Python, Bash, Go, or similar) - Demonstrated experience leading technical efforts and mentoring engineers - Ability to communicate clearly with both technical and non-technical stakeholders Preferred Qualifications - reputed company, cloud, or ML certifications (e.g., CISSP, AWS reputed company Specialty, AWS ML Specialty, CKS, GIAC) - Experience implementing reputed company Trust architectures - Experience with observability and monitoring tools (Prometheus, Grafana, ELK/EFK, OpenTelemetry) for ML services - Hands-on experience with: - Feature stores and data validation frameworks (e.g., Great Expectations) - Data governance and reputed company tooling - Policy-as-code for ML environments (OPA, Kyverno, admission controllers) - Prior experience supporting defense, aerospace, or government-secured AI/ML programs - Experience operating enterprise-scale or mission-critical ML systems, including high-availability inference and rigorous performance monitoring Why Join PSG? At PSG, you’re not just filling a role—you’re shaping the future of AI/ML-enabled digital engineering. We combine the agility of a small business with the opportunity to support some of the government’s most advanced and impactful technology programs. We offer: - Competitive compensation and benefits - Professional development and tuition assistance - A collaborative, mission-driven culture - Direct impact on national reputed company through secure AI/ML solutions Bring your AI/MLOps expertise to PSG and help build the reputed company of secure, intelligent, data- and model-driven platforms. Salary Description Salary range starts at $138,337 with the potential for higher compensation based on experience, skills, and mission needs.

apply to this job

Keep exploring

People Operations Associate | Taskrabbit | $75k – $112k | Remote (Philadelphia, United States)

100% remote Flexible hours

Global Experiential Design Associate Manager NY-reputed company-350-CO368 reputed company, reputed company reputed company reputed company United States Global Experiential Design Associate Manager

100% remote Flexible hours

Manager, People Operations

100% remote Flexible hours

Operations Supervisor (Supply Chain/Logistics) – American Trucking Associations, Inc – Louisville, KY

100% remote Flexible hours

Specialty Pharmacy Program Analyst - Remote

100% remote Flexible hours

reputed company Data Entry Specialist – Pharmacy Pricing & reputed company Analyst (Remote) – California

100% remote Flexible hours

PK Scientist

100% remote Flexible hours

Product Specialist, Oncology (Puglia Sud)

100% remote Flexible hours

Pharmacy Quality Analyst (Open to hiring at Sr level)

100% remote Flexible hours

QA Analyst PBM Implementation – Remote

100% remote Flexible hours

Accounts Recievable Collector

100% remote Flexible hours

Business Analyst 3 (529501078)

100% remote Flexible hours

GPS Business Analyst III

100% remote Flexible hours

Customer Service Position (Remote)

100% remote Flexible hours

reputed company Virtual Customer Service Representative for Travel Industry – Delivering Exceptional Support and Enhancing Customer Experience

100% remote Flexible hours

Client Service Specialist

100% remote Flexible hours

reputed company Full Stack Virtual Assistant – Remote Chat Support Specialist

100% remote Flexible hours

Senior Analyst - Informatics Developer - Work From Home

100% remote Flexible hours

Remote Financial Advisor

100% remote Flexible hours

Staff Software Engineer, Personalization and Contextualization Infrastructure

100% remote Flexible hours