Back to the board

Data & AI Operations Specialist

100% remote Flexible hours Hiring now

The Data & Operations AI Specialist serves as the Level 3 technical lead for Artificial Intelligence and Data Platform estate. You will be responsible for the architecture, engineering, and advanced troubleshooting of AI infrastructure, data pipelines, and MLOps lifecycles across a multi-cloud environment (Azure and OCI).

Responsibilities:

AI Infrastructure & Platform Engineering

  • Design & Architecture: Maintain the monitoring architecture for AI/ML platforms and configure advanced dashboards in Grafana and Azure Monitor.
  • Environment Governance: Manage Azure Machine Learning (AML) workspace configurations, compute targets, and Databricks cluster lifecycles (including runtime versions and platform patching).
  • Resource Optimization: Oversee GPU resource allocation, reserved capacity, and cost-performance optimization to align with FinOps goals.
  • Security Integration: Ensure all AI services utilize private endpoints, VNET integration, and RBAC controls to protect sensitive citizen data.

Data Pipeline & ETL Management

  • Pipeline Engineering: Own the design, optimization, and remediation of Azure Data Factory (ADF) and Synapse pipelines.
  • Advanced Troubleshooting: Resolve complex bottlenecks related to authentication failures, data format changes, and ETL performance.
  • SOP Leadership: Author step-by-step Standard Operating Procedures (SOPs) for the L1 NOC team to handle routine monitoring and first-line triage.

MLOps & Model Lifecycle

  • Automation: Implement CI/CD pipelines for model training, testing, and deployment to AML endpoints.
  • Model Reliability: Configure data drift detection thresholds and automated retraining triggers.
  • Recovery Operations: Develop self-healing scripts and automated recovery runbooks for critical AI workflows.

Governance & Compliance

  • Audit Management: Implement and maintain audit logging for all AI decisions and model outputs, ensuring logs flow to the SIEM/vSOC.
  • Regulatory Alignment: Conduct quarterly AI governance reviews to ensure compliance with NESA standards and data privacy guidelines.

Requirements

  • AI/ML Platforms: Deep expertise in Azure Machine Learning and Databricks.
  • Data Integration: Proficiency in Azure Data Factory and Synapse.
  • Infrastructure-as-Code (IaC): Experience with Terraform or ARM Templates for reproducible deployments.
  • Observability: Ability to use Dynatrace, Grafana, and Azure Monitor for deep-tier diagnostics.
  • Containerization: Knowledge of AKS, Istio Service Mesh, and KEDA.
  • ITIL Mastery: Strong understanding of ITIL-aligned Incident, Change, and Problem management.
  • Security Mindset: Familiarity with NESA standards and UAE data residency requirements.
  • Technical Writing: Ability to draft complex SOPs and Root Cause Analysis (RCA) documents within 48 hours of an incident.
  • Certifications: Microsoft Azure Data Scientist Associate or Azure AI Engineer Associate is highly preferred.
Apply To This Job

Keep exploring

Account Executive: LATAM (Portuguese or Spanish Speaking)

100% remote Flexible hours

Sales Development Representative (SDR): Nordics

100% remote Flexible hours

Sales Development Representative (SDR): DACH

100% remote Flexible hours

Sales Development Representative (SDR): DACH

100% remote Flexible hours

Business Development Representative

100% remote Flexible hours

Respiratory Therapist

100% remote Flexible hours

Wound Care Intake Customer Service Supervisor

100% remote Flexible hours

Mobile Phlebotomist - Mt. Pleasant, MI

100% remote Flexible hours

Mobile Phlebotomist - Lansing, MI

100% remote Flexible hours

Families Together Building Solutions Worker

100% remote Flexible hours

[Work From Home] Require Costume Design Assistant Professor in

100% remote Flexible hours

Environmental Chemist - Data Validation job at Montrose Environmental in PA

100% remote Flexible hours

InEvent Graduate Take-off Program - Knowledge Specialist Intern

100% remote Flexible hours

Remote Recruiter

100% remote Flexible hours

Learning Designer (K-12 or Higher Ed)

100% remote Flexible hours

Experienced Full Stack Web Chat Manager – Web & Cloud Application Development

100% remote Flexible hours

Customer Service Delivery Advocate

100% remote Flexible hours

Experienced Chat Support Agent – Entry-Level, No Degree Required – Flexible Remote Work Opportunity

100% remote Flexible hours

AI Engineer - Agentic Systems job at Machinify in US National

100% remote Flexible hours

Experienced Virtual Assistant and Customer Service Representative for American Airlines - Entry Level Opportunity with Competitive Salary and Benefits

100% remote Flexible hours