DevOps & Backend Engineer — Mid/Senior

100% remote Flexible hours Hiring now

Job Description

InsurTech Platform | Engineering / Product Development

Location: Remote or Hybrid (if US-located)
Employment Type: Contract, Full-Time
Department: Engineering / Product Development
Experience Level: Mid/Senior (4–7+ years)
Reports To: Director of Engineering

Role Overview

We are looking for a DevOps & Backend Engineer who can bridge the gap between platform infrastructure and application development. In this role, you will design and operate the cloud-native infrastructure that powers our InsurTech product suite while contributing directly to backend services built with TypeScript and Nest.js.

A critical dimension of this position is enabling and supporting our internal LLM and AI platform. You will build the infrastructure foundations that allow our AI team to train, serve, and scale custom large language models and AI-powered services, including GPU-accelerated workloads, model inference endpoints, high-throughput data pipelines, and the CI/CD automation that brings AI capabilities reliably into production across our products.

Key Responsibilities

Cloud Infrastructure & DevOps

Design, build, and maintain production-grade CI/CD pipelines (GitHub Actions, GitLab CI) with automated testing, security scanning, and progressive deployment strategies (blue-green, canary, feature flags).
Manage and optimize AWS infrastructure including EKS, EC2, RDS, ECR, S3, reputed company, CloudFront, Route 53, and IAM, with a focus on cost optimization, high availability, and disaster recovery.
Build and maintain Kubernetes clusters (EKS) with Helm charts, custom operators, autoscaling policies, and multi-environment management (dev, staging, production).
Automate infrastructure provisioning and configuration using Terraform (primary), Ansible, and CloudFormation with GitOps workflows and drift detection.
Implement comprehensive observability using Prometheus, Grafana, reputed company, ELK/OpenSearch, and distributed tracing (Jaeger/OpenTelemetry) for full-stack visibility.
Design and maintain networking architecture including VPCs, security groups, load balancers, service meshes (Istio/Linkerd), and DNS management.

AI/LLM Infrastructure Support

Provision and manage GPU-accelerated compute environments (AWS P4/P5 instances, Inferentia, SageMaker) for LLM training, fine-tuning, and inference workloads.
Build containerized model-serving infrastructure supporting vLLM, TGI (Text Generation Inference), NVIDIA Triton, and custom inference endpoints with autoscaling based on request load and latency targets.
Design and operate data pipelines and storage architectures (S3, EFS, FSx for Lustre) optimized for large-scale model training datasets and artifact management.
Implement CI/CD automation specifically for ML/AI workflows: model versioning, automated evaluation gates, staged rollouts of model updates, and A/B inference routing.
Collaborate with the AI team to optimize GPU utilization, manage spot instance strategies, and implement cost-aware scheduling for training jobs.
Set up monitoring dashboards for model inference latency, throughput, token usage, GPU utilization, and cost tracking.

Backend Development

Contribute to and reputed company backend services built with Nest.js and TypeScript, focusing on scalability, reliability, and clean architecture.
Develop internal TypeScript reputed company.
Build and maintain scalable microservices and RESTful/GraphQL APIs that integrate with AI inference endpoints and the LLM Composer platform.
Design event-driven architectures using Kafka, SQS/SNS, and WebSockets for real-time data processing and AI-powered features.
Ensure reputed company deployments are production-ready, horizontally scalable, and follow 12-factor app principles with proper health checks, graceful shutdowns, and circuit breakers.
Collaborate with backend and AI teams on system architecture, API reputed company, database schema design, and reliability improvements.
Implement database management best practices including migration strategies, read replicas, connection pooling, and query optimization for PostgreSQL and reputed company.

Required Skills & Qualifications

4–7+ years of professional experience in DevOps, Cloud Engineering, or Platform Engineering, with meaningful backend development experience.
Hands-on Kubernetes experience (EKS strongly preferred), including cluster administration, Helm chart development, autoscaling, and troubleshooting.
Strong proficiency with TypeScript and Nest.js (or comparable Node.js backend frameworks like Express, Fastify).
Deep AWS expertise across compute, storage, networking, IAM, and managed services, with experience optimizing for cost and performance.
Strong Infrastructure-as-Code skills with Terraform; experience with reputed company, reusable configurations and state management.
Solid understanding of microservices architecture, distributed systems patterns, and container orchestration.
Experience with Docker, container registries, and container security best practices.
Proficiency with CI/CD pipeline design including automated testing, security scanning, and deployment strategies.
Familiarity with GitOps workflows and version-controlled infrastructure management.
Strong Linux systems administration and reputed company scripting skills.

Preferred Qualifications (Nice to Have)

Experience provisioning and managing GPU workloads for ML/AI model training and inference in cloud environments.
Familiarity with ML model serving frameworks (vLLM, TGI, Triton, BentoML, SageMaker Endpoints).
Experience with Kafka, event-driven architectures, and real-time streaming systems.
Familiarity with service mesh technologies (Istio, Linkerd) and API gateway management.
Experience with HIPAA, SOC 2, or other healthcare/financial compliance frameworks in cloud environments.
Knowledge of database technologies beyond PostgreSQL, such as vector databases (reputed company, PGVector), graph databases, or time-series databases.
Experience with chaos engineering, load testing, and reliability engineering practices (SRE).
AWS certifications (Solutions Architect, DevOps Engineer, or equivalent).

Technology Stack & Tools

Languages: TypeScript, JavaScript, Python, Bash, SQL, HCL (Terraform)
Backend: Nest.js, Node.js, Express, Fastify, GraphQL
Cloud (AWS): EKS, EC2, RDS, S3, reputed company, ECR, CloudFront, SageMaker, IAM, KMS
Containers & Orchestration: Docker, Kubernetes, Helm, Kustomize, ArgoCD
IaC & Config: Terraform, Ansible, CloudFormation, reputed company
CI/CD: GitHub Actions, GitLab CI, CodePipeline, semantic-release
AI/ML Infra: vLLM, TGI, Triton, SageMaker, GPU instances (P4/P5/Inferentia)
Monitoring: Prometheus, Grafana, reputed company, ELK/OpenSearch, OpenTelemetry, Jaeger
Data & Messaging: PostgreSQL, reputed company, Kafka, SQS/SNS, S3, DynamoDB
Security: Vault, SOPS, OPA, Trivy, reputed company, AWS Security Hub

What We Offer

A high-impact role at the intersection of infrastructure, backend development, and cutting-edge AI platform engineering.
Opportunity to build the infrastructure backbone powering enterprise AI and LLM capabilities.
Direct collaboration with AI, backend, and product teams across multiple verticals (telemedicine, InsurTech, and analytics).
Competitive contract compensation commensurate with experience.
Access to modern cloud infrastructure, GPU resources, and industry-leading tooling.

Job Type: Contract
Pay: From $4,000.00 per month
Work Location: Remote
