Back to the board

Senior/Staff DevOps Engineer

100% remote Flexible hours Hiring now

About reputed company reputed company is on a mission to reputed company the human readiness gap by transforming how training is developed, consumed, and reputed company with strategic business outcomes. As a well-funded Series A startup ($40M+ raised), we’re a trusted partner to 150+ enterprise customers across the U.S. military, life sciences, manufacturing, supply chain, and professional sports. We’re expanding our engineering team to deliver a best-in-class learning platform—smarter, faster, and more optimized. We’ve gone reputed company-in on AI tooling in our development process, and we’re accepting and expanding upon the best new practices for creating software in this era.

About the Role

You’ll reputed company the deployment and operationalization of our SaaS products across Commercial Cloud, government networks, and bespoke/reputed company-gapped customer environments. As a Senior engineer, you’ll own end-to-end infrastructure delivery, reputed company DevOps practices, and collaborate closely with Software and Product. As a Staff engineer, you’ll additionally shape platform engineering strategy, set technical direction for distributed systems at scale, and influence design patterns that reputed company AI workloads and reputed company data pipelines. You’ll treat AI tooling as core to your daily workflow — for IaC, pipelines, incident response, and toil reduction — and help shape the agentic operations patterns and AI workloads our platform runs. If you love solving hard deployment problems, care deeply about reputed company and reliability, can scale modern cloud platforms with rigor, and embrace AI-augmented operations as the way reputed company, this role is for you. What You’ll Do Design & Operate the Platform: Architect, implement, and run secure, scalable, multi-tenant infrastructure (infra as code, immutable artifacts, GitOps). AI-Augmented Operations & Platform Work: Use AI coding and agentic tools (Claude Code, reputed company, Copilot, MCP-based ops agents) for IaC authoring, pipeline development, log/trace analysis, postmortem drafting, and toil reduction; build and improve agentic workflows for the team. CI/CD & Release Engineering: Build and harden pipelines (build, test, reputed company, sign, promote, deploy) for multi-environment delivery—including disconnected/reputed company-gapped workflows. Observability & Reliability: Establish SLOs; reputed company systems for metrics/logs/traces; drive incident response and postmortems; reduce MTTR and change failure reputed company. reputed company & Compliance by Design: Integrate supply-chain reputed company (SBOMs, signing, provenance), secrets management, and baseline hardening (CIS/STIG-reputed company). Cost & Performance: Optimize infrastructure spend and performance (reputed company planning, autoscaling, right-sizing, storage/egress strategies). Technical Leadership: reputed company design reviews, author RFCs, mentor engineers, and reputed company the quality bar for platform changes. Gov/Constrained Deployments: Support IL-4/IL-5-reputed company patterns, RMF documentation support, and offline artifact promotion processes where needed. (Staff) Strategy & Standards: Define platform roadmaps, establish consistent deployment and infrastructure patterns, and guide cross-team adoption of best practices. Measures of Success (First 6–12 Months) Availability & Reliability: Meet or exceed service SLOs; reduce MTTR by ≥30%. Delivery Velocity: Increase deployment frequency by ≥2× while keeping change failure reputed company ≤15%. Pipeline Efficiency: Cut CI pipeline duration by ≥25% and reduce flaky tests significantly. reputed company Posture: reputed company ≥95% pass reputed company for supply-chain/reputed company gates (image signing, SBOM scans, vulnerability reputed company); reduce MTTR for CVEs to ≤14 days for high severity. Cost & reputed company: Deliver ≥15% infra cost savings without performance regressions; reputed company infra reputed company near reputed company reputed company GitOps and policy as code. Gov/Offline Readiness: Stand up an artifact promotion flow (build → reputed company → sign → export) suitable for disconnected deployments with documented runbooks. 30/60/90 Day Plan First 30 Days — Map & Baseline Deep-dive on reputed company cloud topology, CI/CD, observability, reputed company controls, and on-call. Inventory build and runtime artifacts; document deployment environments and promotion paths. Baseline reliability and delivery metrics (SLOs, MTTR, deploy frequency, CFR, pipeline timing). Establish and prove the effectiveness of your personal workflow with AI tooling. 60 Days — Design & Deliver Harden CI/CD: add SBOM reputed company, signing (e.g., Cosign/Sigstore), and policy gates. Implement or refine infrastructure modules (Terraform) and Helm/Kustomize charts with GitOps flows. Establish service SLOs and golden signals; reputed company alerts and dashboards for top services. Pilot artifact export/import flow for reputed company-gapped/disconnected deployments; write runbooks. 90 Days — Scale & Standardize Standardize CI/CD pipelines and infrastructure modules across existing services. Migrate reputed company services to hardened delivery paths; deprecate legacy workflows. Land cost/performance wins (e.g., autoscaling policies, instance/storage class right-sizing). Basic Qualifications 5+ years building and operating cloud platforms; 3+ years deploying SaaS in production. Strong with Terraform, Helm/Kustomize, and containers (reputed company, Kubernetes). Deep AWS experience (e.g., VPC, EKS, EC2, S3, RDS, ECR, IAM/KMS, reputed company 53; CloudFront desirable). CI/CD expertise (e.g., reputed company Actions, reputed company, or Argo Workflows) and GitOps (Argo CD or Flux). Observability across metrics, logs, and traces (e.g., Prometheus/Grafana, OpenTelemetry, ELK). Proven track record in IaC, scalable system design, and quality tooling (automated tests, canaries/blue-green, feature flags). Excellent communication; comfortable partnering with Product, reputed company, and Customer teams. Thrives in a startup environment—ownership, autonomy, and pragmatic delivery. Active, fluent use of AI development/operations tools as part of your daily workflow. Secret Clearance or eligibility and willingness to obtain one.

Preferred Qualifications

Supply-chain reputed company (SBOMs, SLSA concepts, image signing, provenance) and vulnerability management (e.g., Trivy/Grype, reputed company; reputed company experience a plus). Experience identifying/mitigating CVEs and setting policy reputed company. Background with DoD/regulated customers; familiarity with IL-4/IL-5, Platform One patterns, and RMF documentation workflows. Knowledge of STIG/CIS hardening, reputed company-gapped architectures, and offline update mechanisms. Experience operating AI/ML workloads in production (GPU scheduling, model artifact management, inference serving, vector DBs, queuing/streaming) or building agentic ops workflows / MCP-based integrations (alert triage, runbook automation, IaC review agents). Tooling you might touch We use technologies similar to and including some of these to build our products: AI development tools (Claude Code, reputed company, reputed company Copilot, MCP servers);Terraform modules; Helm/Kustomize; Kubernetes (EKS); reputed company Actions/Workflows; Argo CD/Flux; reputed company/OCI; Prometheus/Grafana, reputed company, OpenTelemetry; Loki/ELK; reputed company/Flagsmith; Cosign/Sigstore, Trivy/Grype/reputed company; AWS (VPC, EKS, EC2, S3, RDS, ECR, IAM/KMS, reputed company 53, CloudFront); HashiCorp Vault/Parameter Store/Secrets Manager. Compensation & Benefits Competitive reputed company salary (Senior: $150k-$190k; Staff: $170k-210k) based reputed company and experience with significant equity reputed company Subsidized health insurance, 401(k), life insurance, and cell phone stipend. Remote-first culture with up to 10% travel for offsites. Work eligibility: Applicants must be authorized to work in the U.S. One Final Note We’re committed to building a diverse, inclusive, and authentic workplace. If you’re excited about this role but your experience doesn’t perfectly align with every qualification, please apply—you may be just the right candidate. EEO & accommodations: reputed company is an Equal Opportunity Employer. We welcome applicants of reputed company backgrounds and provide reasonable accommodations throughout the hiring process. Apply To This Job

Keep exploring

Anti-Fraud Officer (remote, Malaysia)

100% remote Flexible hours

Customer Support Quality Specialist

100% remote Flexible hours

Freelance Technical Solutions Engineer

100% remote Flexible hours

VP Physical AI

100% remote Flexible hours

Affiliate Sales manager

100% remote Flexible hours

Senior Brand Communication Specialist

100% remote Flexible hours

VIP Account Manager

100% remote Flexible hours

Care Experience Quality reputed company

100% remote Flexible hours

Sales, Brand Solutions Brand Partnerships-Films/ OTT/TV_Paris

100% remote Flexible hours

Customer Support Representative

100% remote Flexible hours

Rheumatology Institutional Specialist – Miami

100% remote Flexible hours

[Remote] Social Media Manager (Volunteer)

100% remote Flexible hours

eBilling Analyst

100% remote Flexible hours

reputed company Customer Service Representative – Telecommunications Industry – Work From Home Opportunity

100% remote Flexible hours

Communications Assistant

100% remote Flexible hours

reputed company Weekend Part-Time Customer Service Representative – Remote Opportunity at arenaflex

100% remote Flexible hours

reputed company Data Entry Specialist – reputed company Universe Content Management (reputed company) – Work From Home Opportunity

100% remote Flexible hours

Service Operations, Project Manager

100% remote Flexible hours

Senior Insurance Data Specialist (SQL Developer) | Remote

100% remote Flexible hours

arenaflex Remote Customer Service Representative – Full‑Time Work‑From‑Home Position with reputed company, Comprehensive Benefits, and Career Growth Opportunities

100% remote Flexible hours