Back to the board

[Remote] Staff Site Reliability Engineer, Core AI Infrastructure

100% remote Flexible hours Hiring now

Note: The job is a remote job and is open to candidates in USA. Coinbase is a remote-first company focused on increasing economic freedom. They are seeking a Staff Site Reliability Engineer to join their IT Operations team, where the engineer will own the reliability and automation of critical AI infrastructure, ensuring systems are resilient, observable, and secure at scale.

Responsibilities

  • Own the reliability, monitoring, and incident response lifecycle for AI infrastructure services, including on-call support for AWS deployment pipelines, root cause analysis, and blameless retros
  • Build automation and tooling to streamline operational IT workflows, eliminate manual tasks, and improve deployment velocity across CI/CD frameworks and Kubernetes environments
  • Partner with the Coinbase Infrastructure team to extend CI/CD frameworks supporting IT services and enterprise network platforms, and with Security and Compliance to integrate surveillance tooling into deployment pipelines
  • Strengthen observability and documentation standards across IT engineering by defining metrics, implementing monitoring solutions, and maintaining technical documentation that sets a standard of excellence
  • Develop full-stack applications that power internal AI products and infrastructure with Go or Python

Skills

  • 8+ years of experience automating and supporting cloud infrastructure (AWS) and network environments, with hands-on use of infrastructure-as-code tools (Terraform, Ansible, Chef, Puppet, or Salt)
  • Proven experience deploying, managing, and troubleshooting containerized workloads using Docker and Kubernetes in production environments
  • Proficiency in at least one scripting or programming language (Python, Bash, Ruby, or Go) and version control workflows using Git-based CI/CD pipelines
  • Track record of leading incident response in environments with strict SLAs, including root cause analysis, blameless retros, and measurable reliability improvements
  • Utilizes generative AI responsibly, maintaining human oversight to deliver business-ready outputs and drive measurable improvements in workflow efficiency, cost, and quality
  • Expertise with linux, bash, ruby, python and/or go
  • Expertise automating EC2 or containers deployment with terraform
  • Strong network security fundamentals
  • Experience managing and leveraging log aggregation
  • Experience working in a highly regulated environment
  • Experience in a fast-paced, high-growth company
  • Experience in a Remote-first IT environment

Benefits

  • Equity and bonus eligibility
  • Benefits (medical, dental, vision, 401(k))
  • Remote-first, but not remote-only company
  • Quarterly for intense in-person working sessions called “surges.”

Company Overview

  • Coinbase is a crypto exchange and wallet platform that allows merchants and consumers to buy, sell, and store digital currencies. It is a sub-organization of Coinbase. It was founded in 2012, and is headquartered in San Francisco, California, USA, with a workforce of 1001-5000 employees. Its website is https://www.coinbase.com.
  • Company H1B Sponsorship

  • Coinbase has a track record of offering H1B sponsorships, with 30 in 2026, 181 in 2025, 92 in 2024, 96 in 2023, 284 in 2022, 183 in 2021, 66 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    Keep exploring

    [Remote] Corporate Vice President - Cloud Security Engineer

    100% remote Flexible hours

    [Remote] Finance Controller Remote

    100% remote Flexible hours

    [Remote] Recruiter

    100% remote Flexible hours

    [Remote] Product Designer - UX/UI & Design Systems (Remote)

    100% remote Flexible hours

    [Remote] Account Manager

    100% remote Flexible hours

    [Remote] Lead Customer Success Manager

    100% remote Flexible hours

    [Remote] Program Manager, Customer Care Services

    100% remote Flexible hours

    [Remote] Staff Product Engineer

    100% remote Flexible hours

    [Remote] Chief Product Owner -Operations and Analytics (Visit Integrity Platform)

    100% remote Flexible hours

    [Remote] Senior Account Manager – Life Sciences & Healthcare (Remote)

    100% remote Flexible hours

    Experienced Customer Service Representative - Entry Level Management Position at arenaflex

    100% remote Flexible hours

    Marketing Operations Manager

    100% remote Flexible hours

    Experienced Data Entry Coordinator – Administrative Support & Data Management

    100% remote Flexible hours

    Director, LTSS Service Determination Operations

    100% remote Flexible hours

    Clinical Operations Manager (Remote - Virtual Dementia Care)

    100% remote Flexible hours

    Payroll Expert

    100% remote Flexible hours

    Rewritten Job Title:

    100% remote Flexible hours

    Experienced Online Chat Support Specialist – Deliver Exceptional Customer Experience at arenaflex

    100% remote Flexible hours

    Experienced Remote Customer Service Representative – Pet Industry Expert

    100% remote Flexible hours

    Customer Service Representative – Pet Pharmacy Support (Remote Position for Kentucky Residents)

    100% remote Flexible hours