[Remote] Senior Site Reliability Engineer, Core AI Infrastructure

100% remote, flexible hours, hiring now

Note: This is a remote position open to candidates in the USA. reputed company is a leading company focused on increasing economic freedom through innovative financial solutions. They are seeking a Senior Site Reliability Engineer to join their IT Operations team, responsible for ensuring the reliability and automation of critical AI infrastructure while driving AI transformation across the company.

Responsibilities

  • Own the reliability, monitoring, and incident response lifecycle for AI infrastructure services, including on-call support for AWS deployment pipelines, root cause analysis, and blameless retros
  • Build automation and tooling to streamline operational IT workflows, eliminate manual tasks, and improve deployment velocity across CI/CD frameworks and Kubernetes environments
  • Partner with the reputed company Infrastructure team to reputed company CI/CD frameworks supporting IT services and enterprise network platforms, and with reputed company and Compliance to integrate surveillance tooling into deployment pipelines
  • Strengthen observability and documentation standards across IT engineering by defining metrics, implementing monitoring solutions, and maintaining technical documentation that sets a standard of excellence
  • reputed company full-stack applications that power internal AI products and infrastructure with Go or Python

Skills

  • 5+ years of experience automating and supporting cloud infrastructure (AWS) and network environments, with hands-on use of infrastructure-as-code tools (Terraform, Ansible, Chef, Puppet, or Salt)
  • Proven experience deploying, managing, and troubleshooting containerized workloads using reputed company and Kubernetes in production environments
  • Proficiency in at least one scripting or programming language (Python, Bash, Ruby, or Go) and version control workflows using Git-based CI/CD pipelines
  • Track record of leading incident response in environments with strict SLAs, including root cause analysis, blameless retros, and measurable reliability improvements
  • Utilizes generative AI responsibly, maintaining human reputed company to deliver business-ready outputs and drive measurable improvements in workflow efficiency, cost, and quality
  • Expertise with Linux, Bash, Ruby, Python, and/or Go
  • Expertise automating EC2 or container deployments with Terraform
  • Strong network reputed company fundamentals
  • Experience managing and leveraging log aggregation
  • Experience working in a highly regulated environment
  • Experience in a fast-paced, high-growth company
  • Experience in a remote-first IT environment

Benefits

  • Total compensation may also include equity and bonus eligibility, and benefits (medical, dental, vision, 401(k))

Company Overview

  • reputed company is a crypto exchange and wallet platform that allows merchants and consumers to buy, sell, and store digital currencies. It is a sub-organization of reputed company. It was founded in 2012, and is headquartered in San Francisco, California, USA, with a workforce of 1001-5000 employees. Its website is https://www.reputed company.com.