Back to the board

Senior Site Reliability Engineer - Remote EST

100% remote Flexible hours Hiring now

Job Description

Join us as a Senior SRE where you'll bridge the gap between cutting-edge AI innovation and rock-solid production stability. Working independently from the East Coast, you will collaborate with our global DevOps teams to automate 70% of your workload while owning the reliability of our AWS/Kubernetes environment. This is a role for a production-hardened engineer who wants a strong voice in technology decisions and the opportunity to build the future of AI-driven operations. This is a fully remote role, however, you must be physically located in EST and be willing and able to work EST hours Monday-Friday and participate in on-call rotations. We cannot consider candidates located in CST, MST or PST at this time. Base salary for this role ranges from $160,000 - $210,000 per year. Job Requirements

  • 5+ years of experience as a Senior SRE or Production Engineer (this is a hard requirement).
  • Deep Production Expertise: You must have extensive experience managing live, high-traffic SaaS environments; developer-only backgrounds without ops experience will not be a fit.
  • Cloud & Orchestration: Proven mastery of Kubernetes and AWS in production settings.
  • Coding/Scripting: Advanced proficiency in Python (preferred) or Go for automation; we need more than just Bash skills.
  • AI Knowledge: A strong understanding of or direct experience with AI/LLM technologies.
  • Observability: Hands-on experience with Datadog for monitoring and incident response.
  • Autonomy: Ability to work independently without direct daily oversight, managing production incidents and on-call responsibilities.
  • Time Zone: Located in the East Coast time zone to provide coverage overlap with our global teams.

Job Responsibilities

  • Design, build, and operate production-grade Kubernetes infrastructure on AWS
  • Developing Ai Agents to handle incidents and root cause analisys
  • Build and maintain GitOps-based CI/CD pipelines using GitHub Actions and ArgoCD
  • Develop internal DevOps tooling and developer self-service platforms
  • Own monitoring, observability, and operational excellence using Datadog
  • Collaborate with engineering teams to improve delivery speed and reliability

Benefits

HiBob is a village filled with amazing people and we're especially proud of that. It's a place where Bobbers can be themselves. We're about fun, dreams, hopes and ambition, just as much as we are about precision, growth, and top performance. Becoming a Bobber means you'll receive competitive compensation, benefits, and pre-IPO equity alongside all of this:

  • Stock options at a high-growth unicorn startup
  • 100% subsidized medical, dental, and vision coverage for employees
  • 401(k) with a 3% company match starting from Day 1
  • Hybrid working model for bobbers in the NY metro area
  • Work from home allowance to get your home office set up!
  • Temporary remote work-from-anywhere in the world for up to 2 months after 6 months of employment
  • Annual Headspace subscription and wellness benefits
  • Two social impact days per year for volunteering
  • Bob balance days - 4 additional days within a calendar year - Enjoy a company-wide long weekend at the beginning of each quarter
  • Employee referral program - $2,500 bonus for each successful referral with an additional ambassador bonus
  • Fun and frequent social events (in-person and virtual)
  • We love birthdays - take the day off and receive a special gift
  • Dog-friendly office

If this sounds like something you've been looking for, we'd love to have you. Come on, join our village! Location Eligibility: While this is a remote position, HiBob is currently authorized to hire in the following states: CA. CO, CT, DC, FL, GA, IL, IN, KS, MA, MD, MN, NC, NH, NJ, NV, NY, OH, OK, OR, PA, RI, SC, TN, TX, UT, VA, WA. Will consider Canadian residents as well! Candidates must reside in one of these states to be considered for employment. Apply tot his job Apply To this Job

Keep exploring

Site Reliability Engineer, Core Streaming (Remote - United States)

100% remote Flexible hours

Site Reliability Engineer job at EPAM Systems in Mountain View, CA

100% remote Flexible hours

Sr. Site Reliability Engineer III (6448) Remote / Telecommute Jobs

100% remote Flexible hours

Solution Engineer, Data Engineering Specialist

100% remote Flexible hours

Social Media Analyst, Platform

100% remote Flexible hours

Remote Social Media Manager - Content Creation AND Account Management

100% remote Flexible hours

[Remote] Paid Social Media Manager (Senior, Meta Focus)

100% remote Flexible hours

Software Architect (Remote) - React, Node

100% remote Flexible hours

Lead Software Architect job at Honeywell in Atlanta, GA

100% remote Flexible hours

Advisory Software Architect Remote, United States

100% remote Flexible hours

Remote Data Entry & Customer Service Specialist – Flexible Work From Home Typing Professional (Full-Time and Part-Time Opportunities)

100% remote Flexible hours

Experienced Customer Support Representative for Innovative Shopify App Development at blithequark

100% remote Flexible hours

[PART_TIME Remote] Delta Airlines Remote Work From Home Job

100% remote Flexible hours

Customer Service Representative - Entry Level

100% remote Flexible hours

Experienced Remote Data Entry Clerk – Flexible Work Arrangements and Competitive Compensation

100% remote Flexible hours

Director of Customer Success Management - 100% Remote - North America

100% remote Flexible hours

[Remote] Business Development Representative

100% remote Flexible hours

Customer Success Specialist

100% remote Flexible hours

Experienced Pharmacy Technician I, Data Entry – Digital Pharmacy Experience at arenaflex in Tucson, AZ

100% remote Flexible hours

Sr. Sales Specialist, Fusion Territory

100% remote Flexible hours