Back to the board

Software Engineer II - Site Reliability

100% remote Flexible hours Hiring now

About the Role

We're looking for a Software Backend Engineer with strong DevOps expertise to support our team managing restricted government cloud environments for federal customers. This role involves both building scalable infrastructure and supporting distributed systems, with a focus on reliability, performance, and compliance.

As a Software Backend and DevOps Engineer, you'll troubleshoot production issues, enhance infrastructure, and ensure a smooth, secure experience for federal users. You’ll also collaborate with cross-functional teams to drive continuous improvements in deployment, monitoring, and system design.

The ideal candidate:

  • Brings a combination of technical depth and problem-solving skills
  • Navigates ambiguity and works effectively in complex, distributed systems
  • Collaborates across teams and escalates issues when needed to maintain progress
  • Identifies and addresses inefficiencies in workflows and operations
  • Contributes to clarity, accountability, and process improvement

Key Responsibilities

  • Automate and build tools to eliminate repetitive operational tasks and reduce toil
  • Maintain and scale reliable software applications using DevOps best practices
  • Build and enhance CI/CD pipelines for automated testing, builds, and deployments
  • Optimize and maintain Kubernetes-based orchestration systems for performance and reliability
  • Troubleshoot complex production issues across application, infrastructure, and distributed system layers
  • Participate in on-call rotations and support incident response
  • Collaborate with stakeholders and product teams on infrastructure and deployment requirements
  • Ensure compliance with government cloud standards across applications and infrastructure

Must-Have Qualifications

  • Proven ability to maintain 99.99% uptime in production environments
  • 6+ years of overall experience, including 3+ years in software development and 2+ years in DevOps practices.
  • 2+ years of experience with Kubernetes, Terraform, Python or Go, and AWS
  • 2+ years of experience working with distributed systems
  • Experience in fast-paced or startup-like environments
  • Strong collaboration and communication skills across cross-functional teams and divisions
  • Ability to ramp up quickly and contribute in complex, large-scale environments
  • Demonstrated leadership in incident management and operational reliability

Nice-to-Have Qualifications

  • Experience with FedRAMP compliance and government security requirements
  • Track record of implementing secure CI/CD pipelines in restricted or regulated environments
  • Familiarity with Redis, Kafka/PubSub, and relational databases

At Abnormal AI, certain roles are eligible for a bonus, restricted stock units (RSUs), and benefits. Individual compensation packages are based on factors unique to each candidate, including their skills, experience, qualifications and other job-related reasons. We know that benefits are also an important piece of your total compensation package. Learn more about our Compensation and Equity Philosophy on our Benefits & Perks page.

Base pay range:$148,800—$175,000 USDSan Francisco/New York Base pay range:$165,800—$195,000 USD

Originally posted on Himalayas

Apply To this Job

Keep exploring

Director of Strategic Accounts (Great Plains)

100% remote Flexible hours

Remote Substance Abuse Counselor

100% remote Flexible hours

Customer Success Manager (Sales-focused) - APAC

100% remote Flexible hours

Sr. Engineer - Frontend - Terraform Actions

100% remote Flexible hours

Nurse Practitioner, Oncology Urgent Care (11-8pm E Shift)

100% remote Flexible hours

Growth Designer

100% remote Flexible hours

Senior Software Engineer, Infrastructure

100% remote Flexible hours

Field Service Engineer 3 - DC and Northern VA / X-Ray

100% remote Flexible hours

Product Manager - Reservation Sales

100% remote Flexible hours

Sales Engineer

100% remote Flexible hours

Field Service Engineer 3 - Long Island, NY

100% remote Flexible hours

Staff Accountant

100% remote Flexible hours

Delivery Solutions Architect

100% remote Flexible hours

Corporate Sales Engineer, Next-Gen SIEM - SME Team (Remote)

100% remote Flexible hours

Experienced Full Stack Data Entry Specialist – Remote Live Chat Support

100% remote Flexible hours

Career Opportunities: Clinical Documentation Specialist (661403)

100% remote Flexible hours

Remote Data Entry Specialist – Accurate Data Management for Global Entertainment Leader arenaflex

100% remote Flexible hours

UI/UX Designer - Digital Payments Apps

100% remote Flexible hours

Experienced Bilingual Data Entry Clerk – High-Tech Data Management and Customer Service Expert

100% remote Flexible hours

Healthcare Consulting Manager [Primary Care/Beh...

100% remote Flexible hours