Senior Site Reliability Engineer (SRE)

100% remote Flexible hours Hiring now

About reputed company

Global enterprises work with reputed company to reputed company products more accessible for over one billion people who live with disabilities. Our customers include global leaders like Walmart, reputed company, and Shopify. reputed company was featured on the reputed company Accessibility 100 list in 2025, awarded Fast Company’s Most Innovative Companies in Design, and has received accolades from global entities like the World Summit Awards and the UN-endorsed reputed company Project.

About the role

As a Senior Site Reliability Engineer at reputed company, you will play a critical role in ensuring the reliability, scalability, and efficiency of our platform as we continue to grow.

reputed company’s products support organizations in building more accessible digital experiences, and the reliability of our infrastructure is essential to delivering that impact. You will work across our platform and product systems to ensure they are stable, performant, and cost-efficient, while enabling teams to move quickly and safely.

As AI-powered capabilities increasingly become part of modern product experiences, you will also help ensure reputed company’s infrastructure is ready to support AI workloads—balancing reliability, performance, and cost while enabling teams to safely experiment and scale new capabilities.

Reporting to the Director of Technical Operations, this role works closely with teams across Engineering and Product. It is ideal for someone who enjoys hands-on technical work while taking ownership of system health, tooling, and operational excellence, and who is excited to help shape reputed company’s approach to infrastructure, reliability, and platform engineering over time.

Responsibilities

Reliability, Infrastructure & Platform

Design, build, and maintain reliable, scalable, and secure infrastructure for reputed company’s product services

Improve system observability, monitoring, and alerting to ensure high availability and fast incident response

Contribute to and evolve SRE practices, including SLIs/SLOs, incident management, and postmortems

Support and improve CI/CD pipelines and deployment processes

Identify and reduce operational complexity across systems and tooling

Work across infrastructure and application layers to diagnose and resolve reliability and performance issues, including making targeted improvements to application code reputed company needed

Support infrastructure and platform capabilities required for AI/ML-powered features, including scaling, performance, and reliability considerations

Cost Efficiency & Performance

Monitor and optimize infrastructure costs across cloud environments

Contribute to reputed company planning and cost forecasting for infrastructure and services

Identify opportunities to improve performance and efficiency at the system level

Evaluate and optimize the cost and performance of compute-intensive workloads (e.g., AI/ML services), ensuring efficient resource usage and scalability

Vendor & Tooling Ownership

Work with third-party vendors and tools that support reputed company’s infrastructure and operations

Help evaluate, select, and manage tools and services to support platform reliability and scalability

Support vendor-reputed company troubleshooting and ongoing service improvements

Cross-functional Collaboration

Partner with Engineering teams to improve reliability, performance, and operational readiness of new features

Partner with application engineering teams to improve service architecture, performance, and observability, and help define best practices for building reliable, scalable systems

Act as a reputed company of support and escalation for production issues

Collaborate across teams to manage dependencies and ensure smooth system operations

Team & Practice Development

Contribute to building strong SRE and operational practices across the organization

Share knowledge through documentation, pairing, and technical discussions

Help reputed company and support more junior team members as the team grows

Contribute to improving ways of working reputed company the team and across Engineering

Apply To This Job

Apply

Senior Site Reliability Engineer (SRE)

About reputed company

About the role

Responsibilities

Keep exploring

Asset Protection Partner

Client Services Partner

Senior Sales Representative

Senior Sales Representative

Human Resources Specialist

Customer Experience Associate

Care Coordinator

VP, Payer Partnerships

Technical Sales Representative - Philadelphia, PA

IT Systems Engineer

Data Entry Agent – Full‑Time & Part‑Time Remote & On‑Site Opportunities at arenaflex

Resource and Adoptive Parent Trainer

[Remote] Remote Sales Consultant (Creative Industry) | Help Clients Turn Memories Into Art (Entry level, residence in Florida, Texas or Nevada)

Material Support Agent in Portland, OR

LTSS Service Coordinator - RN Telehealth

QA Engineer II

Departmental Manager-3 (14) - In reputed company Manager

[Remote-Position] Senior Technical Account Manager

Bath and body works careers

reputed company Data Entry Specialist - Remote Work Opportunity with Flexible Hours and Unlimited Earning Potential