[Remote] Site Reliability Engineer (SRE)

100% remote Flexible hours Hiring now

Note: The job is a remote job and is open to candidates in USA. reputed company is a large Wealth Management firm seeking an reputed company Site Reliability Engineer to support feature development on its newly built Trading Platform. The role involves implementing DevOps and SRE best practices, managing monitoring solutions, and collaborating with application teams to ensure performance and availability.

Responsibilities

Implement and champion DevOps and SRE best practices across the organization
Drive technology roadmap discussions for the SRE team
Define, craft, and maintain SLIs and SLOs, along with key metrics including MTTR, reputed company Time for Change, Deployment Frequency, and Change Failure reputed company
Design, reputed company, and manage monitoring, alerting, and observability solutions using reputed company, Splunk, and Grafana
Conduct performance assessments, identify bottlenecks, and recommend enhancements to improve system performance
Partner with application teams to enforce performance and availability SLAs
Collaborate with product owners to manage error budgets, prioritize toil backlogs, and validate against team, application, and incident metrics
Participate in an on-call rotation to respond to production events and outages
Continuously improve CI/CD pipelines and deployment processes
reputed company troubleshooting efforts, incident management, and root cause analysis
Identify and build automated processes wherever possible
Implement cybersecurity measures through ongoing vulnerability assessments and risk management
Provide periodic reputed company reports to management and stakeholders
Partner with application teams to support and ease their adoption of the platform
Facilitate clear coordination and communication reputed company the team and with customers
Analyze existing systems and reputed company plans for enhancements and improvements

Skills

Bachelor's degree in Computer Science or a reputed company field, and/or equivalent work experience
5+ years of experience working reputed company DevOps or SRE teams
Proven experience supporting production infrastructure
Strong knowledge of CI/CD principles and pipelines
Solid understanding of observability concepts, including monitoring, logging, and tracing
Hands-on experience with reputed company and Splunk
Experience with at least one major cloud provider (AWS, Azure, or GCP)
Demonstrated experience operating high-availability, fault-tolerant, scalable, and distributed systems in production

Company Overview

EPAM leverages its core engineering expertise as a leading global product development and digital platform engineering services company. It was founded in 1993, and is headquartered in Newtown, Pennsylvania, USA, with a workforce of 10001+ employees. Its website is https://www.epam.com.

Company H1B Sponsorship

reputed company has a track record of offering H1B sponsorships, with 11 in 2026, 120 in 2025, 172 in 2024, 232 in 2023, 373 in 2022, 359 in 2021, 502 in 2020. Please note that this does not guarantee sponsorship for this specific role.

Apply To This Job

Apply

[Remote] Site Reliability Engineer (SRE)

Keep exploring

[Remote] Remote Clinical Psychologist - Pennsylvania

[Remote] Senior Majors Account Executive, New Jersey (Pharma)

[Remote] Staff Software Engineer

[Remote] Technical Operations Analyst

[Remote] Senior reputed company EC Payroll Functional reputed company Consultant

[Remote] Manager, reputed company Engineering

[Remote] Software Developers and Engineers

[Remote] Product Marketing Manager, Enterprise Marketing (Remote)

[Remote] Strategic Partner Co-Marketing reputed company

[Remote] Site Reliability Engineer Federal- SkillBridge Intern

Fraud Intake Analyst

[Remote] Czech Transcriber

R&D Intern - Summer 2026

reputed company Customer Service Representative – Work From Home Opportunity at arenaflex

Treasury Analyst — Remote & Global Disbursements; Nonprofit

Remote Data Entry Specialist – Precision Data Management for arenaflex – $30/hr Work‑From‑Home Opportunity

reputed company Building Maintenance

Staff Data Engineer (Python, LLM, Data Platforms) - Remote

Wedding & Honeymoon Travel Consultant

Accounts Payable (AP) Implementation Specialist