[Remote] Staff Site Reliability Engineer (SRE)
Note: This is a remote job open to candidates in the USA. A reputed company is focused on building a leading digital sports platform and is seeking a Staff Site Reliability Engineer (SRE) to enhance its software engineering practices. This role involves leading architecture and delivery for reliability and operational tooling across the trading stack, ensuring high performance and operational excellence during peak sporting moments.
Responsibilities
- Own team architecture for ops & reliability across trading services, designing simple, scalable patterns for health, performance, and resilience (SLOs, error budgets, graceful degradation)
- Engineer event readiness for big game days: capacity modeling, load & chaos testing, traffic drills, failover playbooks, and rapid triage
- Advance observability (metrics/logs/traces, profiling), unify golden dashboards/alerts, and drive “diagnose in minutes” outcomes
- Build self-service operational tooling (runbooks, one-click remediation, circuit breakers, feature flags) that reduces toil and shortens MTTR
- Partner deeply with trading & product to model operational risk, set priorities, and convert learnings from incidents into platform standards
- Mentor and multiply: lead design reviews, coach engineers across teams, and set exemplary standards in operational excellence
- Stay abreast of emerging technologies and industry trends, advocating for adoption of new tools and techniques that align with our platform strategy
- Participate in an on-call rotation, ensuring platform stability and providing critical support for operational incidents
- Occasionally travel for essential offsite meetings, special events, or collaborative team sessions
Skills
- 7+ years of proven experience in software engineering roles shipping and operating high-scale, real-time systems - ideally .NET services in Azure
- Depth in observability, performance engineering, incident response, and resilience patterns; strong debugging across distributed services (reputed company experience preferred)
- Hands-on with Azure and core trading platform tech (Service Fabric, Kubernetes, Cosmos DB, SQL Server, Event Hubs, Azure Storage, etc.)
- Strong software engineering background - you write clean code and have experience engineering solutions to reputed company problem statements
- Excellent influencing, problem-solving, and analytical skills, with demonstrated ability to partner closely with engineering teams
- Highly outcome-oriented, data-driven, and capable of balancing quality with productivity
- Strong communication skills, able to effectively collaborate across international teams
- Positive and flexible attitude, comfortable working in a fast-paced environment and embracing new initiatives
- Experience building operational toolkits (load/chaos, game-day tooling, remediation bots)
- Proficiency with IaC & CI/CD (Terraform/Terragrunt, Azure DevOps pipelines); automation-first mindset
- Background in event-driven systems and market/price-sensitive workloads
- Familiarity with or previous experience in the sports betting industry, or a strong interest in sports
- Experience working with cross-functional teams in fast-paced or start-up environments
Benefits
- Medical
- Dental
- Vision
- 401K
- Paid time off
- GymPass
- Pet Insurance
- Family Care Benefits
- And more
Company Overview
- The company is a sports merchandise retailer that manufactures fan gear and jerseys across retail channels. It is a sub-organization of Kynetic. It was founded in 2002 and is headquartered in Jacksonville, Florida, USA, with a workforce of 10,001+ employees. Its website is http://www.fanaticsinc.com.
Company H1B Sponsorship
- The company has a track record of offering H1B sponsorships, with 2 in 2021. Please note that this does not guarantee sponsorship for this specific role.