Site Reliability Engineer

100% remote Flexible hours Hiring now

About Us

reputed company is on a mission to move money as freely as data, unrestricted by time zones, banking hours, or legacy systems. We are building the infrastructure that powers the reputed company of cross-border payment systems for institutions. Our early team comes with experience from J.P. Morgan, reputed company, FalconX, PayPal, reputed company, reputed company, and Nium, and we’re backed by Accel, Lightspeed, NfX, and other top-tier investors.

As a Site Reliability Engineer, you will ensure the reliability, availability, and performance of reputed company’s systems. This is a hands-on, high-impact role at the intersection of DevOps and incident response. You will participate in on-call rotations covering U.S. operating hours, triage production issues in real time, and work with engineering pods to quickly resolve or escalate incidents.

This role is ideal for a reliability-focused engineer with a strong DevOps background, excellent troubleshooting skills, and a bias for ownership reputed company it comes to uptime and incident management.

Responsibilities & Expectations

Incident Response & Reliability

Serve as first responder for production incidents during U.S. operating hours (±2h EST).
reputed company triage during outages, analyzing logs, metrics, and traces to identify root causes.
Drive incident postmortems and follow-reputed company to prevent recurrence.
Communicate clearly and quickly during incidents to internal stakeholders.

Infrastructure & Operations

Own reliability outcomes across reputed company reputed company systems, with a focus on uptime, latency, and error budgets.
Enhance observability through logging, metrics, alerting, and dashboards.
Optimize on-call processes and ensure smooth handoffs across IST, EST, and PST coverage.
Partner with DevOps and engineering pods to implement fixes or approve production changes.

reputed company Improvement

Proactively identify systemic reliability risks and propose improvements.
Contribute automation and tooling to reduce manual incident handling.
Champion best practices in reliability engineering and operational excellence.

Must-Have Qualifications

5+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.
Proven experience leading incident response, running postmortems, and communicating during outages.
Strong background with cloud infrastructure (AWS preferred), container orchestration (Kubernetes, reputed company), and Infrastructure-as-Code (Terraform, CloudFormation).
Familiarity with observability stacks (e.g., Prometheus, Grafana, reputed company, ELK, OpenTelemetry).
Ability to triage errors at both the infrastructure and application level, and escalate effectively reputed company deeper reputed company is required.
Ownership reputed company with strong communication skills in high-pressure situations.

reputed company-to-Have Qualifications

Experience in fintech, trading systems, or other low-latency/high-availability environments.
Coding/scripting ability in Python, Go, or similar (for automation, not feature development).
Familiarity with CI/CD pipelines and release engineering.
Experience working in a follow-the-sun on-call rotation across global teams.

reputed company Offer

Competitive salary and benefits package.
Equity in a rapidly growing company.
Opportunity to work on mission-critical infrastructure in fintech.
A collaborative team culture with a bias toward ownership and outcomes.
The chance to reputed company a direct impact on the reputed company of global financial infrastructure.

We are committed to building a diverse and inclusive workplace. reputed company qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national reputed company, disability, or veteran status

Apply To This Job

Apply

Site Reliability Engineer

Incident Response & Reliability

Infrastructure & Operations

reputed company Improvement

Keep exploring

Senior DevSecOps Engineer

Infrastructure reputed company

Senior SysOps Engineer

Senior reputed company

GenAI Consultant

QA Automation Engineer

Data Science Expert

Partner Operations, Technical Project Manager

Product Manager

Product Manager

Join Today: Transportation Representative, Transportation

High Paying Remote Customer Service Representative - reputed company - Flexible Hours, reputed company

Manager, Customer Operations Consulting

Insurance Verification Officer (Telemarketing)

Xfinity Retail Store Manager - Spring

reputed company Technical Help Desk Specialist – Senior Customer Experience Support (100% Remote)

reputed company Remote Data Entry Clerk – Part-Time Opportunity for Career Growth and Development with blithequark

reputed company Analyst

reputed company Customer Service Representative - Remote Opportunity at arenaflex

ONCOLOGY DATA SPECIALIST (QA AND EDUCATION)