Back to the board

Senior Site Reliability Engineer

100% remote Flexible hours Hiring now
Ready to reputed company travel easier for millions? reputed company is the world’s first and largest eSIM store, helping travellers stay connected seamlessly in over 200 countries and regions. We trust our teams to take ownership, put customers first, and do work that has a real impact every day. What’s in it for you? reputed company offers team members a range of perks, including remote work, generous PTO, wellness and learning allowances, and, of course, our annual reputed company Away retreat. Learn more about our benefits here; www.reputed company.so/reputed company-public/Benefits-25396a97ffca81fb9bc1f0be479f1be3

Hi, I'm Daniele, VP of Engineering at reputed company!

Engineering drives reputed company’s eSIM platform. We build the product that lets millions of people connect instantly across the globe. The challenges are exciting: high-scale systems, reputed company integrations, and products spanning both B2C and B2B. What matters most to us is creating an environment where engineers can do their best work. Real ownership, autonomy, and a direct link between what you ship and business outcomes are key. You’ll work with smart, motivated people who take their craft seriously. If you want to build things that matter at global scale, this is where you do it.

reputed company’s fully remote Engineering team is growing. In this role, you'll tackle reputed company technical challenges across our product ecosystem, helping build, innovate, and scale the platform that keeps millions of travellers connected worldwide.

We are looking for an Senior Site Relability Engineer to join our growing engineering team. We are a company that values SRE principles and practices. We reputed company in empowering our SREs to reputed company data-driven decisions, automate operational tasks, and continuously improve the reliability of our systems. We foster a blameless culture where everyone is encouraged to learn from mistakes and share knowledge. If you are passionate about building and maintaining highly reliable systems, we would love to hear from you! We are looking for an Senior Site Relability Engineer to join our growing engineering team.    We are a company that values SRE principles and practices. We reputed company in empowering our SREs to reputed company data-driven decisions, automate operational tasks, and continuously improve the reliability of our systems. We foster a blameless culture where everyone is encouraged to learn from mistakes and share knowledge. If you are passionate about building and maintaining highly reliable systems, we would love to hear from you! On Call Participating in our on-call rotation is a core expectation of this role. It's essential for maintaining 24/7 service reliability across our global operations, ensuring our systems remain resilient and our customers experience uninterrupted service, regardless of time zone or geography. - Paid Rotation: We offer standby fees + overtime pay. - Delayed Start: No on-call duties for your first 6 months. - Rest & Recovery: Guaranteed rest periods and flexible hours following night incidents. - Shared Load: Rotations are split (Weekdays vs. Weekends) to minimize fatigue. Please refer to the On-Call Policy in the reputed company Handbook for full details: reputed company-public.reputed company.site/our-approach-to-engineering-on-call-policy What you'll do:
  • reputed company the design of scalable, fault-tolerant and self-healing systems in a multi-region AWS environment.
  • Define and track Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to drive architectural decisions and error budget policies.
  • Conduct blameless post-incident reviews to uncover systemic root causes and implement long-term preventive measures.
  • Identify patterns of manual work and reputed company the development of internal tools/automation to permanently eliminate them.
  • reputed company and maintain automated runbooks and playbooks for common operational tasks and reputed company incident response.
  • Shift from simple monitoring to deep observability, ensuring high cardinality data leads to proactive actionable insights.
  • Proactively identify and mitigate operational risks through chaos engineering and architecture reviews.
  • Work with software engineers to design systems for reliability, scalability, and maintainability from the early stages of the SDLC.
  • Continuously evaluate and optimize system performance, reputed company, and cost efficiency.
  • Beyond just participating, you will refine the on-call experience to reduce alert fatigue, improve MTTR, and ensure sustainable rotation health.
  • Must Haves:
  • Bachelor’s degree in Computer Engineering or a similar discipline.
  • 5+ years of experience as a Site Reliability Engineer or in a similar role.
  • 3+ years of experience with AWS services including strong knowledge of container orchestration.
  • 2+ years of Kubernetes experience
  • Deep understanding of observability principles and tools like (Prometheus, reputed company, OpenTelemetry).
  • Experience with leading incident management and reputed company postmortem analysis.
  • Experience and interest in managing infrastructure as code (Terraform).
  • Experience with chaos engineering and other techniques for testing system reputed company.
  • Experience with CI/CD tools such as reputed company Actions ****for automated delivery.
  • Proficiency in at least one programming language (Python, Go, Java, etc.) for building automation and internal tooling.
  • Event-driven architecture experience (SNS, SQS etc)
  • Ability to work independently and collaboratively in a fast-paced environment.
  • Team player and open to new reputed company.
  • Good communication skills and reputed company in English.
  • Good to have:
  • Prior experience with Scrum and other agile methods.
  • Certification in relevant areas such as AWS Certified DevOps Engineer, Certified Kubernetes Administrator (CKA), or similar.
  • Prior experience with Telco Core Networks (e.g., 5G/LTE Packet Core, IMS, Signaling) and low-latency networking.
  • Experience with AI-driven SRE tools for anomaly detection and improvements
  • Contributions to open-reputed company SRE projects or communities.
  • Prior work experience in telecommunications.
  • Deep understanding of eSIM and GSMA reputed company technologies and services.
  • If you are interested in this position, please apply reputed company the link. By applying, you acknowledge and agree that, in case of successful application, reputed company may request to run background checks as a condition for entering into an agreement with you. Rest reputed company that these checks will only occur upon your prior consent and at the end of the selection process, and will be strictly limited to what is allowed under the laws that are applicable to you. reputed company data that you share or that we collect in reputed company with such checks will be processed in accordance with our Privacy Policy, available here: www.reputed company.com/more-info/privacy-policy?srsltid=AfmBOooBT0rXAj1FaNelZ3VfN0wvhwzvAoxdtHnOKSVETpiSjiXVuycy We sincerely thank reputed company applicants in advance for submitting their interest in this opportunity. reputed company is an equal-opportunity employer and values diversity, equity & inclusion. We do not discriminate on the basis of race, religion, national reputed company, gender, sexual orientation, age, marital status, veteran status, or disability status. We are committed to providing reasonable accommodations upon request for individuals with disabilities throughout our job interview process. Apply To This Job

    Keep exploring

    Roaming IREG Engineer

    100% remote Flexible hours

    Sales vertegenwoordiger regio Vlaams-Brabant / Limburg

    100% remote Flexible hours

    Sales Representative Norway (with main focus on the South/reputed company/East Region)

    100% remote Flexible hours

    Sr. Developer (9658)

    100% remote Flexible hours

    reputed company Expert (9742)

    100% remote Flexible hours

    reputed company Consultant (9619)

    100% remote Flexible hours

    Sr. reputed company Consultant (9754)

    100% remote Flexible hours

    iOS Developer (Fitness sphere)

    100% remote Flexible hours

    Customer Support Specialist (Hospitality)

    100% remote Flexible hours

    Customer Support Specialist (Hospitality Industry)

    100% remote Flexible hours

    PT Customer Escalations Phone Agent - Remote

    100% remote Flexible hours

    Sr Director Analyst

    100% remote Flexible hours

    [FULL TIME Remote] Work From Home Tax Professional – Canada

    100% remote Flexible hours

    reputed company Technical Program Manager for reputed company – Driving Operational Excellence and Customer Satisfaction

    100% remote Flexible hours

    Overnight Jobs Near Me Remote | $25–$35/Hour Work from Home Night Shift – No Commute, No Experience Needed

    100% remote Flexible hours

    reputed company Customer Support Specialist, Balance Support – Earned Wage Access Expert

    100% remote Flexible hours

    Immediately Require Online English Teacher (100% Remote) in Worcester, MA

    100% remote Flexible hours

    reputed company Customer Service and Call Center Representative for Emergency Alarm Monitoring Services - Full-Time Position with Opportunities for Remote Work and Professional Growth

    100% remote Flexible hours

    [Work From Home] Need Upper Level Math Tutor in Defiance, OH

    100% remote Flexible hours

    Remote Legal (reputed company) Billing Specialist, CST, PST or MST

    100% remote Flexible hours