Specialist - Software Engineering (MX)

100% remote Flexible hours Hiring now

Job Title

Site Reliability Engineer (SRE)

Role Description

We are seeking a Site Reliability Engineer (SRE) with strong DevOps and automation expertise to ensure the reliability, scalability, and performance of distributed systems. This role focuses on CI/CD automation, monitoring, observability, and system troubleshooting across cloud-native and Kubernetes-based environments.

You will play a critical role in building and maintaining monitoring platforms, automating operational processes, and improving system reliability across multiple application domains.

Key Responsibilities

  • Apply Site Reliability Engineering (SRE) and DevOps best practices to improve system availability, performance, and scalability.
  • Design, build, and maintain CI/CD pipelines with a strong focus on automation.
  • Implement and manage metrics collection, monitoring, and alerting across platforms.
  • Perform system troubleshooting and problem-solving across infrastructure and application layers.
  • Create, operate, and maintain Prometheus and Grafana clusters for monitoring Kubernetes environments.
  • Implement and support observability standards, including OpenTelemetry.
  • Develop and maintain automation tools and scripts using Python, Groovy, and shell scripting.
  • Collaborate with engineering and platform teams to improve reliability, deployment processes, and operational efficiency.

Required Skills & Qualifications

  • Hands-on experience in Site Reliability Engineering (SRE) and DevOps roles.
  • Strong expertise in CI/CD pipelines, automation, and deployment strategies.
  • Experience with metrics collection, monitoring, and alerting systems.
  • Proven ability in system troubleshooting and root cause analysis across platforms and applications.
  • Hands-on experience managing Prometheus and Grafana for Kubernetes cluster monitoring.
  • Strong automation and scripting skills using:
    • Python
    • Shell scripting
    • Groovy
  • Experience working with OpenTelemetry for distributed tracing and observability.

Key Skills

  • SRE experience managing cloud services and accounts.
  • Strong Prometheus and Grafana querying and dashboarding skills.
  • Observability and monitoring best practices.
  • Automation-first mindset with strong scripting capabilities.
  • Kubernetes monitoring and cloud-native operations experience.