Back to the board

Site Reliability Engineer II ( Remote )

100% remote Flexible hours Hiring now

LivePerson (NASDAQ LPSN) is a leading customer engagement company, creating digital experiences powered by Curiously Human AI. Every person is unique, and our technology makes it possible for companies, including leading brands like HSBC, Orange, and GM Financial, to treat their audiences that way at scale. Nearly a billion conversational interactions are powered by our Conversational Cloud each month. You'll be successful at LivePerson if you are excited to build something from the ground up. You excel by finding daily opportunities to grow at the same pace as the technology we're building, and you build partnerships that improve our business. Likewise, you're someone who sees feedback as a chance to learn and grow and believe decisions powered by data are the norm. You care about the well-being of others and yourself.

Job Description

Site Reliability Engineer (Platform Engineer) Mid Level (L2) Location India (Remote)

Overview

We are seeking a Mid-Level Site Reliability Engineer (SRE) to join our global Platform Engineering team. As an SRE, your primary responsibility is to ensure that our platform is reliable, scalable, and performant. You’ll be the bridge between development and operations — designing automation, improving observability, and maintaining the health of our production systems. You should have what it takes to ask the right questions, identify potential risks early, and raise flags when necessary to maintain a culture of reliability and continuous improvement. You will

  • Collaborate closely with Developers, QA, and Product teams during sprint planning to understand release plans, dependencies, and infrastructure requirements.
  • Participate in the application release cycle, ensuring deployments are automated, consistent, and reliable.
  • Manage and operate Kubernetes clusters in Google Kubernetes Engine (GKE) and Amazon Elastic Kubernetes Service (EKS).
  • Develop and manage Terraform modules for provisioning and configuring cloud infrastructure across GCP and AWS.
  • Standardize service deployments using Helm for templating and versioned releases.
  • Build and enhance observability with Prometheus, Grafana, and Datadog to monitor application and platform performance.
  • Design, implement, and maintain GitLab CI/CD pipelines for build, test, and deployment automation.
  • Drive an automation-first culture by developing scripts and tooling in Python, Go, or Shell to minimize manual effort and improve efficiency.
  • Participate in a 24/7 on-call rotation, ensuring quick detection, mitigation, and resolution of incidents.
  • Perform root cause analysis (RCA) and contribute to post-incident reviews to prevent recurrence.
  • Proactively identify reliability or scalability gaps, raise early warnings, and partner with teams to address systemic risks. You have
  • 5-8 years of experience as a Site Reliability Engineer, Platform Engineer, or DevOps Engineer.
  • Hands-on experience managing Kubernetes clusters (GKE, EKS) in GCP and AWS.
  • Strong knowledge of Terraform, Helm, and GitLab CI/CD pipelines.
  • Proficiency in Python, Go, or Shell scripting for automation and tooling.
  • Experience implementing and managing observability stacks (Prometheus, Grafana, Datadog).
  • Deep understanding of Linux systems, cloud networking, and container orchestration concepts.
  • Experience working in Agile/Scrum environments and partnering closely with developers.
  • Excellent analytical skills with a proactive attitude — able to question assumptions and escalate potential risks early. Good to Have
  • Experience with ArgoCD or Flux (GitOps-based workflows).
  • Familiarity with service mesh (Istio, Linkerd) or API gateways.
  • Knowledge of cloud cost optimization, autoscaling, or security best practices.
  • Experience with incident management tools such as PagerDuty, ServiceNOW Why Join Us
  • Build and operate modern cloud-native platforms using Kubernetes, Terraform, GitLab, Datadog, and Grafana.
  • Be part of a global SRE team that values automation, reliability, and innovation.
  • Work in a collaborative culture that encourages ownership, learning, and continuous improvement.
  • Enjoy flexible working arrangements, competitive compensation, and career growth opportunities including certifications and mentorship. Why you’ll love working here As leaders in enterprise customer conversations, we celebrate diversity, empowering our team to forge impactful conversations globally. LivePerson is a place where u Apply tot his job Apply To this Job

Apply To This Job

Keep exploring

Senior Site Reliability Engineer, GeForce NOW

100% remote Flexible hours

System Administrator, Contract

100% remote Flexible hours

Linux and AWS Technical Operations Engineer - Work From Home

100% remote Flexible hours

Sr. Staff – Kernel / Linux Virtualization Engineer

100% remote Flexible hours

Entry Level Cyber Security Analyst | Remote $85...

100% remote Flexible hours

Network and Cybersecurity SME

100% remote Flexible hours

Security Engineer/ Architect - Local to Columbia, SC

100% remote Flexible hours

Backup Program Manager job at Eliassen Group in Washington, DC

100% remote Flexible hours

100% REMOTE: Sr Scrum Master / Technical Project Manager

100% remote Flexible hours

Technical Writer / Remote, 6+ Months Contract

100% remote Flexible hours

Experienced Part-Time Data Entry Associate – Remote Work Opportunity with arenaflex

100% remote Flexible hours

Experienced Customer Service Associate – Delivering Exceptional Experiences at arenaflex

100% remote Flexible hours

Business Development Director, Pharma

100% remote Flexible hours

Staff Software Engineer (Remote)

100% remote Flexible hours

Junior java developer spring boot /Data engineer/BI Analyst-Remote

100% remote Flexible hours

Experienced Data Entry Specialist – Live Chat Support for arenaflex's Customer Support Team

100% remote Flexible hours

Data Center Technician – Enterprise Server Installation, Cabling & Cloud Infrastructure Operations (Full‑Time, 10‑Hour Shifts)

100% remote Flexible hours

Software Development Engineer (Agentic AI & LLM Platforms) – Work Remotely (EST Hours) – Must Be Able to Obtain Public Trust – No 3rd Parties

100% remote Flexible hours

Floating Property Manager - West

100% remote Flexible hours

Senior Operations Analyst

100% remote Flexible hours