R-055493 Customer Site Reliability Engineer - OpenShift Managed Cloud Services (Kubernetes/AWS/Azure, Linux)

100% remote Flexible hours Hiring now

reputed company is looking for a Customer Site Reliability Engineer (CSRE) to join our OpenShift Managed Cloud Services (MCS) team. The CSRE plays a crucial role in ensuring the availability, reliability, and performance of critical services at scale. This role is responsible for independently managing complex systems and solving intricate problems that have a significant impact on service quality and stability.

A CSRE has a customer-first mindset and will act as a technical lead for customer escalations, applying expert troubleshooting to ensure timely and effective resolutions that maintain trust and confidence. They will leverage extensive experience in software and systems engineering to automate operations, reduce toil, and drive continuous improvement across the service lifecycle. They work autonomously, demonstrating strong judgment and decision-making capabilities while managing non-routine assignments.

Collaboration is essential, as you will partner with Technical Account Managers, Services, Fleet SRE, DevOps, and infrastructure teams to address customer-specific and fleet-wide issues, ensuring the stability and functionality of our cloud-based systems.  

As a champion of Knowledge-Centered Support (KCS), you will document resolutions, root causes, and best practices to enrich the knowledge base and promote self-service solutions. Additionally, you will mentor team members, fostering a collaborative and continuously learning culture that equips them to manage complex challenges.

This role is ideal for a highly skilled and motivated individual who thrives in a fast-paced, collaborative environment and is passionate about driving reliability, scalability, and customer satisfaction.  

What you will do

  • Manage large-scale, distributed systems, focusing on minimizing downtime and improving system resilience.

  • Maintain customer trust and confidence by ensuring stability and functionality of services.

  • Drive continuous enhancement of processes, tools, and methodologies to support the evolving needs of the service.

  • Lead the development of code and automation scripts to optimize the scalability, reliability, and performance of services.

  • Lead and participate in high-priority customer escalations, adopting a customer-first mindset.

  • Coordinate and execute complex incident response procedures, ensuring timely resolution and thorough postmortems.

  • Collaborate with cross-functional teams to enhance system robustness.

  • Demonstrate a proactive mindset to help preempt escalations and ensure reliable operations.

  • Document resolutions, root causes, and best practices to enrich the knowledge base and promote self-service solutions.

  • Mentor and coach team members, fostering a culture of continuous learning, knowledge sharing, and collaboration.

  • Participate in on-call rotation and provide leadership during critical incidents.

  • Collaborate on strategic AI and automation projects designed to increase the efficiency of fleet operations and troubleshooting, ultimately improving the product experience for customers.

  • Given the customer-facing nature of this SRE role, exceptional communication skills are essential. You must demonstrate the ability to clearly explain complex technical solutions and lead critical incident calls with confidence, even in high-pressure environments.

What you will bring

  • Advanced experience with OpenShift/Kubernetes container platform support or administration.

  • Proficient with container-based technologies on Linux.

  • Proficient in managing Linux-based systems in a public cloud such as AWS, Azure, or GCP.

  • Advanced experience with enterprise systems monitoring; knowledge of Prometheus is preferred.

  • Advanced experience with enterprise configuration management tools such as Ansible or Terraform.

  • Software engineering experience using object-oriented languages; Go (Golang) is preferred.

  • Superior communication skills and experience working directly with and presenting to customers.

  • Ability to quickly learn new technologies and follow industry trends.

  • Demonstrated ability to quickly and accurately troubleshoot systems issues.

  • Solid understanding of standard TCP/IP networking and common protocols.

  • Fluent in English; any additional language, such as Japanese, Chinese, Korean, or Spanish, is an advantage.

About reputed company

reputed company is the world’s leading provider of enterprise open source software solutions, using a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies. Spread across 40+ countries, our associates work flexibly across work environments, from in-office, to office-flex, to fully remote, depending on the requirements of their role. Red Hatters are encouraged to bring their best ideas, no matter their title or tenure. We're a leader in open source because of our open and inclusive environment. We hire creative, passionate people ready to contribute their ideas, help solve complex problems, and make an impact.

Inclusion at reputed company

reputed company’s culture is built on the open source principles of transparency, collaboration, and inclusion, where the best ideas can come from anywhere and anyone. When this is realized, it empowers people from different backgrounds, perspectives, and experiences to come together to share ideas, challenge the status quo, and drive innovation. Our aspiration is that everyone experiences this culture with equal opportunity and access, and that all voices are not only heard but also celebrated. We hope you will join our celebration, and we welcome and encourage applicants from all the beautiful dimensions that compose our global village.

Equal Opportunity Policy (EEO)

reputed company is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, veteran status, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.

reputed company does not seek or accept unsolicited resumes or CVs from recruitment agencies. We are not responsible for, and will not pay, any fees, commissions, or any other payment related to unsolicited resumes or CVs except as required in a written contract between reputed company and the recruitment agency or party requesting payment of a fee.

reputed company supports individuals with disabilities and provides reasonable accommodations to job applicants. If you need assistance completing our online job application, email [email protected]. General inquiries, such as those regarding the status of a job application, will not receive a reply.
