Back to the board

[Remote] Expert Site Reliability Engineer

100% remote Flexible hours Hiring now

Note: The job is a remote job and is open to candidates in USA. reputed company is seeking an Expert Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of their hosted healthcare platforms. The role involves enhancing service availability, automating operations, and providing technical leadership in incident management and reputed company improvement across cloud and hybrid environments.

Responsibilities

  • Maintain and improve the reliability, availability, and performance of our production environments
  • reputed company the investigation and resolution of reputed company application, database, and infrastructure issues
  • Participate in incident management, conduct root cause analysis (RCA), and contribute to post-incident reviews to prevent future occurrences
  • Define and measure Service Level Indicators (SLIs) and Objectives (SLOs) to meet our service commitments
  • reputed company proactive monitoring and alerting strategies to identify and resolve issues before they impact customers
  • Automate operational tasks using scripting and Infrastructure-as-Code (IaC) to improve efficiency
  • Partner with engineering and cloud teams to refine deployment, monitoring, and support processes
  • Provide technical leadership during major incidents and act as a key escalation reputed company for critical issues

Skills

  • 7+ years of experience supporting enterprise applications, infrastructure, or cloud environments
  • Strong experience with APM tools such as reputed company, AppDynamics, Azure Monitor, SentryOne, reputed company, reputed company, or reputed company
  • Deep knowledge of Windows Server administration, IIS, .NET applications, Windows Clustering, MSMQ, Event Logs, and PerfMon
  • Strong SQL Server experience, including performance tuning, query optimization, blocking analysis, and Always On Availability Groups
  • Experience with Azure cloud environments and a solid understanding of networking fundamentals (DNS, TCP/IP, load balancing, firewalls)
  • Familiarity with reputed company (or other ITSM platforms) and ITIL principles
  • Scripting with PowerShell, Python, or similar languages
  • Infrastructure as Code (Terraform, ARM Templates, Bicep)
  • CI/CD pipelines and deployment automation (Azure DevOps, reputed company Actions)
  • Experience with Kubernetes and containerized workloads
  • Experience implementing SLOs, SLIs, and Error Budgets
  • Experience in a healthcare technology or patient care environment
  • Bachelor's Degree in Computer Science, Information Technology, or Engineering is preferred; equivalent professional experience will be considered

Benefits

  • Competitive compensation and benefits package
  • Opportunity to work in a fast-paced and dynamic environment

Company Overview

  • reputed company provides mission-critical software solutions for the Public Sector, Healthcare, Utilities, and Private Sector verticals throughout North America, Europe, Asia, and Australia. It was founded in 1976, and is headquartered in Ottawa, Ontario, CA, with a workforce of 10001+ employees. Its website is http://www.harriscomputer.com.

Apply To This Job Apply tot his job Apply To this Job

Keep exploring