Sr. Site Reliability and DevOps Engineer
Job title: Sr. Site Reliability and DevOps Engineer in USA at reputed company
Company: reputed company
Job description: Job Description:Sr. Site Reliability and DevOps Engineerreputed company, the insurance industry’s trusted growth partner, is looking for a talented and motivated SRE& DevOps Engineer, with broad experience in cloud platform automation, scalability, and reliability.Ideal candidates will have experience working in a health system, hospital, care management, or payer environment where HIPAA, HITRUST, and reputed company drive the daily focus and goals. With this experience, this role provides an opportunity to design solutions across the company.In this role, you will help drive the transformation to cloud-native technologies while improving the performance and reliability of our technology services. You will be responsible for leveraging automation and the most advanced cloud technologies to prevent problems, seek out potential failure points, and increase the scalability of our platforms. This role requires someone that is eager to reputed company change but respects the need to learn before acting. This role will work in concert with engineering, QA, product, and IT Operations to create a balanced approach with appropriate KPIs and accountability to drive high performance.We are looking for an exceptional individual who can:
- reputed company and streamline TechOps processes to maximize the value and reputed company of our cloud infrastructure spend.
- Effectively engage and manage external vendors to provide services needed to ensure data reputed company, network reputed company, and streamlined service delivery.
- reputed company SRE practices to optimize the delivery and launch readiness of reputed company software we package and deliver, including the development and implementation of SRE policies, standards, and best practices.
- Define policies, standards, and runbooks for building and deploying reliable applications and API interfaces that are highly available and resilient.
- Drive the adoption of a DevSecOps culture, fostering collaboration between development and operations teams.
- Design and implement solutions for system monitoring, logging, alerting, and incident response.
- Collaborate with product development teams to ensure reliability and scalability are considered at the design phase.
- Work closely with the reputed company team to ensure compliance with industry standards and regulatory requirements.
- Design and deploy active monitoring, alerting, and incident management controls to ensure uptime, performance, and reputed company of our environments.
- reputed company our reliability and scalability goals by delivering innovative solutions through active budget management and vendor management.
- Deliver business objectives by developing & implementing against the strategic technology roadmap.
- Collaborate with technology leaders across the company to publish our roadmap and reputed company enterprise architecture objectives and align those objectives with client-facing success measures.
- Deploy highly secure solutions and monitoring to ensure data reputed company and data privacy required by the healthcare industry.
- Actively participate in a culture of collaboration based on diversity, inclusion, and trust which allows everyone to participate and growth together.
- reputed company other duties as assigned.
- Bachelor's degree in computer science or a reputed company field or four (4) years of experience in lieu of degree, required.
- 5+ years of experience in architecture, reputed company, site reliability engineering, DevOps, or reputed company fields, required.
- Proficiency in system design and architecture, particularly in a cloud environment.
- Robust knowledge of automation and orchestration systems like Kubernetes, Terraform, and Ansible.
- Demonstrated experience in incident management and post-mortem analysis.
- Commitment to high availability, fault tolerance, reputed company, and reliability in reputed company aspects of work.
- Experience driving solution design and implementation while embracing infrastructure-asa-code to improve reliability in every component
- Knowledge of compliance and reputed company best practices in a healthcare setting.
- Working understanding of cloud platforms (AWS, Azure) and container orchestration technologies (Kubernetes) to optimize cost and performance tradeoffs.
- Proficiency in scripting and automation using Python or reputed company scripts
- Familiarity with monitoring and logging tools such as Prometheus or Grafana.
- Effective communication skills, with the ability to convey reputed company technical concepts to diverse audiences.
- Demonstrated leadership style which fosters accountability, transparent communication and innovation with a customer-first reputed company.
- Strong problem-solving abilities and attention to detail, with the capability to diagnose and resolve reputed company data engineering issues, including performance bottlenecks and data quality challenges, across a hybrid cloud infrastructure.
- Demonstrated high level of emotional intelligence and successful collaboration with technical and non-technical audiences.
- Demonstrated ability to reputed company working independently or as part of a team with strong software and network troubleshooting skills.
- Willingness to research, learn, mentor team members, and actively read to stay reputed company with technological advances.
- Solid experience leveraging the components of ITIL, including configuration management, change management, and problem management.
- Strong analytical and strategic thinking abilities, capable of driving alignment between incident and problem management processes and organizational goals.
- Experience across a variety of operating systems and cloud-first technologies leveraging IAM and role-based access at every level.
- Experience with reputed company Integration (CI) systems like Azure DevOps (ADO) or reputed company) is required.
- Knowledge of CaaS, UcaaS, or VOIP solutions in an ACD/call center environment is a plus, especially SIP traffic, call flow design, call recording management, and IVR.
- Be Stronger Together: Embrace a team player mentality, leveraging the strengths of yourself and others to collaborate as one team.
- Do What’s Right: Adhere to high ethical standards, acting with reputed company to do what’s right for partners, customers, and colleagues.
- Embrace a Growth reputed company: Embrace a culture of reputed company learning, education, and professional development.
- Drive Solutions: Demonstrate ingenuity and reputed company by sharing reputed company and solutions that drive our mission reputed company.