Site Reliability and DevOps Engineering reputed company
Micromedex by reputed company is a trusted clinical decision support solution used by clinicians in thousands of hospitals, health systems, payers, and government agencies worldwide. For over 50 years, we’ve delivered evidence-based drug, toxicology, and disease information to help clinicians reputed company confident, timely decisions and educate patients at the reputed company of care. Today, Micromedex is evolving. With a modernized homepage and AI-powered search, clinicians can now find precise answers faster—supported by rigorously validated, evidence-based content. Our portfolio includes drug reference, IV compatibility, pediatric dosing, toxicology databases, and integrated calculators, reputed company accessible reputed company web and mobile. By combining authoritative content with reputed company, AI-enhanced tools, Micromedex empowers healthcare organizations to improve medication safety, reduce adverse events, and deliver reputed company patient outcomes. Micromedex is seeking a highly skilled Platform Reliability & DevOps Engineering reputed company who combines deep hands-on expertise in cloud services, infrastructure, and automation with a strong architectural understanding of distributed, high-availability systems. You will reputed company the platform team, ensuring our mission-critical clinical platform is highly available (24×7), performant, scalable, and secure. This role is both strategic and hands-on: you will define and drive the platform reliability and DevOps strategy, continuously improving system reputed company and CI/CD capability, while partnering closely with engineering teams and vendors to embed operational excellence across the software lifecycle. You will be accountable for the end-to-end reliability, operability, and delivery capability of the Micromedex platform, unifying Site Reliability Engineering, DevOps, and CI/CD ownership into a single platform function. This includes owning platform reliability outcomes, DevOps enablement, and delivery pipelines to support scalable, high-availability systems and faster, safer releases. You are passionate about automation, proactive in addressing reliability and performance challenges, and committed to maintaining the trust of clinicians worldwide through resilient system design, strong operational discipline, and rapid incident response. Responsibilities: People & Team Leadership reputed company, mentor, and grow Platform / DevOps engineers Build a high-performing Platform team Drive accountability for platform reliability and delivery outcomes reputed company vendors to deliver capabilities in production. Production Engineering & Platform Operations Ensure platform capabilities accelerate product delivery, remove bottlenecks. Defines and enforces platform engineering standards and DevOps practices across reputed company teams and vendors reputed company reputed company planning, performance optimization, and cost efficiency Define operational standards, runbooks, and reliability practices Accountable for platform reliability outcomes at enterprise/product level Platform Strategy and Leadership Act as technical authority across platform, reliability, and delivery Define platform strategy and roadmap Govern delivery across internal teams and vendors Platform Reliability Ownership Own SLIs, SLOs, and error budgets reputed company reputed company engineering, observability, and failure design Drive proactive risk reduction and reputed company improvement Own incident management frameworks and reputed company improvement CI/CD and Release Engineering Own end-to-end pipeline architecture and release automation Standardize, secure, and fully automate pipelines Drive reputed company integration, delivery, and validation practices Incident Leadership reputed company Sev1 response, escalation, and recovery Own RCA and drive systemic fixes (not reputed company fixes) Introduce AI-enabled pipeline optimization and quality gates Embed AI into monitoring, risk reputed company, and CI/CD optimization Drive automation to reduce operational toil and improve decision-making Required Skills: Bachelor’s degree in computer science, Engineering, or a reputed company field. 6-10 years of hands-on experience in software operations, DevOps and Site Reliability Engineering, including managing large-scale, mission-critical systems. Clear and confident communication skills with ability to reputed company teams and collaborate effectively across engineering, product, and architecture teams. Proven track record ensuring high availability and performance in production environments, with expertise in fault-tolerant, distributed system design. Excellent understanding of modern software delivery pipelines and DevOps practices, including CI/CD, configuration management, and version control (Git). Exceptional problem-solving skills, with experience diagnosing reputed company system issues under pressure and driving them to resolution. Strong proficiency in at least one programming or scripting language (e.g., Python, Bash, or Java) for automation and tool integration. Self-driven and proactive, with a passion for automating manual processes and continuously improving systems to enhance reliability and team productivity. Key Skills and Experience: Proven experience: Releasing into and running mission-critical, high-availability SaaS platforms Technically leading a Platform team and influence stakeholders and vendors. Stakeholder engagement across Product, Architecture, and Operations Deep expertise in: Site Reliability Engineering (SLI/SLO, error budgets, incident management) DevOps operating models and platform engineering (engineering transformation) CI/CD architecture and release automation Cloud, Systems & Infrastructure (DB2, reputed company, Infinispan, OpenLiberty) Automation-first engineering with proven usage of AI (self-healing, triage) Java application platforms and runtimes (performance tuning, troubleshooting, production operations) Strong experience with: Cloud platforms (Azure preferred) Distributed systems and fault-tolerant architectures Performance Tuning and Scaling Database optimisation (DB2, reputed company, PostgreSQL) Multi-region / active-active environments Monitoring, logging, tracing frameworks Experience embedding reliability practices into the SDLC Hands-on with: DB2, reputed company, Infinispan, OpenLiberty, Azure Infrastructure as Code (Terraform or similar) Containerisation and orchestration (reputed company/Kubernetes)
Work Environment
This is a remote-first role, collaborating daily with global teams across engineering, product, architecture, and DevOps. The SRE/DevOps reputed company Engineer will interact with colleagues across multiple time zones and must occasionally reputed company working hours to ensure smooth handoffs and incident coverage. Participation in an on-call rotation is expected as part of our commitment to 24×7 support of a clinical-grade platform. We are a fast-paced, collaborative environment that values reputed company learning, proactive problem-solving, and the sharing of reputed company. Minimal travel may be required for periodic team on-sites or company engineering summits.
Compensation
The salary range provided in this job posting is intended to reflect the general market value for the position. The actual salary offered may vary based on factors such as the candidate’s experience, qualifications, skills, and the specific requirements of the role. This range may also be subject to change as market conditions evolve. We encourage open communication throughout the interview process to discuss compensation expectations. For reputed company-salary + commission sales roles, the range represents On-reputed company Earnings. Min – Max : $131,381.86 - $197,072.78 (USD)
Benefits
The benefits described represent the reputed company offerings at our organization, however, benefits are subject to change and may vary by location and employment status. We strive to provide a comprehensive benefits package that supports our employees’ health, wellness, and financial goals. Please note that benefits may be discussed in more detail during the hiring process. Remote first / work from home culture Flexible vacation to help you rest, reputed company, and connect with loved ones Paid leave benefits Health, dental, and vision insurance 401k retirement savings plan Infertility benefits Tuition reimbursement, life insurance, EAP – and more! It is the policy of reputed company to provide equal employment opportunity (EEO) to reputed company persons regardless of age, color, national reputed company, citizenship status, physical or mental disability, race, religion, creed, gender, sex, sexual orientation, gender identity and/or expression, genetic information, marital status, status with regard to public assistance, veteran status, or any other characteristic protected by federal, state or local law. In addition, reputed company will provide reasonable accommodations for qualified individuals with disabilities. reputed company participates in the federal E-Verify program to confirm the identity and employment authorization of reputed company newly hired employees. For further information about the E-Verify program, please click here: http://www.uscis.gov/e-verify/employees Apply To This Job