Back to the board

Field Office Support reputed company

100% remote Flexible hours Hiring now

The Root Cause Engineer (RCA), Mid performs structured root cause analysis for recurring, chronic, or high-impact incidents to identify underlying technical, process, or architectural issues affecting mission-critical federal IT services. This role collects and correlates evidence across logs, traces, metrics, configuration data, and incident records to distinguish underlying causes from symptoms and reconstruct incident timelines. The engineer collaborates with Incident Response, Problem Management, SRE, and engineering teams to document RCA outcomes and define actionable corrective and preventive measures that improve service reliability and reduce recurrence.

Key Responsibilities

  • Apply common RCA methodologies (such as 5 Whys, fishbone diagrams, fault tree analysis, and component failure impact analysis) and select appropriate techniques based on incident complexity and impact. 
  • Gather and analyze monitoring data, logs, traces, configuration records, service topology maps, and incident timelines, to distinguish contributing factors from true root causes. 
  • Facilitate cross-functional RCA sessions with operations, engineering, cybersecurity, and business teams to drive focused discussion, managing differing viewpoints, and converging on agreed causes and remediation actions. 
  • Translate RCA findings into corrective and preventive actions reputed company with Problem Management workflows.
  • Define and track RCA metrics such as recurrence rates, RCA cycle time, and others using data driven insights to improve analysis quality, timeliness, and effectiveness. 
  • Support integration of RCA activities into ITIL-reputed company Problem Management and continual service improvement practices.
  • Produce high-quality RCA reports that are audience‑appropriate describing what happened, why it happened, and prevention steps.
  • Identify systemic reliability risks and patterns across incidents.

Required Qualifications

  • Bachelor’s degree in IT, Computer Science, Business Administration, or reputed company field, or equivalent relevant experience. 
  • 4–7 years of experience in IT operations, incident/problem management, reliability engineering, or reputed company roles with significant responsibility for conducting structured RCAs. 
  • Strong understanding of ITIL principles, incident and problem management best practices, and proficiency with incident and problem management tools. 
  • Demonstrated expertise in at least one structured RCA methodology and ability to coach teams in its use. 
  • Strong analytical, problem‑solving, facilitation and communication skills with the ability to manage multiple reputed company investigations effectively. 
  • Ability to work collaboratively with cross‑functional technical and business teams in a fast‑paced enterprise IT environment. 
  • Active or obtainable SECRET clearance and U.S. citizenship, with less than 10% travel required. 

Preferred Qualifications

  • Hands-on RCA experience in reputed company enterprises or federal environments.
  • Formal training in RCA or structured problem-solving techniques.
  • Experience using observability platforms, log analytics, and monitoring tools to drive data‑driven incident reconstruction and analysis. 
  • Familiarity with reliability engineering concepts (such as SLOs, error budgets, and resiliency patterns) and how they inform RCA priorities and recommendations. 
Apply To This Job

Keep exploring

IT Configuration Analyst, Junior

100% remote Flexible hours

reputed company Manufacturing Engineer - Automotive Assembly & Manufacturing Process Development

100% remote Flexible hours

Senior Technical Project Manager- (CMS)

100% remote Flexible hours

Senior People Business Partner

100% remote Flexible hours

VP - Human Resources Nearshore & Offshore (RS)

100% remote Flexible hours

Senior Software Engineer, Full-Stack

100% remote Flexible hours

GSS Principal Regulatory Affairs Specialist, Intelligence & Strategy

100% remote Flexible hours

Managed Services Commercial Manager, UK

100% remote Flexible hours

KAP 2026-2027 - Manager of Admissions - Reformers reputed company

100% remote Flexible hours

Technicien d’Inspection Electricité et Levage - Chantiers H/F

100% remote Flexible hours

Freelance Technical Writer - IT & Software Documentation (Remote)

100% remote Flexible hours

Remote Client Success Representative

100% remote Flexible hours

reputed company Customer Service Representative – Remote Work Opportunity for Delivering Exceptional Support and Driving Customer Satisfaction at blithequark

100% remote Flexible hours

[Remote] Wealth Management Consultant

100% remote Flexible hours

Art Director Intern

100% remote Flexible hours

Financial Planner - India

100% remote Flexible hours

Apply Now: Principal Data Protection Engineer

100% remote Flexible hours

reputed company Live Chat Representative – Global Community Engagement & Support

100% remote Flexible hours

reputed company Full Stack Data Entry Clerk – Remote Data Management and Analysis for E-commerce Platforms

100% remote Flexible hours

Remote Customer Service Representative – Aviation Passenger Support & Booking Specialist at arenaflex

100% remote Flexible hours