Submit

Reliability-Centered Maintenance & Failure Analysis

Asset Management, EAM, Fleet

A systematic methodology for determining the optimal maintenance strategy per asset based on failure modes, consequences.

Problem class

Manufacturer-recommended maintenance schedules are conservative and generic. RCM identifies which failures actually matter, which maintenance tasks are effective, and which can be safely run-to-failure — optimizing maintenance investment versus risk.

Mechanism

Failure Mode and Effects Analysis (FMEA) identifies how each asset can fail and the operational, safety, and environmental consequences. For each failure mode, RCM determines the optimal strategy — condition-based monitoring, scheduled restoration, scheduled replacement, failure-finding task, redesign, or accept run-to-failure. Root cause analysis (RCA) investigates actual failures to identify systemic causes beyond immediate symptoms. Reliability analytics (Weibull analysis, survival curves) model failure probability over time, optimizing maintenance intervals.

Required inputs

  • Asset failure history data from work orders and incident records
  • FMEA worksheets per critical asset class
  • Consequence-of-failure classification (safety, environmental, operational)
  • Reliability statistical analysis tools and failure-distribution data

Produced outputs

  • Optimized maintenance strategy per asset class and failure mode
  • RCA reports identifying systemic failure causes with corrective actions
  • Reliability models predicting failure probability over time
  • Maintenance task optimization reducing unnecessary preventive work

Industries where this is standard

  • Aviation where RCM was invented (MSG-3 methodology for aircraft maintenance)
  • Nuclear power with safety-critical equipment reliability mandates
  • Oil and gas with process-safety management reliability requirements
  • Mining with heavy-equipment reliability programs for fleet optimization
  • Military and defense with reliability, availability, and maintainability standards

Counterexamples

  • Applying RCM to every asset regardless of criticality wastes analytical effort; RCM should target the 20% of assets responsible for 80% of downtime and maintenance cost.
  • Completing RCM analysis without implementing the recommended maintenance strategy changes produces engineering documents that sit on shelves without operational impact.

Representative implementations

  • RCM methodology originated from United Airlines' MSG-3 process, reducing aircraft maintenance costs by 25% while improving safety and reliability since the 1970s.
  • A refinery applied RCM to its top-50 critical equipment items, reducing maintenance cost by $4.2M annually while improving process-unit availability from 92% to 97%.
  • The US Navy's RCM program across submarine and surface-fleet equipment reduced preventive maintenance tasks by 40% while maintaining or improving equipment reliability.

Common tooling categories

RCM analysis platforms, FMEA worksheet tools, root cause analysis frameworks, and Weibull reliability modeling software.

Share:

Maturity required
High
acatech L5–6 / SIRI Band 4–5
Adoption effort
High
multi-quarter