The Fractal Solutions Blog

Function and Failure Modes - Important Factors in Reliability Centered Maintenance (RCM) Part II

Feb 15, 2018 2:00:00 PM / by Larry Johnson

refinery-3127588_1280In our previous blog, Function and Failure Modes - Important Factors in Reliability Centered Maintenance, we looked at the first two factors Equipment Function, and Function Failure. Now let’s look at Failure Modes, their effects and consequences.

Failure Modes

As much as we don’t want to hear it, assets can fail in multiple ways. Historically when an asset had a problem, we just said it had failed, whereas with Reliability Centered  Maintenance we are able to clearly differentiate between the functional failure and the failure modes (the events that trigger a failure).

When we breakdown these failure modes, we have to consider 3 categories:

The first is when the asset’s performance is reduced to below our optimal value and is no longer fulfilling its function. This can be anything from wear and tear, to lack of lubrication to human errors to dust or dirt.

The second is when the operating context requires more from the asset and the performance has to increase. This can make the asset wear out faster due to increased ‘stress’ on it or the asset is unable to meet the requirements or results.

The third, is when the asset was never actually capable of delivering the required performance. This could be due to a deficient design or a failure in defining the asset’s requirements when acquiring the asset.

In total, we’d expect between one and thirty failure modes per functional failure. Thus, we need to be careful with our level of analysis, and tackle the more likely failure modes; including ones from the failure history (if there is any), the failure modes that are currently being managed by an maintenance plans, and potential failures that haven’t occurred yet- but could in the future.

Failure Mode and Effects

In the event of a failure mode occurring, it is rarely an isolated event. Failure mode and effects define what happens to the asset in the operating context, when the failure mode or modes occur. It is important to include any information related to the following:

  • Any damage/s from the failure, be that to the asset or ‘knock on’ effects.
  • Repair steps to put the asset back into the operation
  • Safety threats to either the environment or staff and how it could affect either
  • Operational impact (e.g. delays or defective goods)
  • Evidence of the failure (Including any visible signs such as an alert or alarm)

Consequences of the Failure

In the failure mode effects, we identify facts corresponding with the failure modes. The next stage is analyzing the failure consequences, performing a thorough qualitative evaluation which determines the significance of the failure mode/s for the overall operation. Upon completion of the analysis, we can learn the severity of the failure. The next steps are dependent on the type of function we are analyzing, functions that are either evident or hidden.

For evident functional failures, the failure can be detected by the staff member, and has varying levels of importance, with human and environmental safety being the top. Next we’d look at operational issues with higher costs related to quality and/or productivity. And finally we’d examine non-operational consequences with the cost related to just the asset repair.

For hidden functional failures, as the name suggests, the cause of the failure isn’t obvious straight away and cannot be detected by staff members. This is normally true in safety or protective devices. Hidden functional failures are much more complex to deal with, making the RCM analysis tougher and assets usually include protective devices to minimize consequences of different failures.

When the failed state can be detected in the protective device, it can considered a fail-safe device and the RCM analysis is less vital. When the failed state cannot be detected, we trust the device will act when something goes wrong, however, the part may have been in a failure state for a while and won’t be noticed until it’s needed.

This creates the need for an in-depth analysis that involves using the failure probability of the protective device/s and other extraneous information to help determine how to improve the safety of the system.


Here at Fractal Solutions we can help you with your Reliability Centered Maintenanceand Asset Management, to discuss your plant, company or site’s needs contact us here:

Contact an Asset Management Specialist

Fractal Solutions is an Asset Management and Reliability consulting company. We specialize in program startup, facilitation, staffing, and training.

Written by Larry Johnson

Larry Johnson is the founder and president of Fractal Solutions.  He is an internationally known consultant, lecturer, and trainer in asset management and equipment reliability programs.

 Connect with Larry on LinkedIn


Subscribe to Email Updates