Accendo Reliability

Your Reliability Engineering Professional Development Site

  • Home
  • About
    • Contributors
  • Reliability.fm
    • Speaking Of Reliability
    • Rooted in Reliability: The Plant Performance Podcast
    • Quality during Design
    • Way of the Quality Warrior
    • Critical Talks
    • Dare to Know
    • Maintenance Disrupted
    • Metal Conversations
    • The Leadership Connection
    • Practical Reliability Podcast
    • Reliability Matters
    • Reliability it Matters
    • Maintenance Mavericks Podcast
    • Women in Maintenance
    • Accendo Reliability Webinar Series
  • Articles
    • CRE Preparation Notes
    • on Leadership & Career
      • Advanced Engineering Culture
      • Engineering Leadership
      • Managing in the 2000s
      • Product Development and Process Improvement
    • on Maintenance Reliability
      • Aasan Asset Management
      • AI & Predictive Maintenance
      • Asset Management in the Mining Industry
      • CMMS and Reliability
      • Conscious Asset
      • EAM & CMMS
      • Everyday RCM
      • History of Maintenance Management
      • Life Cycle Asset Management
      • Maintenance and Reliability
      • Maintenance Management
      • Plant Maintenance
      • Process Plant Reliability Engineering
      • ReliabilityXperience
      • RCM Blitz®
      • Rob’s Reliability Project
      • The Intelligent Transformer Blog
      • The People Side of Maintenance
      • The Reliability Mindset
    • on Product Reliability
      • Accelerated Reliability
      • Achieving the Benefits of Reliability
      • Apex Ridge
      • Metals Engineering and Product Reliability
      • Musings on Reliability and Maintenance Topics
      • Product Validation
      • Reliability Engineering Insights
      • Reliability in Emerging Technology
    • on Risk & Safety
      • CERM® Risk Insights
      • Equipment Risk and Reliability in Downhole Applications
      • Operational Risk Process Safety
    • on Systems Thinking
      • Communicating with FINESSE
      • The RCA
    • on Tools & Techniques
      • Big Data & Analytics
      • Experimental Design for NPD
      • Innovative Thinking in Reliability and Durability
      • Inside and Beyond HALT
      • Inside FMEA
      • Integral Concepts
      • Learning from Failures
      • Progress in Field Reliability?
      • R for Engineering
      • Reliability Engineering Using Python
      • Reliability Reflections
      • Testing 1 2 3
      • The Manufacturing Academy
  • eBooks
  • Resources
    • Accendo Authors
    • FMEA Resources
    • Feed Forward Publications
    • Openings
    • Books
    • Webinars
    • Journals
    • Higher Education
    • Podcasts
  • Courses
    • 14 Ways to Acquire Reliability Engineering Knowledge
    • Reliability Analysis Methods online course
    • Measurement System Assessment
    • SPC-Process Capability Course
    • Design of Experiments
    • Foundations of RCM online course
    • Quality during Design Journey
    • Reliability Engineering Statistics
    • Quality Engineering Statistics
    • An Introduction to Reliability Engineering
    • Reliability Engineering for Heavy Industry
    • An Introduction to Quality Engineering
    • Process Capability Analysis course
    • Root Cause Analysis and the 8D Corrective Action Process course
    • Return on Investment online course
    • CRE Preparation Online Course
    • Quondam Courses
  • Webinars
    • Upcoming Live Events
  • Calendar
    • Call for Papers Listing
    • Upcoming Webinars
    • Webinar Calendar
  • Login
    • Member Home

by Doug Plucknette 2 Comments

Equipment Reliability Depends On Understanding Failure Modes!

Equipment Reliability Depends On Understanding Failure Modes!

One of the curses of Reliability Centered Maintenance and Root Cause Analysis is a great Facilitator makes the process look easy. So easy in fact that after watching a good facilitator lead one event we’d like to believe we could jump right up and facilitate the next one with no training or assistance. In Mentoring RCM Blitz™ facilitators during the certification process we work to perfect 2 dozen skills with the hope of turning out great facilitators. The skill we typically spend the most time working on is writing failure modes.

While there are many places that new RCM facilitators struggle when it comes to facilitating a thorough and useful RCM analysis, most errors in the process start at the failure mode level. Writing good failure modes requires an expert level of understanding of hundreds of different of components. What is the component intended to do (Function) and what are the ways that this component can functionally fail (Failure Modes).

Over the past 20 years I have tried a couple of different ways to teach how to write good failure modes. In performing hundreds of RCM Blitz™ analyses with different facilitators, and practitioners for companies around the world we have come to understand that good failure modes should be written in three parts.

  1. Part
  2. Problem
  3. Specific Cause of Failure

As an example: Cooling water pump bearing (Part) seizes (Problem) due to lack of lubrication (Specific Cause of Failure)

The part is the location or source of where the failure mode begins. Looking at the cooling water pump listed above, a rookie facilitator might be tempted to say the cooling water pump failed. While this is true, where did the failure begin? It began when the bearing was not lubricated.

The problem portion of the three-part failure mode is the undesired condition that results from specific cause of failure. If we neglect to lubricate the cooling pump bearing it will vibrate, heat up and eventually seize. While the bearing has been failing for some time when it seizes we now have a problem.

The third part of a good failure mode is the specific cause of failure. As we write each failure mode we should recognize that the purpose of RCM is to develop a task that will be applicable and effective at mitigating the cause. If we don’t get the specific cause written at the correct level your team will never select or develop a good mitigating task. Again, with the end in mind if we miss the specific cause the outcome of your analysis will surely miss at eliminating or mitigating the failure mode.

So, what exactly is a specific cause of failure? This is where experience in Root Cause Analysis or Cause Mapping becomes extremely valuable. Failure Modes are all about understanding the relationship between cause and effect. The trick is to learn to discuss each failure mode at a level where a sound maintenance task can mitigate or eliminate the failure mode. To understand this let’s go back to the cooling water pump.

Cooling Tower Pump Fails – Some would consider this a failure mode, I would not. It only contains two pieces of a three part failure mode, the pump and at a high level, and the problem. How would one mitigate this failure? Is there a maintenance task to detect, reduce or eliminate this failure mode? Would this task be applicable and effective in detecting, eliminating or mitigating this failure mode? Being honest, this failure mode is nearly useless. The only way to deal with this failure mode is to replace the pump.

Cooling Tower Pump Bearing Fails – Again, just two parts here, there is not enough information here to make a sound task decision. Some would say that we could perform vibration analysis and detect the bearing failure. While in most cases this might be true, without knowing the specific cause we cannot be sure. In many cases, there are specific causes of failure where vibration analysis is clearly not the best task for mitigating the failure mode. As an example, I don’t want to use vibration analysis to tell me that we have not lubricated a bearing.

Cooling tower pump bearing seizes due to improper lubrication – While we have three parts here, how do I deal with this specific cause of failure? What does improper lubrication mean? There could be several specific causes buried within this one failure mode. For instance, improper lubrication could mean, too much lubrication, not enough lubrication, the incorrect type of lubrication, lubrication at the incorrect interval. It is extremely important to remember we need to have the specific cause written at a level where we know the maintenance task will be both applicable and effective in eliminating the failure mode. Each of the separate causes listed in regard to lubrication would result in a different mitigating task. Combine the causes and we now risk missing a failure mode and a task.

Remember, the failure modes we identify are the key to developing our complete maintenance strategy and most important failure modes to identify the failures that result from the context and environment in which we operate our equipment.

More information on writing good failure modes can be found in my book Reliability Centered Maintenance using RCM Blitz™.

In closing, I would love to hear your comments regarding failure modes.  Did this article help you to better understand the process of identifying good failure modes? Have you ever completely missed a failure modes that resulted in the failure reoccurring several times before it was properly identified?

Doug Plucknette, Principal & World-Wide RCM Discipline Leader at Allied Reliability Group is a Reliability Engineering Consultant and Published Author of “Reliability Centered Maintenance using RCM Blitz™ and Clean, Green and Reliable. Having created the RCM Blitz™ Methodology he has been an RCM Practitioner and Trainer for over 20 years. Doug resides in Spencerport, NY and can be reached at plucknetted@alliedreliability.com

Filed Under: Articles, on Maintenance Reliability, RCM Blitz

About Doug Plucknette

Doug Plucknette joined Allied Reliability Group as RCM Discipline Leader in 2007. As the founder of RCM Blitz® and author of the book Reliability Centered Maintenance using RCM Blitz®, he has provided reliability training and services to numerous companies around the world, large and small, including such Fortune 500 companies as Cargill, Whirlpool, Honda, Coors Brewing, Energizer, Corning, Invista, and Newmont Mining.

« A 2-Step Process to Improve Plant Performance
The “Mid-Life Crisis” Use-Case »

Comments

  1. Thiago Lima says

    June 6, 2018 at 2:13 AM

    Dear Reliability Engineer,

    This article was one of the best ones I’ve ever read about RCM process.

    In addition, I would also like to comment about the property of giving us this powerful and useful tool to help other’s struggle.

    It will be always a pleasure to read issues like that. Furthermore, I would like to buy this book in a near run.

    Thank you very much!

    My best regards,

    Thiago from Brazil.

    Reply
  2. John says

    March 21, 2019 at 5:37 AM

    Hi Doug,
    An interesting post discussing equipment reliability, failure modes and RCM process. Well, if we talk about equipment failure then yes it’s very important to figure out the failure pattern and then follow steps to avoid equipment failure, downtime and increase their reliability. We need to implement the right lubrication program and regular maintenance of equipment for max reliability. Quality filtration systems, effective lubrication plan, and best oil handling practice we need at the facility lube room! Thanks for sharing these guidelines and Valuable Book!
    Regards,

    Reply

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

RCM Blitz® series
by Doug Plucknette

Join Accendo

Receive information and updates about articles and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

Recent Articles

  • test
  • test
  • test
  • Your Most Important Business Equation
  • Your Suppliers Can Be a Risk to Your Project

© 2025 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy