Accendo Reliability

Your Reliability Engineering Professional Development Site

  • Home
  • About
    • Contributors
  • Reliability.fm
    • Speaking Of Reliability
    • Rooted in Reliability: The Plant Performance Podcast
    • Quality during Design
    • Way of the Quality Warrior
    • Critical Talks
    • Dare to Know
    • Maintenance Disrupted
    • Metal Conversations
    • The Leadership Connection
    • Practical Reliability Podcast
    • Reliability Matters
    • Reliability it Matters
    • Maintenance Mavericks Podcast
    • Women in Maintenance
    • Accendo Reliability Webinar Series
  • Articles
    • CRE Preparation Notes
    • on Leadership & Career
      • Advanced Engineering Culture
      • Engineering Leadership
      • Managing in the 2000s
      • Product Development and Process Improvement
    • on Maintenance Reliability
      • Aasan Asset Management
      • AI & Predictive Maintenance
      • Asset Management in the Mining Industry
      • CMMS and Reliability
      • Conscious Asset
      • EAM & CMMS
      • Everyday RCM
      • History of Maintenance Management
      • Life Cycle Asset Management
      • Maintenance and Reliability
      • Maintenance Management
      • Plant Maintenance
      • Process Plant Reliability Engineering
      • ReliabilityXperience
      • RCM Blitz®
      • Rob’s Reliability Project
      • The Intelligent Transformer Blog
      • The People Side of Maintenance
      • The Reliability Mindset
    • on Product Reliability
      • Accelerated Reliability
      • Achieving the Benefits of Reliability
      • Apex Ridge
      • Metals Engineering and Product Reliability
      • Musings on Reliability and Maintenance Topics
      • Product Validation
      • Reliability Engineering Insights
      • Reliability in Emerging Technology
    • on Risk & Safety
      • CERM® Risk Insights
      • Equipment Risk and Reliability in Downhole Applications
      • Operational Risk Process Safety
    • on Systems Thinking
      • Communicating with FINESSE
      • The RCA
    • on Tools & Techniques
      • Big Data & Analytics
      • Experimental Design for NPD
      • Innovative Thinking in Reliability and Durability
      • Inside and Beyond HALT
      • Inside FMEA
      • Integral Concepts
      • Learning from Failures
      • Progress in Field Reliability?
      • R for Engineering
      • Reliability Engineering Using Python
      • Reliability Reflections
      • Testing 1 2 3
      • The Manufacturing Academy
  • eBooks
  • Resources
    • Accendo Authors
    • FMEA Resources
    • Feed Forward Publications
    • Openings
    • Books
    • Webinars
    • Journals
    • Higher Education
    • Podcasts
  • Courses
    • 14 Ways to Acquire Reliability Engineering Knowledge
    • Reliability Analysis Methods online course
    • Measurement System Assessment
    • SPC-Process Capability Course
    • Design of Experiments
    • Foundations of RCM online course
    • Quality during Design Journey
    • Reliability Engineering Statistics
    • Quality Engineering Statistics
    • An Introduction to Reliability Engineering
    • Reliability Engineering for Heavy Industry
    • An Introduction to Quality Engineering
    • Process Capability Analysis course
    • Root Cause Analysis and the 8D Corrective Action Process course
    • Return on Investment online course
    • CRE Preparation Online Course
    • Quondam Courses
  • Webinars
    • Upcoming Live Events
  • Calendar
    • Call for Papers Listing
    • Upcoming Webinars
    • Webinar Calendar
  • Login
    • Member Home

by James Kovacevic Leave a Comment

Linking Failure Codes To A Proactive Maintenance Strategy

Using Failure Data to Drive Sustainable Improvements

If you are lucky enough to have good failure data history in your CMMS, you are one of the few.  But even if you have the data, can you use it to make a difference to your organization?  Obviously, the data can be used to perform certain reliability engineering analyses, but what can those without reliability engineering experience do with the data?

Bad Actor / Pareto Analysis

Two simple analysis that anyone can use with their failure history are;

  • A Bad Actor analysis is identifying equipment that is experiencing repetitive failures.  The Bad Actor analysis focuses on the frequency of failure only.
  • Pareto Analysis is an analysis that identifies what equipment are contributing to any key factor such as unplanned downtime, total downtime, the number of failures, the cost of maintenance, the cost of lost production, etc.

Either of these analyses can be used based on the failure data collected.  A unique way of using these analyses is instead of using them to look at equipment, use them for the failure mode (which is the object code + the damage code + the cause code).   By using the analysis with the failure modes, you may be able to identify underlying issues that are impacting your entire operation.   For example, if you find that Bearing Overheating Improper Lubrication is the most prevalent failure mode, then look at the lubrication program across the site.  It may be that the bearings are not being lubricated or are being over lubricated.

By taking this approach, addressing an issue on one asset can impact many assets across the site. However, while these analysis techniques can be used to identify issues and causes, they may not be enough to validate the effectiveness of your maintenance strategy.

Linking Failures to Maintenance Strategies

Over the past year or two, there has been some discussion around linking the failure codes from your CMMS to the maintenance strategy development technique (RCM, FMEA, MTA, PMO, RCM Blitz).  Many software packages do not enable this linkage to be performed.   I have heard of a few approaches, but not having done this before (although I fully see the value), I decided to build out a way to do this without any special software, that is simple enough for any organization to use.  By utilizing this approach, any organization can not only look to see if their Maintenance Strategy is effective but to also identify the gap in it.

Codifying the Maintenance Activities

The first step in being able to link the failure codes to a maintenance activity is to go through all of the maintenance activities and codify the failure mode each task is addressing.  For example, a PM activity to take a measurement of a conveyor slate is in place to monitor the wear on the conveyor.   Looking at this task, the failure mode it is trying to address is Conveyor Slate Worn   Normal Wear.

This activity needs to be completed for each specific task covered in the RCM, FMEA, or other analysis.   Ideally, a column would be added to the analysis to hold this Failure Mode.

An additional benefit to performing this activity is that it will identify gaps in the failure coding (master library).  By updating the master library, more accurate data will be provided, and there will be less “other” or free text codes provided by the frontline staff.

Comparing Failure Data to Maintenance Activities

With all of the maintenance activities codified, the comparison can now begin.  When comparing the failure data to the maintenance activities, start at the asset level.  Trying to do this across many assets or areas can be overwhelming, so start small.   There are a few ways in which the data can be compared;

  • Maintenance Activities Not Present (MANP) – Identify any failure modes from the failure data that are not present in the maintenance activities’ codes.    MANP identifies a gap in the maintenance strategy.
  • Maintenance Activities Not Frequent (MANF) – Identify any frequent failure modes from the failure data that are not very frequent in the maintenance activities codes.  MANF identifies a potential gap in the maintenance strategy.
  • Failure Data Not Present (FDNP) – Identify any failure modes in the maintenance activities codes that are not present in the failure data.  FDNP identifies over maintaining equipment and wasted resources.
  • Failure Data Not Frequent (FDNF) – Identify any frequent failure modes in the maintenance activities’ codes that are not frequent in the failure data.  FDNF identified a potential over maintaining equipment and potentially wasted resources.

These four relationships can provide some great insights to not only the effectiveness of the maintenance strategy but also the efficiency of it.

Addressing the Gaps Between Failure Data & Maintenance Activities

With the comparison complete, the gaps can start to be addressed.  As with all analysis, start addressing the gaps which would provide the greatest return for the least amount of input.   Each of the gaps above can be addressed;

  • MANP – Perform an RCM, FMEA, etc. analysis to address the missing activities via maintenance activities or other activities such as redesign.
  • MANF – Review the RCM, FMEA, etc. analysis to identify potential gaps or opportunities to improve the existing maintenance strategy based on the frequent failure data failure modes.
  • FDNP – Review all follow-up work generated from PMs, PdMs, etc.  to see if the specific failure modes have been caught before they resulted in a functional failure.  If the maintenance strategy is not generating follow-up work, perform a risk analysis to remove the maintenance activity.  If it is generating follow-up work, perform a Weibull analysis to determine the optimum frequency of the maintenance activity
  • FDNF – Review all follow-up work generated from PMs, PdMs, etc.  to see if the specific failure modes have been caught before they resulted in a functional failure.  If the maintenance strategy is generating the occasional follow-up work, perform a Weibull analysis to determine the optimum frequency of the maintenance activity.

By linking the failure data to the maintenance activities, any organization can determine how effective and efficient their maintenance program is.  This will allow the organization to achieve the balance between asset performance, cost, and risk.

On a side note, be sure to track any changes made to the maintenance program based on this analysis.  Identify the cost savings attributed to a more effective (avoid unplanned downtime) or a more efficient (reduction in PM workload) maintenance program.  You might just be surprised at the return on this analysis.

In Summary to link the failure data to maintenance activities you must;

  1. Codify the maintenance activities
  2. Compare failure data to maintenance activities
  3. Identify gaps between the failure data and maintenance activities
  4. Address the gaps between the failure data and maintenance activities
  5. Perform the accepted analysis (RCM, FMEA, etc.) to address any missing failure modes from the analysis
  6. Update the master library with any missing failure codes
  7. Update maintenance activities with findings
  8. Repeat!

Do you link your failure data to your maintenance activities?  How do you perform the linkage and analysis?   If you don’t current link the two together, how do identify gaps in your maintenance program?

Remember, to find success; you must first solve the problem, then achieve the implementation of the solution, and finally sustain winning results.

I’m James Kovacevic
Eruditio, LLC
Where Education Meets Application
Follow @EruditioLLC

References;

  • RCM2 By John Moubray
  • The Basics of FMEA by Raymond J. Mikulak, Robin McDermott, Michael Beauregard
  • The What & More Importantly, The Why of the Weibull Analysis
  • Reliability Centered Maintenance using… RCM Blitz by Doug Plucknette
  • RCM Reengineered by Jesus R. Sifonte, James V. Reyes-Picknell

 

Filed Under: Articles, Maintenance and Reliability, on Maintenance Reliability

About James Kovacevic

James is a trainer, speaker, and consultant that specializes in bringing profitability, productivity, availability, and sustainability to manufacturers around the globe.

Through his career, James has made it his personal mission to make industry a profitable place; where individuals and manufacturers possess the resources, knowledge, and courage to sustainably lower their operating costs.

« SCRM VUCA
Inspired By Art to Learn a Trade – Lieutenant John Grieco RFD »

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Maintenance & Reliability series


by James Kovacevic
High Performance Reliability

Join Accendo

Receive information and updates about articles and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

Recent Articles

  • test
  • test
  • test
  • Your Most Important Business Equation
  • Your Suppliers Can Be a Risk to Your Project

© 2025 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy