Accendo Reliability

Your Reliability Engineering Professional Development Site

  • Home
  • About
    • Contributors
  • Reliability.fm
    • Speaking Of Reliability
    • Rooted in Reliability: The Plant Performance Podcast
    • Quality during Design
    • Way of the Quality Warrior
    • Critical Talks
    • Dare to Know
    • Maintenance Disrupted
    • Metal Conversations
    • The Leadership Connection
    • Practical Reliability Podcast
    • Reliability Matters
    • Reliability it Matters
    • Maintenance Mavericks Podcast
    • Women in Maintenance
    • Accendo Reliability Webinar Series
  • Articles
    • CRE Preparation Notes
    • on Leadership & Career
      • Advanced Engineering Culture
      • Engineering Leadership
      • Managing in the 2000s
      • Product Development and Process Improvement
    • on Maintenance Reliability
      • Aasan Asset Management
      • AI & Predictive Maintenance
      • Asset Management in the Mining Industry
      • CMMS and Reliability
      • Conscious Asset
      • EAM & CMMS
      • Everyday RCM
      • History of Maintenance Management
      • Life Cycle Asset Management
      • Maintenance and Reliability
      • Maintenance Management
      • Plant Maintenance
      • Process Plant Reliability Engineering
      • ReliabilityXperience
      • RCM Blitz®
      • Rob’s Reliability Project
      • The Intelligent Transformer Blog
      • The People Side of Maintenance
      • The Reliability Mindset
    • on Product Reliability
      • Accelerated Reliability
      • Achieving the Benefits of Reliability
      • Apex Ridge
      • Metals Engineering and Product Reliability
      • Musings on Reliability and Maintenance Topics
      • Product Validation
      • Reliability Engineering Insights
      • Reliability in Emerging Technology
    • on Risk & Safety
      • CERM® Risk Insights
      • Equipment Risk and Reliability in Downhole Applications
      • Operational Risk Process Safety
    • on Systems Thinking
      • Communicating with FINESSE
      • The RCA
    • on Tools & Techniques
      • Big Data & Analytics
      • Experimental Design for NPD
      • Innovative Thinking in Reliability and Durability
      • Inside and Beyond HALT
      • Inside FMEA
      • Integral Concepts
      • Learning from Failures
      • Progress in Field Reliability?
      • R for Engineering
      • Reliability Engineering Using Python
      • Reliability Reflections
      • Testing 1 2 3
      • The Manufacturing Academy
  • eBooks
  • Resources
    • Accendo Authors
    • FMEA Resources
    • Feed Forward Publications
    • Openings
    • Books
    • Webinars
    • Journals
    • Higher Education
    • Podcasts
  • Courses
    • 14 Ways to Acquire Reliability Engineering Knowledge
    • Reliability Analysis Methods online course
    • Measurement System Assessment
    • SPC-Process Capability Course
    • Design of Experiments
    • Foundations of RCM online course
    • Quality during Design Journey
    • Reliability Engineering Statistics
    • Quality Engineering Statistics
    • An Introduction to Reliability Engineering
    • Reliability Engineering for Heavy Industry
    • An Introduction to Quality Engineering
    • Process Capability Analysis course
    • Root Cause Analysis and the 8D Corrective Action Process course
    • Return on Investment online course
    • CRE Preparation Online Course
    • Quondam Courses
  • Webinars
    • Upcoming Live Events
  • Calendar
    • Call for Papers Listing
    • Upcoming Webinars
    • Webinar Calendar
  • Login
    • Member Home

by nomtbf Leave a Comment

Please don’t remove MTBF part 1

Please don’t remove MTBF, part 1

A forum post recently correctly found two of my many arguments for the eradication of MTBF incorrect or invalid. Maybe the author (HL) has a valid point. Let’s take a closer look at the note and the writer’s reasoning.

“MTBF is not useful”

The first argument in HL’s note refutes that MTBF is not useful. He cites the definition of MTBF as being a mean (statistical average or indication of central tendency). This is true, as MTBF is the first moment of the exponential distribution. Additionally, for those rare cases when you desire to know the point in time when 63% of the items have failed, the MTBF value is the go-to value.

Or is it?

The underlying assumption is that the rate of failure is constant. That assumption is the primary reason I find MTBF useless. I know of very few items that truly have a constant failure rate. Furthermore, and more directly regarding the notion that MTBF is the mean of a distribution, it is rare that the mean value is useful alone. For MTBF is a single parameter distribution, yet the comparison to other normally distributed measures implies that the first moment alone is sufficient to make decisions. Even the desire to know if a sample of students in a class with an average height above 2 meters is meaningful to conclude the population of all students also have an average height above 2 meters. We need the variance term to make a convincing judgment.

MTBF is a mean value of times to failure data. If I review some field return data and calculate the MTBF it is pretty straightforward. One needs to just tally the total time all units have been in operation and divide by the number of failures. It is an unbiased estimator of the first moment (mean) of the exponential distribution. Now lets say we want to recommend a maintenance time period (like 2 years, or 20k miles) such that we could improve the reliability of the system with regular maintenance.

Keep in mind that MTBF has the interesting property of being memory-less. The value MTBF is the 1/MTBF chance per hour that an item will fail. This is totally unrelated, and not conditional on the age of the item. This is very accurate when the item only experiences failure at a truly constant rate and failure is totally random. Further, even if we set a time period, say 2 years, then what changes? If we replace the item with a similar item, even if its brand new, the chance of failure the next hour is still the same value: 1/MTBF. No improvement and the very real possibility of damage during the maintenance activity.

So even with the expense of doing a replacement with a new item, there is no improvement in the system reliability. The only maintenance approach that makes any sense, for an item which is accurately represented by MTBF, is to replace the item when it fails. No other approach makes sense to me.

The last part of the first refutation indicates that if the MTBF value is changing over different time periods of consideration, then using a distribution which includes the rate of change would be more accurate. Yet, for complex systems with relatively constant failure rates over the duration of interest MTBF is “a quite good estimate”. Given the ease of using the appropriate math for reliability statistics, I wonder if a short study where we compare the results using both methods would reveal the same answers? In the many situations where I’ve been asked to review reliability data the comparison has been stark. In the past, using MTBF, ‘mistakes were made’, becomes the general conclusion. If maintenance costs, downtime, inventory costs, and customer satisfaction are of little concern, then go ahead, use MTBF and the ‘good enough’ approach. If you want to understand the reliability of your product, save money and time, improve availability and enjoy the praise of happy customers – then do the math and do it right.

I’m interested in anyone’s ability to use MTBF in a beneficial way. Please write to me and let me know– how you do it? What are some situations where using MTBF is the best and least error prone method?

HL does have a second argument he refutes – let’s explore that next week.

Filed Under: Uncategorized

« K Out of N
Reliability Goals »

Comments

  1. Chet Haibel says

    June 8, 2013 at 5:59 PM

    Fred: The above intends to make some good points, but fails to distinguish between the failure rate, which follows an exponential distribution, and the hazard rate, which is constant. MTBF is not 1/failure rate, it is 1/hazard rate. Then by erroneously citing failure rate in a number of places, you find all kinds of things wrong with MTBF.

    Also, no one schooled in Reliability Centered Maintenance would proactively replace working components (or subsystems) unless their Weibull Beta is 2 or higher. Only a very uninformed person would proactively replace components (or subsystems) with a constant hazard rate.

    Reply
    • Fred Schenkelberg says

      June 8, 2013 at 6:17 PM

      Hi Chet,

      Guilty and I did and have often confused the two terms. Maybe you could draft up a short article (blog post) for the NoMTBF site on the difference between hazard rate and failure rate and help us all understand and use the proper terminology. Also, what could go wrong if we use the concept of constant failure rate when that is not what the math means?

      cheers,

      Fred

      Reply

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

[popup type="" link_text="Get Weekly Email Updates" link_class="button" ]

[/popup]

The Accendo Reliablity logo of a sun face in circuit

Please login to have full access.




Lost Password? Click here to have it emailed to you.

Not already a member? It's free and takes only a moment to create an account with your email only.

Join

Your membership brings you all these free resources:

  • Live, monthly reliability webinars & recordings
  • eBooks: Finding Value and Reliability Maturity
  • How To articles & insights
  • Podcasts & additional information within podcast show notes
  • Podcast suggestion box to send us a question or topic for a future episode
  • Course (some with a fee)
  • Largest reliability events calendar
  • Course on a range of topics - coming soon
  • Master reliability classes - coming soon
  • Basic tutorial articles - coming soon
  • With more in the works just for members
Speaking of Reliability podcast logo

Subscribe and enjoy every episode

RSS
iTunes
Stitcher

Join Accendo

Receive information and updates about podcasts and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

Dare to Know podcast logo

Subscribe and enjoy every episode

RSS
iTunes
Stitcher

Join Accendo

Receive information and updates about podcasts and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

Accendo Reliability Webinar Series podcast logo

Subscribe and enjoy every episode

RSS
iTunes
Stitcher

Join Accendo

Receive information and updates about podcasts and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

Recent Articles

  • test
  • test
  • test
  • Your Most Important Business Equation
  • Your Suppliers Can Be a Risk to Your Project

© 2025 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy