Accendo Reliability

Your Reliability Engineering Professional Development Site

  • Home
  • About
    • Contributors
  • Reliability.fm
    • Speaking Of Reliability
    • Rooted in Reliability: The Plant Performance Podcast
    • Quality during Design
    • Way of the Quality Warrior
    • Critical Talks
    • Dare to Know
    • Maintenance Disrupted
    • Metal Conversations
    • The Leadership Connection
    • Practical Reliability Podcast
    • Reliability Matters
    • Reliability it Matters
    • Maintenance Mavericks Podcast
    • Women in Maintenance
    • Accendo Reliability Webinar Series
  • Articles
    • CRE Preparation Notes
    • on Leadership & Career
      • Advanced Engineering Culture
      • Engineering Leadership
      • Managing in the 2000s
      • Product Development and Process Improvement
    • on Maintenance Reliability
      • Aasan Asset Management
      • AI & Predictive Maintenance
      • Asset Management in the Mining Industry
      • CMMS and Reliability
      • Conscious Asset
      • EAM & CMMS
      • Everyday RCM
      • History of Maintenance Management
      • Life Cycle Asset Management
      • Maintenance and Reliability
      • Maintenance Management
      • Plant Maintenance
      • Process Plant Reliability Engineering
      • ReliabilityXperience
      • RCM Blitz®
      • Rob’s Reliability Project
      • The Intelligent Transformer Blog
      • The People Side of Maintenance
      • The Reliability Mindset
    • on Product Reliability
      • Accelerated Reliability
      • Achieving the Benefits of Reliability
      • Apex Ridge
      • Metals Engineering and Product Reliability
      • Musings on Reliability and Maintenance Topics
      • Product Validation
      • Reliability Engineering Insights
      • Reliability in Emerging Technology
    • on Risk & Safety
      • CERM® Risk Insights
      • Equipment Risk and Reliability in Downhole Applications
      • Operational Risk Process Safety
    • on Systems Thinking
      • Communicating with FINESSE
      • The RCA
    • on Tools & Techniques
      • Big Data & Analytics
      • Experimental Design for NPD
      • Innovative Thinking in Reliability and Durability
      • Inside and Beyond HALT
      • Inside FMEA
      • Integral Concepts
      • Learning from Failures
      • Progress in Field Reliability?
      • R for Engineering
      • Reliability Engineering Using Python
      • Reliability Reflections
      • Testing 1 2 3
      • The Manufacturing Academy
  • eBooks
  • Resources
    • Accendo Authors
    • FMEA Resources
    • Feed Forward Publications
    • Openings
    • Books
    • Webinars
    • Journals
    • Higher Education
    • Podcasts
  • Courses
    • 14 Ways to Acquire Reliability Engineering Knowledge
    • Reliability Analysis Methods online course
    • Measurement System Assessment
    • SPC-Process Capability Course
    • Design of Experiments
    • Foundations of RCM online course
    • Quality during Design Journey
    • Reliability Engineering Statistics
    • Quality Engineering Statistics
    • An Introduction to Reliability Engineering
    • Reliability Engineering for Heavy Industry
    • An Introduction to Quality Engineering
    • Process Capability Analysis course
    • Root Cause Analysis and the 8D Corrective Action Process course
    • Return on Investment online course
    • CRE Preparation Online Course
    • Quondam Courses
  • Webinars
    • Upcoming Live Events
  • Calendar
    • Call for Papers Listing
    • Upcoming Webinars
    • Webinar Calendar
  • Login
    • Member Home

by nomtbf Leave a Comment

High MTBF with Low Reliability

High MTBF with Low Reliability

Can You Have a High MTBF and Low Reliability?

As regular readers know, MTBF by itself is misleading. When representing actual data it can be deceptive as well. Just because you have a high MTBF value doesn’t mean it is reliable.

In a previous article, 10 Reasons to Avoid MTBF, I mentioned that it is possible to have a relatively high MTBF value when the actual reliability is low. Ashley sent me the following note:

Hi Fred, i love reading your articles they are very informative. I have a question about something you said in a comment which i am hoping you will be able to clarify for me. You said products with higher MTBF can actually be less reliable than products with a lower MTBF

I have tried to find information on how this is possible online, and tried to do the maths myself to make this happen but i have to admit i am struggling.

No worries, Ashley, let’s work out an example to illustrate what I meant.

A Sample Set of Data

Let’s create an example data set with a decreasing hazard rate. I used R and the command of

round(rweibull(10,0.5,500))

This provided a set of 10 values drawn at random from a Weibull distribution with a beta = 0.5 and eta = 500. The values are:

56, 5, 2559, 1147, 486, 931, 1, 1166, 786, 2.

Let’s say this is in hour of operation till failure from a set of 10 motors. We have complete data, no censoring, nice and simple.

The MTBF Value

Let’s calculate the MTBF of these items. You may argue we should calculate MTTF here, since we are not repairing the motor, yet the calculation is the same.

We would like to know as we are considering buying a new type of motor if the measured reliability (MTBF) is below the manufacturers claim of 500 hours MTBF. The use of these motors is for 168 hour (1 week) runs and we’d like to maintain a relatively high reliability over 168 hours.

The classic way to calculate MTBF is to tally up the run times and divide by the number of failures. We have a sum of 7,139 and with 10 failures estimate MTBF as 713.9 hours. This is above the vendor’s claim of 500 so we are supporting the notion these are good motors.

The Weibull Based MTBF

A quick inspection of the data shows a cluster of early failures then quite a bit of time between failures as the equipment go older. Seems to be a decreasing hazard rate at play here, thus our assumption underlying using MTBF may be suspect.

Let’s fit a Weibull distribution to the data. Firing up Weibull++ and using default fitting for a Weibull 2-parameter distribution we find beta = 0.39664 and eta = 454.137744. The data has a beta below 1 thus shows a decreasing hazard rate over time.

Using the MTBF calculation based on the Weibull distribution fitted parameters we determine MTBF is 1,545 hours. See the article Determine MTBF Given a Weibull Distribution for details on the calculation.

Even more evidence based on the data the performance is well above the vendor’s claim of 500 hours MTBF. Let’s double the order of these fine machines.

Let’s Consider Reliability Instead

We run these motors for 168 hours at a time. So what is the probability a motor will survive 168 hours once installed?

Using the exponential distribution (MTBF estimate) we find the reliability from time 0 till 168 hours is 79%.  Using the exponential reliability function, R(t) = exp [ –  t / θ ], here.

A similar question is what is the chance of successful operation over 168 hours the 10th time we run the motor (from 1,512 to 1680 hours of life time operation or the tenth run). This is assuming the motor has survived through 9 runs. In this case, we find, not surprisingly given the assumed constant hazard rate and memoryless property of the exponential distribution the expected reliability is 79%.

Using the Weibull distribution we find the reliability from time 0 till 168 hours is 51%. Much lower than the estimate based on the MTBF calculation. We could make a decision based on the 1,545 hours MTBF value or the estimate of a 50% survival rate over the first 168 hours. 50% is not high reliability, yet 1,545 hours seems rather high.

The 10th run reliability using the Weibull fit likewise assumes the motor has survived running for 9 runs or the 1,512 hours. The reliability over the 10th run is 93%. Much higher then the MTBF based estimate.

Conclusion

The data suggests first that the assumption the exponential distribution describes the data is not true. Thus the calculation of MTBF based on the assumption of a constant hazard rate or the exponential distribution provides a misleading result.

The extra step of estimating MTBF after fitting a Weibull distribution just makes the motors appear ‘better’ then the initial estimate. An almost 3x increase in MTBF is due to the slope of the fitting distribution. It is the same data, yet accounting for the decreasing hazard rate results in a higher value for the MTBF. Keep in mind that the MTBF is the mean of the distribution and a Weibull distribution with a beta of 0.5 is heavily right skewed. (Long tail to the right…)

Based on the Weibull it is suggesting that some of the motors would run for a very, very long time without failure, even though more than half failure rather quickly.

The reliability estimate depends on the time frame of interest. For the exponential distribution fit the reliability over 168 hours is 79%, while over 1,680 hours (ten runs) it is 9.5%. For the Weibull distribution fit the reliability over 168 hours is 51% and over 1,680 hours is 18.6%.

Bottom line, using just MTBF we would buy more of the same motors and ‘enjoy’ the experience of about half the motors failing within their first week of use.

Do you have an example that shows just how bad using MTBF misleads you and decision makers? Send it over or add a comment below.

Filed Under: Articles, NoMTBF Tagged With: distribution, exponential, weibull

« What is Criticality Analysis? How Does it Work?
Prognostics »

Comments

  1. Mark Powell says

    May 31, 2017 at 2:22 PM

    Fred,

    Had a great example of this in http://nomtbf.com/2012/06/the-worst-reliability-requirement/.

    Reply
  2. William says

    May 31, 2017 at 3:40 PM

    Very nice and educational text. I have a question: What is the phisical meaning of the characteristic life (eta)?

    Reply
  3. William says

    May 31, 2017 at 3:43 PM

    Sorry, but I put my e-mail address wrongly.

    My question is: What is the physical meaning of the characteristic life (η)

    Reply
    • Fred Schenkelberg says

      May 31, 2017 at 3:51 PM

      Hi William, just as the mean is the center of mass of a normal distribution, or any distribution, the Weibull parameter eta often called the characteristic life is the point in time corresponding to 63.2 percentile of the distribution. It means that roughly 2/3 of the failures occur by that point in time.

      Think of a way to define a line, all you need is a slope and a point. For the Weibull distribution the slope is the beta parameter and describes the rate of change of the hazard function. The point is the characteristic life, defined at the 63.2 percentile point.

      Physically, it doesn’t have any meaning relative to specific failure mechanisms.

      I saw it once, the derivation of the exponential family, which includes the Weibull distribution. The 63.2 percentile falls out of the derivation and if I recall correctly has something to do with the exponential element… recall that e^(-1) = 0.368 roughly.

      hope that helps.

      Cheers,

      Fred

      Reply
      • William says

        May 31, 2017 at 4:19 PM

        Yes, thank you Fred.

        Reply
  4. Yi Kang says

    June 5, 2017 at 10:04 AM

    Great learn, thanks Fred!
    Following I would plan my operation of Rel evaluation program:
    – Identify my gate (baseline), in your case would be 168 hrs.
    – Data set sorting: (56, 5, 2559, 1147, 486, 931, 1, 1166, 786, 2), any failures below the gate would be picked: 56,5,1,2, FR already 40%, project failed and defects send back to vendor for FA, I want the RCA with correct action approved.
    – Fitting with Weibull only for rest of data, deliver a baseline requirement to compare with later. (I personally do not believe on constant FR for all kind of materials, so….Weibull)
    – Vendor re-apply and repeat program until 0% FR below 168, then let’s discuss the price…

    The gate of baseline should came from VOC, on top of reliability, let’s consider business instead as well.

    Reply
  5. Piyush says

    June 5, 2017 at 9:20 PM

    Hi Fred,
    Hope you are doing good
    very nice article Sir.
    I am not able to understand this line written in article, “A similar question is what is the chance of successful operation over 168 hours the 10th time we run the motor (from 1,512 to 1680 hours of life time operation or the tenth run).”
    My doubt is 10th time we run the motor that means only one motor is being tested and failure is checked
    But in earlier case we have taken 10 motor failure.i.e.”6, 5, 2559, 1147, 486, 931, 1, 1166, 786, 2. Let’s say this is in hour of operation till failure from a set of 10 motors. ”
    then how can we compare these two.

    Reply
    • Fred says

      June 5, 2017 at 9:24 PM

      Hi Piyush,

      I should have stated the question as a conditional probability. If the motor runs for 9 cycles of running for a week, 168 hours each week, and survived, what is the chance it will survive over the next cycle (168 hrs) after not failing for the first 9 cycles?

      Does that help?

      Cheers,

      Fred

      Reply
  6. Piyush says

    June 5, 2017 at 9:36 PM

    Hi Fred,
    Hope you are doing good
    very nice article Sir.
    I am not able to understand this line written in article, “A similar question is what is the chance of successful operation over 168 hours the 10th time we run the motor (from 1,512 to 1680 hours of life time operation or the tenth run).”
    My doubt is 10th time we run the motor that means only one motor is being tested and failure is checked
    But in earlier case we have taken 10 motor failure.i.e.”6, 5, 2559, 1147, 486, 931, 1, 1166, 786, 2. Let’s say this is in hour of operation till failure from a set of 10 motors. ”
    then how can we compare these two and i am not able to understand meaning of 10 run Is it run of 10 different motor or single motor. if it is of single motor then one motor is running for 1680 hrs and if it is for 10 different motor that means single motor has not accumulated more than 168 hrs.
    kindly clear my doubt.

    Thanks,
    Piyush

    Reply
  7. Piyush says

    June 5, 2017 at 11:00 PM

    Hi Sir,
    That means only one motor is being tested if its like that how and why commutative hrs of 1680 is referred?
    Thanks
    Piyush

    Reply
    • Fred says

      June 6, 2017 at 8:04 AM

      Hi Piyush – let’s say we have a motor, we did prior testing on another batch of motors and have some data.

      Now, let’s say we install this new motor and it runs without failure for 9 cycles, 9 x 168 hours as each cycle is a week. All good, and we have a a motor that is 9 x 168 =1,521 hours old.

      Great, so the question is what is the probability the motor, that is 1,521 hours old, what is the probability it will run without failure for the next cycle of 168 hours?

      Cheers,

      Fred

      Reply
      • Piyush says

        June 6, 2017 at 7:37 PM

        Hi sir,
        Thank you very much.
        Now its clear.
        Thanks,
        Piyush

        Reply
  8. Enock Okyere says

    May 7, 2020 at 11:11 PM

    Good morning Fred. I have interests in aircraft maintenance though I’m not a technical person on this field. I want to understand what the general implications of the following is:
    1.) High and low Removal Rate of aircraft components
    2.) PIREPS analysis showing figures of components below the Calculated Alert Rate
    3.) Relationship between Alert Rate and Exceedence Rate

    Your kind explanation of these in simple terms will be very much appreciated.
    Thanks

    Reply
    • Fred Schenkelberg says

      May 8, 2020 at 7:39 AM

      Hi Enock,

      I’m not that familiar with the aircraft maintenance industry and am unable to answer your questions. maybe someone else that reads this blog will be able to comment.

      Cheers,

      Fred

      Reply

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

[popup type="" link_text="Get Weekly Email Updates" link_class="button" ]

[/popup]

The Accendo Reliablity logo of a sun face in circuit

Please login to have full access.




Lost Password? Click here to have it emailed to you.

Not already a member? It's free and takes only a moment to create an account with your email only.

Join

Your membership brings you all these free resources:

  • Live, monthly reliability webinars & recordings
  • eBooks: Finding Value and Reliability Maturity
  • How To articles & insights
  • Podcasts & additional information within podcast show notes
  • Podcast suggestion box to send us a question or topic for a future episode
  • Course (some with a fee)
  • Largest reliability events calendar
  • Course on a range of topics - coming soon
  • Master reliability classes - coming soon
  • Basic tutorial articles - coming soon
  • With more in the works just for members
Speaking of Reliability podcast logo

Subscribe and enjoy every episode

RSS
iTunes
Stitcher

Join Accendo

Receive information and updates about podcasts and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

Dare to Know podcast logo

Subscribe and enjoy every episode

RSS
iTunes
Stitcher

Join Accendo

Receive information and updates about podcasts and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

Accendo Reliability Webinar Series podcast logo

Subscribe and enjoy every episode

RSS
iTunes
Stitcher

Join Accendo

Receive information and updates about podcasts and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

Recent Articles

  • test
  • test
  • test
  • Your Most Important Business Equation
  • Your Suppliers Can Be a Risk to Your Project

© 2025 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy