Accendo Reliability

Your Reliability Engineering Professional Development Site

  • Home
  • About
    • Contributors
  • Reliability.fm
    • Speaking Of Reliability
    • Rooted in Reliability: The Plant Performance Podcast
    • Quality during Design
    • Way of the Quality Warrior
    • Critical Talks
    • Dare to Know
    • Maintenance Disrupted
    • Metal Conversations
    • The Leadership Connection
    • Practical Reliability Podcast
    • Reliability Matters
    • Reliability it Matters
    • Maintenance Mavericks Podcast
    • Women in Maintenance
    • Accendo Reliability Webinar Series
  • Articles
    • CRE Preparation Notes
    • on Leadership & Career
      • Advanced Engineering Culture
      • Engineering Leadership
      • Managing in the 2000s
      • Product Development and Process Improvement
    • on Maintenance Reliability
      • Aasan Asset Management
      • AI & Predictive Maintenance
      • Asset Management in the Mining Industry
      • CMMS and Reliability
      • Conscious Asset
      • EAM & CMMS
      • Everyday RCM
      • History of Maintenance Management
      • Life Cycle Asset Management
      • Maintenance and Reliability
      • Maintenance Management
      • Plant Maintenance
      • Process Plant Reliability Engineering
      • ReliabilityXperience
      • RCM Blitz®
      • Rob’s Reliability Project
      • The Intelligent Transformer Blog
      • The People Side of Maintenance
      • The Reliability Mindset
    • on Product Reliability
      • Accelerated Reliability
      • Achieving the Benefits of Reliability
      • Apex Ridge
      • Metals Engineering and Product Reliability
      • Musings on Reliability and Maintenance Topics
      • Product Validation
      • Reliability Engineering Insights
      • Reliability in Emerging Technology
    • on Risk & Safety
      • CERM® Risk Insights
      • Equipment Risk and Reliability in Downhole Applications
      • Operational Risk Process Safety
    • on Systems Thinking
      • Communicating with FINESSE
      • The RCA
    • on Tools & Techniques
      • Big Data & Analytics
      • Experimental Design for NPD
      • Innovative Thinking in Reliability and Durability
      • Inside and Beyond HALT
      • Inside FMEA
      • Integral Concepts
      • Learning from Failures
      • Progress in Field Reliability?
      • R for Engineering
      • Reliability Engineering Using Python
      • Reliability Reflections
      • Testing 1 2 3
      • The Manufacturing Academy
  • eBooks
  • Resources
    • Accendo Authors
    • FMEA Resources
    • Feed Forward Publications
    • Openings
    • Books
    • Webinars
    • Journals
    • Higher Education
    • Podcasts
  • Courses
    • 14 Ways to Acquire Reliability Engineering Knowledge
    • Reliability Analysis Methods online course
    • Measurement System Assessment
    • SPC-Process Capability Course
    • Design of Experiments
    • Foundations of RCM online course
    • Quality during Design Journey
    • Reliability Engineering Statistics
    • Quality Engineering Statistics
    • An Introduction to Reliability Engineering
    • Reliability Engineering for Heavy Industry
    • An Introduction to Quality Engineering
    • Process Capability Analysis course
    • Root Cause Analysis and the 8D Corrective Action Process course
    • Return on Investment online course
    • CRE Preparation Online Course
    • Quondam Courses
  • Webinars
    • Upcoming Live Events
  • Calendar
    • Call for Papers Listing
    • Upcoming Webinars
    • Webinar Calendar
  • Login
    • Member Home

by Fred Schenkelberg 5 Comments

Life Testing Question

Life Testing Question

Hi Fred,

I would take this opportunity to ask the reliability guru about bathtub curve for hardware reliability. I am running 27 units for life test for a million cycles around 555 hours. I have one failure at 300,000 cycles, and the rest of the units are running fine. Would this be classified as an early life failure? Also, how do I make a determination of when the early life failure time interval ends and constant failure rate starts in this example based on failure rate of remaining units? Thanks.

–and my response —

You have two next steps. Continue to run the testing till you have more failures and plot the results. The units that have not failed are censored, yet you do need enough failures to determine the slope and distribution fit (at least five).

Second, determine the root cause of the failures. If it is a wear out mechanism, like solder fatigue, then it’s pretty safe to say it’s wear out. IF it’s a poorly assembled unit, you may lean toward early life failure.

In the best case, you’ll experience the same failure mechanism for each failure. This permits you to use published literature about the failure mechanism to confidently fit a life model to your data. If each failure is different, then modeling at the system level and using only an empirical or non-parametric fit may provide some information about the expected performance, yet it will be difficult to assign an acceleration factor to the results.

Keep in mind that there are many ways a product can fail and they do not arrange themselves into convenient groups. In a life test, you most likely have a specific stress that is being applied that will excite the acceleration of only a subset of possible failure mechanisms.

Good luck with your testing and data analysis.

Cheers,

Fred
On 03/11/12 8:10 AM, Kartik Ramaswamy wrote:
——————–
Hi Fred,

I would take this opportunity to ask the reliability guru about bathtub curve for hardware reliability. I am running 27 units for life test for a million cycles around 555 hours. I have one failure at 300,000 cycles, and the rest of the units are running fine. Would this be classified as an early life failure? Also, how do I make a determination of when the early life failure time interval ends and constant failure rate starts in this example based on failure rate of remaining units? Thanks.


Related:

Sources of Reliability Data (article)

Reliability Management Terminology (article)

Failure modes and mechanisms (article)

 

Filed Under: Articles, CRE Preparation Notes, Reliability Testing Tagged With: failure mechanism, failure mechanisms, Failure Rate, Life Test

About Fred Schenkelberg

I am the reliability expert at FMS Reliability, a reliability engineering and management consulting firm I founded in 2004. I left Hewlett Packard (HP)’s Reliability Team, where I helped create a culture of reliability across the corporation, to assist other organizations.

« CRE exam in May at WCQI
System or component testing »

Comments

  1. prabhakar says

    March 12, 2012 at 9:14 AM

    hello fred,

    One question here…pertaining to your above response. all said above is right..but one aspect i felt was more required was why the sampling considered as 27 units? I think this will also have an impact.

    please correct me..

    regards,
    prabhakar

    Reply
    • Fred Schenkelberg says

      March 12, 2012 at 9:30 AM

      Hi Prabhakar,

      27 units for a sample size may or may not be adequate – it really depends on what sampling risk you are willing to take. I’m a statistician and would always like to see more samples, yet the reality is we often have budget, space, or other constraints. I have found that about 20 samples if often good enough to estimate a distribution when there are plenty of failures. If I’m making assumptions on the failure mechanism and acceleration model – we often run a test expecting no failure unless the unknown true failure rate is above some threshold – the sample size really doesn’t help with those assumptions, yet it is common practice.

      27 isn’t a magic number – it’s not too low, yet depending on the details of the test design, expected variation, and tolerance for risk that the sample doesn’t represent the population.

      cheers,

      Fred

      Reply
  2. Dave Wakefield says

    March 13, 2012 at 12:53 PM

    Fred,
    Why a minimum of five failures to determine slope and fit? I’ve heard three before, but only as a rule of thumb.

    Dave Wakefield

    Reply
    • Fred Schenkelberg says

      March 13, 2012 at 9:21 PM

      Hi Dave,

      Actually if you ask a random group of statisticians you’ll get a different answer from each on average.

      One failure point and an assumption on the shape parameter is done – with risk or the assumption being wrong.

      Two failure to fit a straight line and risk of the slope be very, very wrong.

      three failures to fit a line and check for curvature – absolute minimum and generally risky

      5 out of 20 samples provides information about the lower quartile – while more failures and more samples would be great – it’s a trade off on time, testing costs, sample costs, and the early estimate with less risk then having less information.

      Cheers,

      Fred

      Reply
  3. Dave Wakefield says

    March 18, 2012 at 10:17 PM

    Fred,
    Thanks for the great explanation. Pretty obvious now that I’ve thought it through!

    Respectfully,
    Dave

    Reply

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

CRE Preparation Notes

Article by Fred Schenkelberg

Join Accendo

Join our members-only community for full access to exclusive eBooks, webinars, training, and more.

It’s free and only takes a minute.

Get Full Site Access

Not ready to join?
Stay current on new articles, podcasts, webinars, courses and more added to the Accendo Reliability website each week.
No membership required to subscribe.

[popup type="" link_text="Get Weekly Email Updates" link_class="button" ]

[/popup]

  • CRE Preparation Notes
  • CRE Prep
  • Reliability Management
  • Probability and Statistics for Reliability
  • Reliability in Design and Development
  • Reliability Modeling and Predictions
  • Reliability Testing
  • Maintainability and Availability
  • Data Collection and Use

© 2025 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy