Accendo Reliability

Your Reliability Engineering Professional Development Site

  • Home
  • About
    • Contributors
  • Reliability.fm
    • Speaking Of Reliability
    • Rooted in Reliability: The Plant Performance Podcast
    • Quality during Design
    • Way of the Quality Warrior
    • Critical Talks
    • Dare to Know
    • Maintenance Disrupted
    • Metal Conversations
    • The Leadership Connection
    • Practical Reliability Podcast
    • Reliability Matters
    • Reliability it Matters
    • Maintenance Mavericks Podcast
    • Women in Maintenance
    • Accendo Reliability Webinar Series
  • Articles
    • CRE Preparation Notes
    • on Leadership & Career
      • Advanced Engineering Culture
      • Engineering Leadership
      • Managing in the 2000s
      • Product Development and Process Improvement
    • on Maintenance Reliability
      • Aasan Asset Management
      • AI & Predictive Maintenance
      • Asset Management in the Mining Industry
      • CMMS and Reliability
      • Conscious Asset
      • EAM & CMMS
      • Everyday RCM
      • History of Maintenance Management
      • Life Cycle Asset Management
      • Maintenance and Reliability
      • Maintenance Management
      • Plant Maintenance
      • Process Plant Reliability Engineering
      • ReliabilityXperience
      • RCM Blitz®
      • Rob’s Reliability Project
      • The Intelligent Transformer Blog
      • The People Side of Maintenance
      • The Reliability Mindset
    • on Product Reliability
      • Accelerated Reliability
      • Achieving the Benefits of Reliability
      • Apex Ridge
      • Metals Engineering and Product Reliability
      • Musings on Reliability and Maintenance Topics
      • Product Validation
      • Reliability Engineering Insights
      • Reliability in Emerging Technology
    • on Risk & Safety
      • CERM® Risk Insights
      • Equipment Risk and Reliability in Downhole Applications
      • Operational Risk Process Safety
    • on Systems Thinking
      • Communicating with FINESSE
      • The RCA
    • on Tools & Techniques
      • Big Data & Analytics
      • Experimental Design for NPD
      • Innovative Thinking in Reliability and Durability
      • Inside and Beyond HALT
      • Inside FMEA
      • Integral Concepts
      • Learning from Failures
      • Progress in Field Reliability?
      • R for Engineering
      • Reliability Engineering Using Python
      • Reliability Reflections
      • Testing 1 2 3
      • The Manufacturing Academy
  • eBooks
  • Resources
    • Accendo Authors
    • FMEA Resources
    • Feed Forward Publications
    • Openings
    • Books
    • Webinars
    • Journals
    • Higher Education
    • Podcasts
  • Courses
    • 14 Ways to Acquire Reliability Engineering Knowledge
    • Reliability Analysis Methods online course
    • Measurement System Assessment
    • SPC-Process Capability Course
    • Design of Experiments
    • Foundations of RCM online course
    • Quality during Design Journey
    • Reliability Engineering Statistics
    • Quality Engineering Statistics
    • An Introduction to Reliability Engineering
    • Reliability Engineering for Heavy Industry
    • An Introduction to Quality Engineering
    • Process Capability Analysis course
    • Root Cause Analysis and the 8D Corrective Action Process course
    • Return on Investment online course
    • CRE Preparation Online Course
    • Quondam Courses
  • Webinars
    • Upcoming Live Events
  • Calendar
    • Call for Papers Listing
    • Upcoming Webinars
    • Webinar Calendar
  • Login
    • Member Home

by Fred Schenkelberg 1 Comment

Electronics Failure Analysis Done the Right Way

Electronics Failure Analysis Done the Right Way

“Failures are gold”

Early in my career the engineering manager relished discovering equipment failure.

It didn’t matter if it was human, electronic, mechanical or software in nature, the glint in his eye soon gave way to a flood of possibilities. He enjoyed the process of investigating the fundamental reasons a failure occurred.

He wanted to know the cause(s) so we could design and operate a better system less likely to invoke those causes leading to failure. He was relentless in the pursuit of understanding what happened.

At times in my career, I’ve sent suspected bad parts or confirmed failed components to vendors for root cause analysis. That was often pure folly.

The responses often included:

“Yes, it failed” with no other explanation
“It’s working now, we do not see a failure here.”
“We know about this and it’s fixed already.”
“Oh, it was electrostatic discharge, it was your fault.” They had not received the part yet.
“You overstressed the part, it was a victim of poor design or handling.”

It was rare that a vendor would provide meaningful root cause analysis that provided actionable information for improvement. If it’s our design, in what way?

 

Failure Mechanisms and Modes

A failure mode is what is observed indicating a failure. It’s the plume of smoke rising from the back of the monitor. It’s the lack of response to the power switch. It’s the wrong answer appearing in your calculator. It’s how you know the product is not working.

A failure mechanism is the physical, chemical, thermodynamic or other processes that result in failure, which we then can observe as a failure mode.

Failure mechanisms are where the science exists, it’s what excited my boss. Understanding the science of failures allows us to select better materials, improve manufacturing processes, or minimize loads with the effect of improving reliability.

 

When a failure occurs do Root Cause analysis

The common term, “failure analysis” generally implies that we:

  • verify the failure occurred, try the power switch again
  • gather evidence exhibited indicating a failure, what are the symptoms or failure modes
  • recommend corrective or preventative action which may include short and long term solutions

This common approach tends to jump to action before understanding what actually happened. If you have a power supply that is not working and by replacing the analog controller component, the power supply works again, is it fair to conclude the replaced part was the root cause? No.

What about the replaced part failed? Was it faulty due to corrosion or mechanical bending? Was the part merely a victim of another element in the system causing a current overload that causes the part to fail?

Elements of a system fail for many different reasons and until you fully understand the root cause of the failure, any corrective or preventative action is little more than a guess at what will work.

Root Cause Analysis (RCA) include the detailed process of determining the failure mechanisms and the factors that cause that failure to occur.

RCA helps you understand if the faulty part is due to material contamination, poor material selection, over-stress, excessive degradation, etc. Understanding that the component is unable to withstand repeated inrush current cycles as each cycle accumulated damage to the aluminum lines on the IC, you can either change parts to one that can handle the stress or reduce the stress.

Simply replacing the part and saying the vendor is at fault for poor workmanship, is not going to solve the problem.

Failure analysis is largely done in the lab with the tools and expertise to tease out the root cause of a failure mode. Recently Bhanu Sood of NASA Goddard presented at the IPC APEX Expo on failure analysis, his slide set, Failure Analysis for Improved Reliability, is available online.

He provided an extensive overview of failure mechanisms and failure analysis techniques along with a couple case studies. It’s a very good read and certainly will spark your interest in celebrating failures.

Think of what you can learn and improve. I hope you get as excited by failures as my former boss.

Filed Under: Articles, Musings on Reliability and Maintenance Topics, on Product Reliability Tagged With: Failure, failure analysis

About Fred Schenkelberg

I am the reliability expert at FMS Reliability, a reliability engineering and management consulting firm I founded in 2004. I left Hewlett Packard (HP)’s Reliability Team, where I helped create a culture of reliability across the corporation, to assist other organizations.

« Eliminating early life failures
General Human Factors Design Principles »

Comments

  1. Pavel Bochian says

    November 22, 2016 at 12:16 PM

    Good article.

    Reply

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Article by Fred Schenkelberg
in the Musings series

Join Accendo

Receive information and updates about articles and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

Recent Articles

  • test
  • test
  • test
  • Your Most Important Business Equation
  • Your Suppliers Can Be a Risk to Your Project

© 2025 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy