Accendo Reliability

Your Reliability Engineering Professional Development Site

  • Home
  • About
    • Contributors
  • Reliability.fm
    • Speaking Of Reliability
    • Rooted in Reliability: The Plant Performance Podcast
    • Quality during Design
    • Way of the Quality Warrior
    • Critical Talks
    • Dare to Know
    • Maintenance Disrupted
    • Metal Conversations
    • The Leadership Connection
    • Practical Reliability Podcast
    • Reliability Matters
    • Reliability it Matters
    • Maintenance Mavericks Podcast
    • Women in Maintenance
    • Accendo Reliability Webinar Series
  • Articles
    • CRE Preparation Notes
    • on Leadership & Career
      • Advanced Engineering Culture
      • Engineering Leadership
      • Managing in the 2000s
      • Product Development and Process Improvement
    • on Maintenance Reliability
      • Aasan Asset Management
      • AI & Predictive Maintenance
      • Asset Management in the Mining Industry
      • CMMS and Reliability
      • Conscious Asset
      • EAM & CMMS
      • Everyday RCM
      • History of Maintenance Management
      • Life Cycle Asset Management
      • Maintenance and Reliability
      • Maintenance Management
      • Plant Maintenance
      • Process Plant Reliability Engineering
      • ReliabilityXperience
      • RCM Blitz®
      • Rob’s Reliability Project
      • The Intelligent Transformer Blog
      • The People Side of Maintenance
      • The Reliability Mindset
    • on Product Reliability
      • Accelerated Reliability
      • Achieving the Benefits of Reliability
      • Apex Ridge
      • Metals Engineering and Product Reliability
      • Musings on Reliability and Maintenance Topics
      • Product Validation
      • Reliability Engineering Insights
      • Reliability in Emerging Technology
    • on Risk & Safety
      • CERM® Risk Insights
      • Equipment Risk and Reliability in Downhole Applications
      • Operational Risk Process Safety
    • on Systems Thinking
      • Communicating with FINESSE
      • The RCA
    • on Tools & Techniques
      • Big Data & Analytics
      • Experimental Design for NPD
      • Innovative Thinking in Reliability and Durability
      • Inside and Beyond HALT
      • Inside FMEA
      • Integral Concepts
      • Learning from Failures
      • Progress in Field Reliability?
      • R for Engineering
      • Reliability Engineering Using Python
      • Reliability Reflections
      • Testing 1 2 3
      • The Manufacturing Academy
  • eBooks
  • Resources
    • Accendo Authors
    • FMEA Resources
    • Feed Forward Publications
    • Openings
    • Books
    • Webinars
    • Journals
    • Higher Education
    • Podcasts
  • Courses
    • 14 Ways to Acquire Reliability Engineering Knowledge
    • Reliability Analysis Methods online course
    • Measurement System Assessment
    • SPC-Process Capability Course
    • Design of Experiments
    • Foundations of RCM online course
    • Quality during Design Journey
    • Reliability Engineering Statistics
    • Quality Engineering Statistics
    • An Introduction to Reliability Engineering
    • Reliability Engineering for Heavy Industry
    • An Introduction to Quality Engineering
    • Process Capability Analysis course
    • Root Cause Analysis and the 8D Corrective Action Process course
    • Return on Investment online course
    • CRE Preparation Online Course
    • Quondam Courses
  • Webinars
    • Upcoming Live Events
  • Calendar
    • Call for Papers Listing
    • Upcoming Webinars
    • Webinar Calendar
  • Login
    • Member Home

III. A. 8. Fault Tolerance

by Bryan Christiansen Leave a Comment

Reliability Techniques For Analyzing And Improving Fault Tolerance

Reliability Techniques For Analyzing And Improving Fault Tolerance

When designing equipment and processes, engineers leave a safety margin that ensures equipment remains functional when a fault or defect is affecting it partially or wholly. Minor defects affecting production assets should not cause immediate breakdowns. A fault-tolerant system remains operational for predetermined intervals before undertaking corrective measures. Faults affecting the operation of different systems emanate from more than a single source. [Read more…]

Filed Under: Articles, CMMS and Reliability, on Maintenance Reliability Tagged With: Fault Tolerance, FMEA, FTA, reliability

by Fred Schenkelberg Leave a Comment

The Downside of a Fault Tolerant System

The Downside of a Fault Tolerant System

Maintaining high reliability or availability is a marked advantage for any system. A system that achieves the ability to avoid system downtime due to a single failure event, is essential in many applications. Yet, the fault tolerant capability comes at a price.

A system that achieves the ability to avoid system downtime due to a single failure event, is essential in many applications. Yet, the fault tolerant capability comes at a price.

Here is a short list and brief description of fault tolerant design disadvantages:

Masking or obscuring low-level failures

The nature of a fault tolerance design is to continue to operate normally even with a component failure.

Thus if the ability to detect a component failure relies on a loss of function or capability, it may be difficult to detect the failure. This sets the stage for a second component failure to cause a system downing event. [Read more…]

Filed Under: Articles, CRE Preparation Notes, Reliability in Design and Development Tagged With: Fault Tolerance

by Fred Schenkelberg Leave a Comment

Deciding What Should Have Fault Tolerance

Deciding What Should Have Fault Tolerance

In some circumstances, it is desirable to ensure the system continues to operate even if there is an internal failure. An aircraft navigation system should be able to operate even if an internal dc-dc regulator fails, for example.

Not everything within some systems benefits by being fault tolerant.

For example, a failure of a cabin reading light over a passenger seat is not critical to the safe operation of the aircraft, thus is likely not created to be fault tolerant. One criterion to determine what should be fault tolerant is the criticality of the function the system provides.

This also applies to specific subsystems within a system allowing some elements to be created fault tolerant and others within the system not. [Read more…]

Filed Under: Articles, CRE Preparation Notes, Reliability in Design and Development Tagged With: Fault Tolerance

by Fred Schenkelberg Leave a Comment

Fault Tolerance Basics

Fault Tolerance Basics

Fault tolerance is a system that is reliant to the failure of elements within the system. It also may be called a fail safe design.

A fault tolerant system may continue to operate just fine, after one of the power supplies fails, for example. Or it may operate in a reduced or degraded state.

Other systems may have a ‘limp home’ condition, allowing the system to save critical data or allowing you to drive to a safe place to change a flat tire. [Read more…]

Filed Under: Articles, CRE Preparation Notes, Reliability in Design and Development Tagged With: Fault Tolerance

CRE Preparation Notes

Article by Fred Schenkelberg

Join Accendo

Join our members-only community for full access to exclusive eBooks, webinars, training, and more.

It’s free and only takes a minute.

Get Full Site Access

Not ready to join?
Stay current on new articles, podcasts, webinars, courses and more added to the Accendo Reliability website each week.
No membership required to subscribe.

[popup type="" link_text="Get Weekly Email Updates" link_class="button" ]

[/popup]

  • CRE Preparation Notes
  • CRE Prep
  • Reliability Management
  • Probability and Statistics for Reliability
  • Reliability in Design and Development
  • Reliability Modeling and Predictions
  • Reliability Testing
  • Maintainability and Availability
  • Data Collection and Use

© 2025 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy