Why HALT Won't Give You an MTBF (And Why You Shouldn't Care)

Mark L. Morelli
Reliability & Test Engineer (Retired)

Hobbs Engineering Webinar June 4, 2014

Author’s Bio

• Recently retired after a 32-year career in reliability and test engineering
  – Aerospace, military, and commercial building systems
  – Applied HALT to > 200 products (> 500 separate testing activities)
  – Applied HASS to ~ 10 product lines (many thousands of units tested)
• BSEE, Univ. of Hartford
• Adjunct Professor, Univ. of Hartford
• Authored and presented numerous technical papers
• Presently a freelance writer for The Motley Fool (finance site) with a focus on technology and innovation

Agenda

• Discussion of MTBF

• Why HALT doesn’t provide an MTBF value

• Why HALT will improve reliability

Mean Time Between Failures (MTBF)

• MTBF = elapsed time between inherent failures
  – Assumes a renewal process (system repaired upon failure)
• Does not provide a failure distribution or pattern
• Most prediction methods (e.g. MIL-HDBK-217) use “old” data

MTBF = Cumulative Fleet Op Time ÷ Failures

MTBF = time-based parameter that does not account for product failure distribution/patterns

MTBF and failure distributions

T (hrs)    System 1 failures    System 2 failures    System 3 failures

100 X (2)

2000 x

10000 x

15000 x X (2)

20000 x X (2)

All systems have same MTBF = 20,000 ÷ 4 = 5,000 hours
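The arithmetic on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the presenter's data: the three failure-time lists below are hypothetical, chosen only so that each has four failures in a 20,000-hour window, and the names `fleet_mtbf` and `OBSERVATION_HOURS` are introduced here for the example.

```python
def fleet_mtbf(cumulative_hours, failure_count):
    """MTBF = cumulative operating time divided by number of failures."""
    if failure_count == 0:
        raise ValueError("MTBF is undefined when no failures were observed")
    return cumulative_hours / failure_count


# Observation window from the slide: 20,000 operating hours per system.
OBSERVATION_HOURS = 20_000

# Hypothetical failure times in hours (assumed for illustration only).
systems = {
    "evenly spread": [5_000, 10_000, 15_000, 20_000],
    "early-life cluster": [100, 100, 150, 200],
    "wear-out cluster": [19_000, 19_500, 20_000, 20_000],
}

# Only the failure count enters the MTBF formula; the timing does not.
results = {
    name: fleet_mtbf(OBSERVATION_HOURS, len(times))
    for name, times in systems.items()
}

for name, times in systems.items():
    print(f"{name}: MTBF = {results[name]:,.0f} h, "
          f"first failure at {min(times):,} h")
```

All three patterns report 5,000 hours because only the count of failures enters the calculation, even though the first failure arrives at very different times in each case.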

MTBF and failure distribution

Why HALT ≠ MTBF

• HALT is an exploratory process used on electronics that seeks to identify weaknesses through application of increasing and varying stress
  – Difficult to correlate to a precise time period, but the process does identify most failure types that occur during the product life cycle
  – Difficult to calculate an acceleration factor between test and deployment
• Typically a small number of test articles are used, and not every sample will have all stresses (temperature, vibration, electrical, etc.) applied
  – Nearly impossible to accurately measure reliability with low sample sizes
  – My experience indicates that many (if not most) product failures are due to lot-related part defects or process variations that cannot be found in a single test at a single point in time

HALT = stress-based tool that addresses typical product failure types
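The point about low sample sizes can be made concrete with the classical success-run (zero-failure binomial) bound. This bound is not part of the presentation. It is a standard result that applies to a pass/fail test of fixed duration, which HALT is not, so the sketch below only illustrates how little a handful of units can demonstrate statistically. The function names and the sample size of four are assumptions made for the example.

```python
import math


def demonstrated_reliability(sample_size, confidence):
    """Lower bound on reliability when all `sample_size` units pass.

    Success-run relation: R = (1 - C) ** (1 / n).
    """
    if sample_size < 1:
        raise ValueError("sample_size must be at least 1")
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must be strictly between 0 and 1")
    return (1.0 - confidence) ** (1.0 / sample_size)


def units_needed(reliability, confidence):
    """Zero-failure sample size needed to demonstrate `reliability`."""
    if not 0.0 < reliability < 1.0:
        raise ValueError("reliability must be strictly between 0 and 1")
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must be strictly between 0 and 1")
    return math.ceil(math.log(1.0 - confidence) / math.log(reliability))


# Hypothetical case: four test articles, all passing, 90% confidence.
small_sample_bound = demonstrated_reliability(4, 0.90)
print(f"4 units, 0 failures: R >= {small_sample_bound:.2f} at 90% confidence")

# Units required to demonstrate 99% reliability at 90% confidence.
print(f"Units needed for 99%/90%: {units_needed(0.99, 0.90)}")
```

Under these assumptions, four passing units demonstrate a reliability of only about 0.56 at 90% confidence, while demonstrating 99% reliability at the same confidence takes 230 units with no failures.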

Example: Capacitor failures

[Chart: # Failures over time, Aug-93 through Dec-96; series: When Produced, When Failed]

Design testing completed Jan 1994: No capacitor failures

What HALT does do

• Makes product more robust
  – Can withstand (sometimes) unknown factory and field environments
• Allows development of ongoing reliability tests (ORT), including production screening regimen (e.g. HASS)
  – Not all failures are related to design; some will creep into product over time

Ongoing reliability testing

• Needed to account for issues creeping into product over time
  – Lot-related (e.g. capacitor) problems
  – Process (e.g. solder) variation
• Periodic re-HALT
• Production screening
  – HASS

Summary

• MTBF is a time-based parameter that is unrelated to a failure distribution

• HALT is a stress-based tool that is related to failure patterns (and distributions)

Contact information

Mark L. Morelli
mathman6577@gmail.com
Twitter: @mathman6577
LinkedIn: Mark Morelli (Greater NYC area)
