Mean Time Between Failures (MTBF)

What is Mean Time Between Failures (MTBF)?

Mean Time Between Failures (MTBF) is a reliability metric that tells you, on average, how long a repairable asset runs before it breaks down. If a machine has an MTBF of 800 hours, you can expect roughly 800 hours of operation between one failure and the next.

MTBF doesn't predict when a specific failure will happen. It's a statistical average. But over time and across multiple assets, it's one of the most reliable indicators you have for planning maintenance, budgeting for repairs, comparing equipment, and deciding when it's time to replace rather than keep fixing.

It applies only to repairable assets, things you fix and put back into service. For items you replace entirely when they fail (like light bulbs or batteries), the equivalent metric is MTTF (Mean Time To Failure). Standardized methodology for collecting the failure data that feeds MTBF calculations is described in ISO 14224:2016.

How to Calculate MTBF

Basic Formula

MTBF = Total Operating Time / Number of Failures

"Operating time" means the hours the asset was actually running, not calendar time, not the time it sat idle, and not the time it was down for repair.

Calculation Examples

Single asset, simple case: A compressor ran for 4,800 hours over two years and failed 6 times. MTBF = 4,800 / 6 = 800 hours

Multiple identical assets (fleet calculation): You have 10 delivery trucks. Over the past year, they operated a combined 45,000 hours and experienced 30 breakdowns across the fleet. Fleet MTBF = 45,000 / 30 = 1,500 hours

Converting to practical time: An MTBF of 2,000 hours for a machine that runs 8 hours/day, 250 days/year (2,000 hours/year) means you can expect roughly one failure per year.

The same MTBF for a machine running 24/7 (8,760 hours/year) means roughly 4.4 failures per year.

Context matters. The same MTBF number means very different things depending on how intensively the asset is used.

Failure Rate

Failure rate is the inverse of MTBF:

Failure Rate (λ) = 1 / MTBF

If MTBF = 800 hours, the failure rate is 1/800 = 0.00125 failures per hour, or about 1.25 failures per 1,000 operating hours.

Metric	What It Measures	Applies To	Formula
MTBF	Average operating time between failures	Repairable assets	Total operating time / Number of failures
MTTF	Average time until first (and only) failure	Non-repairable items	Total operating time / Number of units
MTTR	Average time to fix a failure	Repairable assets	Total repair time / Number of repairs
Availability	Percentage of time the asset is operational	Repairable assets	MTBF / (MTBF + MTTR)

How They Work Together

These metrics aren't useful in isolation. Combining them gives you availability, the percentage of time the asset is actually operational:

Asset Availability = MTBF / (MTBF + MTTR)

Example: A machine has an MTBF of 400 hours and an MTTR of 8 hours. Availability = 400 / (400 + 8) = 400 / 408 = 98.0%

Now imagine another machine with the same MTBF of 400 hours but an MTTR of 40 hours (it's harder to fix). Availability = 400 / (400 + 40) = 400 / 440 = 90.9%

Same reliability, but much lower availability because repairs take longer. This is why you need both metrics.

The Bathtub Curve: Failure Patterns Over Time

Most equipment follows a predictable failure pattern called the "bathtub curve" because of its shape:

1. Early Life (Infant Mortality): First weeks/months

Higher failure rate due to manufacturing defects, installation errors, wrong settings
MTBF appears low during this period
Mitigation: burn-in testing, commissioning checklists, warranty claims

2. Useful Life (Random Failures): Main operating period

Low, relatively constant failure rate
Failures are random and unpredictable
This is where MTBF is most meaningful and stable
Mitigation: preventive maintenance, operator training

3. Wear-Out (End of Life): Final period

Failure rate increases as components age and degrade
MTBF drops noticeably
Mitigation: increased inspection frequency, planned replacement, predictive maintenance

Understanding where an asset is on this curve helps you interpret its MTBF correctly and choose the right maintenance strategy.

Key Benchmarks

MTBF varies enormously by asset type and industry. These are general guidelines:

Asset Type	Typical MTBF Range	What "Good" Looks Like
Industrial motors	20,000–100,000 hours	50,000+ hours
Pumps	10,000–50,000 hours	25,000+ hours
HVAC systems	5,000–20,000 hours	15,000+ hours
Forklifts	2,000–6,000 hours	4,000+ hours
Laptops/PCs	25,000–50,000 hours	35,000+ hours
Servers	50,000–200,000 hours	100,000+ hours
Vehicle fleet	1,500–5,000 hours	3,000+ hours

Important: Manufacturer-stated MTBF is typically calculated under ideal lab conditions. Real-world MTBF is almost always lower. Track your actual numbers. Don't rely on spec sheets.

Who Needs MTBF and When

Maintenance managers: Monthly/quarterly. Track reliability trends per asset and asset class. Identify chronic problem equipment. Justify preventive maintenance investment by showing MTBF improvement.
Operations managers: Monthly. Understand which equipment is most likely to disrupt production. Plan backup capacity.
Finance teams: At budget cycles. Use MTBF trends to forecast repair costs and justify capital replacement. Compare repair spending vs. MTBF improvement.
Procurement: At purchase decisions. Compare MTBF ratings between vendors. Factor reliability into total cost of ownership calculations.
Reliability engineers: Continuously. Perform root cause analysis on failures. Drive MTBF improvement programs. Model system reliability.

Real-World Examples

Example 1: Manufacturing Line Reliability

A food manufacturer tracked 8 packaging machines over 12 months:

Machine	Operating Hours	Failures	MTBF (hours)	MTTR (hours)	Availability
PKG-01	3,800	2	1,900	3.5	99.8%
PKG-02	3,750	4	938	6.0	99.4%
PKG-03	3,600	9	400	8.0	98.0%
PKG-04	3,820	3	1,273	4.0	99.7%
PKG-05	3,500	12	292	12.0	96.1%
PKG-06	3,780	2	1,890	3.0	99.8%
PKG-07	3,650	5	730	7.0	99.0%
PKG-08	3,700	3	1,233	5.0	99.6%

PKG-05 stood out immediately: lowest MTBF (292 hours) and highest MTTR (12 hours). This one machine was responsible for 35% of all packaging line downtime.

Root cause analysis revealed: PKG-05 was 3 years older than the rest and had been "repaired" multiple times with non-OEM parts. Each repair fixed the immediate symptom but introduced new failure modes.

Decision: Replace PKG-05 ($85,000) rather than continue repairing ($42,000/year in repair costs + $65,000/year in estimated downtime costs). The new machine's first-year MTBF: 2,100 hours. Payback period: 10 months.

Example 2: Fleet Maintenance Strategy Shift

A utility company had a fleet of 40 service vehicles. They tracked MTBF over three years:

Year 1 (reactive maintenance only): Fleet MTBF = 1,200 hours. 142 total breakdowns. Average roadside breakdown: $800 including towing, rental, and emergency repair.
Year 2 (introduced preventive maintenance program): Fleet MTBF = 1,850 hours. 94 breakdowns. PM cost: $45,000/year.
Year 3 (PM + predictive monitoring on 15 critical vehicles): Fleet MTBF = 2,600 hours. 61 breakdowns. PM + PdM cost: $62,000/year.

Financial summary:

Year	Breakdowns	Breakdown Cost	PM/PdM Cost	Total	MTBF
1	142	$113,600	$0	$113,600	1,200 hrs
2	94	$75,200	$45,000	$120,200	1,850 hrs
3	61	$48,800	$62,000	$110,800	2,600 hrs

Year 2 looked like it cost more overall, but MTBF improved 54%. By Year 3, total costs were lower and reliability was 117% better than Year 1. The trend continued into Year 4 with an MTBF of 3,100 hours.

Common Mistakes

Counting calendar time instead of operating time. A machine that's installed for 8,760 hours (one year) but only runs for 2,000 hours has very different reliability implications. MTBF must be based on actual operating hours.
Ignoring "minor" failures. If you only count major breakdowns and ignore jams, stalls, and minor malfunctions, your MTBF will be artificially high, and your maintenance strategy will be based on fantasy.
Comparing MTBF across different contexts. An MTBF of 500 hours for a jackhammer is excellent. For a server, it's terrible. Always compare within the same asset class and operating context.
Not enough data. MTBF from 2 failures over 6 months is statistically meaningless. You need enough data points over enough time for the number to be useful. As a minimum, aim for 10+ failure events per calculation.
Using manufacturer MTBF as gospel. Manufacturer numbers are calculated under controlled conditions with new components. Your actual MTBF will be lower. Track your own data.
Treating MTBF as a guarantee. An MTBF of 1,000 hours doesn't mean the asset will run exactly 1,000 hours before failing. Failures are probabilistic. Some will happen at 200 hours, some at 2,000.

How to Improve MTBF

Implement preventive maintenance. Regular servicing catches degradation before it causes failure. Organizations that move from reactive to preventive maintenance typically see MTBF improve by 30–60% within the first year.
Perform root cause analysis (RCA) on every failure. Don't just fix the symptom, understand why it failed. Was it a worn component? Operator error? Environmental factor? Design flaw? Address the cause, not just the effect.
Train operators properly. A significant percentage of equipment failures are caused by incorrect operation: overloading, improper startup/shutdown sequences, ignoring warning indicators. Training is cheaper than repairs.
Control the operating environment. Temperature extremes, dust, moisture, and vibration all reduce MTBF. Simple environmental controls (air filters, climate control, vibration dampening) can dramatically extend equipment life.
Use quality replacement parts. Non-OEM or counterfeit parts may be cheaper upfront but often have shorter lifespans and can introduce new failure modes. Track MTBF before and after parts changes to verify.
Implement predictive maintenance. Vibration analysis, thermal imaging, oil analysis, and other condition-monitoring techniques can detect problems weeks or months before failure, giving you time to plan a repair during a scheduled window instead of experiencing an unplanned breakdown.
Review and act on trends. A declining MTBF trend is an early warning signal. Don't wait until it becomes a crisis. Investigate when you see the numbers drop over two or more consecutive measurement periods.

Best Practices

Automate data collection. Manual failure logging is unreliable. Use runtime counters, IoT sensors, and maintenance management systems that automatically capture operating hours and failure events.
Calculate MTBF at multiple levels. Track it per individual asset (to spot problem equipment), per asset class (to compare models and vendors), and per site (to identify environmental or operational differences).
Pair MTBF with MTTR. Neither metric is complete alone. High MTBF with high MTTR still gives you poor availability. Low MTBF with very low MTTR might be acceptable for non-critical assets.
Set improvement targets. Don't just measure MTBF, set goals. "Improve fleet MTBF by 15% over 12 months" is actionable. "Track MTBF" is not.
Use MTBF in replacement decisions. When an asset's MTBF drops below a threshold (and especially when repair costs approach replacement cost), it's time to plan a replacement, not another repair.
Review quarterly. Monthly MTBF can be noisy (especially with low failure counts). Quarterly reviews smooth out the data and show real trends.

Mean Time To Repair (MTTR): The companion metric measuring how long repairs take
Preventive Maintenance, scheduled maintenance that directly improves MTBF
Predictive Maintenance: Condition-based maintenance that catches failures before they happen
Asset Utilization Rate, How intensively assets are used, which affects failure rates
Total Cost of Ownership, MTBF directly impacts lifetime maintenance costs
IoT Asset Monitoring, Sensors that provide real-time operating data for MTBF calculations
Work Order, The mechanism for recording and tracking each failure and repair event

MTBF as a Decision Tool

MTBF is your best single number for understanding equipment reliability. It tells you how often things break, helps you predict when they'll break next, and, when tracked over time, shows you whether your maintenance investments are actually working. The organizations that take MTBF seriously don't just react to failures; they systematically drive them down, reducing costs and increasing uptime with every improvement cycle.

Tracking MTBF with UNIO24

UNIO24 logs every maintenance event, failure, and repair for each asset with timestamps and details. This data history lets you calculate MTBF automatically, per asset, per category, per location. Spot your least reliable equipment, track reliability trends over time, compare assets side by side, and build a data-driven case for repair-vs-replace decisions. When your MTBF improves, you'll have the numbers to prove it.

Core Products

Additional Features

Industries

Popular Use Cases

Resources