Mean Time To Repair (MTTR)

What is Mean Time To Repair (MTTR)?

Mean Time To Repair (MTTR) measures how long it takes, on average, to get a broken asset back to working condition. The clock starts when the failure is detected and stops when the asset is fully operational again. If your MTTR for a production machine is 6 hours, that's 6 hours of lost capacity every time it breaks down.

MTTR is one of the most practical maintenance metrics because it directly translates to downtime cost. A machine with excellent reliability (high MTBF) but terrible repairability (high MTTR) can still cause major operational problems. MTTR tells you how good your organization is at recovering from failures — not just preventing them.

Important distinction: MTTR is sometimes also used to mean "Mean Time To Recover" or "Mean Time To Respond." In this article, we focus on the maintenance definition: the average time from failure detection to full restoration of service.

How to Calculate MTTR

Basic Formula

MTTR = Total Repair Time / Number of Repairs

"Repair time" includes everything from the moment a failure is noticed to the moment the asset is back in normal operation.

Calculation Examples

Simple case: A conveyor belt failed 5 times over the past year. Total time spent on repairs: 35 hours. MTTR = 35 / 5 = 7 hours per repair

Fleet average: Your fleet of 20 forklifts had 40 repair events this quarter. Combined repair time: 120 hours. Fleet MTTR = 120 / 40 = 3 hours per repair

Using MTTR to estimate annual downtime: If an asset has an MTBF of 1,000 hours and an MTTR of 8 hours, and it runs 4,000 hours/year:

  • Expected failures per year: 4,000 / 1,000 = 4
  • Expected annual downtime: 4 × 8 = 32 hours/year

What MTTR Includes

The repair clock covers the entire sequence — not just wrench time:

The MTTR Timeline

PhaseDescriptionTypical % of Total MTTRKey Driver
DetectionRecognizing the failure occurred5–15%Monitoring systems, operator awareness
ResponseGetting a technician to the asset10–25%Staffing levels, geography, scheduling
DiagnosisIdentifying the root cause15–30%Technician skill, documentation, diagnostics
Parts procurementObtaining replacement parts0–40%Spare parts inventory, supplier lead times
Repair executionPerforming the actual fix20–35%Technician skill, tool availability, complexity
Testing & validationVerifying the fix works5–10%Test procedures, quality requirements

The biggest surprise for most organizations: the actual hands-on repair is often less than a third of total MTTR. The rest is waiting — waiting for someone to notice, waiting for a technician, waiting for parts.

This is important because it means MTTR improvements often come from logistics and process changes, not just better technical skills.

MTTR Variations: What Exactly Are You Measuring?

The acronym MTTR is used for several related but different metrics. Be clear about which one you're tracking:

VariationFull NameClock StartsClock StopsIncludes
MTTR (Repair)Mean Time To RepairWhen repair work beginsWhen asset is functionalDiagnosis + repair + testing
MTTR (Recover)Mean Time To RecoverWhen failure occursWhen asset is back in productionEverything including detection and response
MTTAMean Time To AcknowledgeWhen failure occursWhen someone begins respondingDetection + response time only
MTTRSMean Time To Restore ServiceWhen failure occursWhen service is restored (may use backup)All time, but can use workarounds

Recommendation: Use "Mean Time To Recover" as your primary metric — it captures the full downtime experience. But also track the sub-components (response time, parts wait time, wrench time) to identify where improvements are possible.

Asset Availability: Combining MTBF and MTTR

The ultimate goal of tracking both MTBF and MTTR is to calculate availability — the percentage of time an asset is operational.

Availability = MTBF / (MTBF + MTTR)

ScenarioMTBFMTTRAvailabilityInterpretation
Reliable, quick repair2,000 hrs4 hrs99.8%Best case — rare failures, fast fixes
Reliable, slow repair2,000 hrs48 hrs97.7%Good reliability undermined by slow repairs
Unreliable, quick repair200 hrs2 hrs99.0%Breaks often but recovered quickly
Unreliable, slow repair200 hrs24 hrs89.3%Worst case — frequent and prolonged downtime

Key insight: Improving MTTR from 48 hours to 4 hours on a reliable machine (MTBF 2,000) increases availability by 2.1 percentage points. The same improvement on an unreliable machine (MTBF 200) increases availability by 9.7 percentage points. MTTR improvements have the biggest impact on frequently-failing equipment.

Who Needs MTTR and When

  • Maintenance managers — Weekly/monthly. Track team performance, identify bottlenecks (is it diagnosis time? parts wait? response time?), set improvement targets.
  • Operations managers — Monthly. Understand how much production capacity is lost to repairs. Plan buffer capacity around high-MTTR equipment.
  • IT managers — Per incident. Track resolution times for servers, network equipment, and end-user devices against SLAs.
  • Facilities managers — Monthly. Monitor repair responsiveness for HVAC, elevators, security systems — anything that affects building occupants.
  • Finance teams — Quarterly. Calculate the cost of downtime. If MTTR × downtime cost per hour × failures per year = a big number, that justifies investment in faster repair capability.
  • Procurement — At purchase decisions. Equipment that's hard to repair (proprietary parts, specialized tools required) will have high MTTR regardless of reliability. Factor repairability into buying decisions.

Real-World Examples

Example 1: Reducing MTTR in a Distribution Center

A distribution center had 12 conveyor systems. Average MTTR: 4.2 hours. With an average of 8 failures/month, that was 33.6 hours of downtime per month — costing approximately $2,800/hour in lost throughput.

Analysis of MTTR breakdown:

PhaseAverage Time% of MTTR
Detection18 min7%
Response (tech arrives)52 min21%
Diagnosis48 min19%
Parts procurement72 min29%
Repair45 min18%
Testing17 min7%
Total252 min (4.2 hrs)100%

The two biggest time sinks: response time (52 min — technicians were based in a single location and had to travel) and parts procurement (72 min — common parts weren't stocked on-site).

Changes made:

  1. Stationed a technician at each end of the distribution center during peak shifts (response time: 52 → 12 min)
  2. Installed parts kits at each conveyor section with the 15 most common failure parts (parts time: 72 → 8 min)
  3. Added IoT vibration sensors to detect bearing failures before complete breakdown (detection time: 18 → 5 min for predicted failures)

Results:

  • MTTR dropped from 4.2 hours to 1.4 hours (67% reduction)
  • Monthly downtime: 33.6 → 11.2 hours
  • Annual savings: approximately $752,000 in recovered throughput
  • Investment: $95,000 (parts kits, sensors, staffing adjustment)
  • Payback: 7 weeks

Example 2: IT Help Desk MTTR Optimization

An organization with 800 employees tracked MTTR for laptop/desktop issues:

Before optimization:

  • Average MTTR for hardware issues: 18 hours (including overnight waits)
  • Average MTTR for software issues: 6 hours
  • Total IT downtime per month: ~280 employee-hours
  • Estimated productivity loss: $14,000/month

Root causes of high MTTR:

  • No spare devices available (employees waited for repair)
  • Help desk tickets queued by arrival time, not urgency
  • Many issues required on-site visits that could have been resolved remotely
  • No standard diagnostic procedures — each tech approached problems differently

Changes made:

  1. Maintained a pool of 25 pre-configured "hot spare" laptops — immediate swap while failed device gets repaired
  2. Implemented priority-based ticket routing (executive and production-critical users first)
  3. Deployed remote management tools for software issues
  4. Created standardized troubleshooting playbooks for the 20 most common issues

Results:

  • Hardware MTTR (from user's perspective): 18 hours → 45 minutes (hot swap)
  • Software MTTR: 6 hours → 1.5 hours
  • Monthly productivity loss: $14,000 → $3,200
  • Employee satisfaction with IT support: 3.1/5 → 4.4/5

Common Mistakes

  1. Only measuring wrench time. If you only track how long the actual repair takes, you miss 60–70% of the real downtime. Detection, response, and parts waiting are usually the bigger contributors.
  2. Not breaking MTTR into components. "Our MTTR is 6 hours" is useful but not actionable. "Our MTTR is 6 hours — 2 hours waiting for parts, 1.5 hours waiting for a technician, 1.5 hours diagnosis, 1 hour repair" tells you exactly where to improve.
  3. Averaging across all asset types. A blended MTTR that mixes servers, vehicles, and office furniture is meaningless. Calculate MTTR per asset class and per criticality level.
  4. Treating all failures equally. A 10-minute reset of a tripped breaker and a 40-hour rebuild of a gearbox are fundamentally different events. Consider tracking MTTR by failure severity.
  5. Not connecting MTTR to cost. An MTTR of 4 hours means nothing without context. 4 hours of downtime on a $50/hour asset is $200. On a $5,000/hour production line, it's $20,000. Prioritize MTTR improvements where the financial impact is highest.
  6. Improving repair speed but ignoring prevention. Faster repairs are good, but fewer repairs are better. Always pair MTTR reduction efforts with MTBF improvement initiatives.

How to Reduce MTTR

  1. Stock critical spare parts on-site. The single biggest MTTR improvement in most organizations. If the part is on the shelf, you save hours or days of waiting. Use reorder points to keep critical spares stocked automatically.
  2. Implement remote monitoring and diagnostics. IoT sensors and remote access tools let technicians diagnose problems before they arrive — sometimes even fix them remotely. This cuts diagnosis and response time dramatically.
  3. Standardize repair procedures. Document the diagnostic steps and repair procedures for common failures. When every technician follows the same proven process, you eliminate trial-and-error troubleshooting.
  4. Cross-train your maintenance team. If only one person can fix the CNC machine and they're on vacation, your MTTR includes their return flight. Ensure at least two people can handle each critical asset type.
  5. Pre-position tools and equipment. Mobile tool carts at the asset location, diagnostic equipment kept near the machines it serves, PPE ready at work stations. Small logistics improvements compound into significant time savings.
  6. Use work orders with full asset context. When a technician opens a work order and sees the asset's complete repair history, manual, parts list, and known failure modes, they can start the repair immediately instead of researching first.
  7. Implement hot-swap programs for critical assets. For items where MTTR directly impacts user productivity (laptops, phones, vehicles), maintain a pool of pre-configured spares. Swap immediately, repair in the background.

Best Practices

  1. Track MTTR by phase, not just in total. Measure detection time, response time, diagnosis time, parts wait time, repair time, and testing time separately. You can't improve what you can't see.
  2. Set MTTR targets by asset criticality. Production-critical equipment might need a 2-hour MTTR target. A spare conference room projector might be fine with 48 hours. Don't apply one standard to everything.
  3. Conduct post-repair reviews for long-MTTR incidents. Any repair that takes significantly longer than average deserves a review: what caused the delay, and how can you prevent it next time?
  4. Benchmark internally before externally. Compare MTTR between shifts, sites, and technicians within your organization. Internal variation is usually larger than you think and easier to address.
  5. Automate time tracking. Use work order systems that timestamp each phase automatically. Manual timekeeping is unreliable and will drift toward underreporting.
  6. Report MTTR trends monthly. Show the trend alongside the absolute number. An MTTR of 4 hours that's been declining for 6 months tells a different story than 4 hours that's been increasing.

Conclusion

MTTR tells you how quickly your organization recovers from equipment failures. It's not just a maintenance metric — it's a measure of operational resilience. Every hour saved in MTTR is an hour of recovered production, service, or employee productivity. The most effective way to reduce MTTR isn't necessarily to repair faster — it's to eliminate the waiting: waiting for detection, waiting for someone to respond, waiting for parts. Fix the logistics, and the repair times often take care of themselves.

Tracking MTTR with UNIO24

UNIO24 captures timestamps at every stage of the repair process — when the issue was reported, when a technician was assigned, when work began, and when the asset was returned to service. This gives you accurate MTTR calculations with full phase breakdowns. Identify which part of the repair cycle takes the longest, compare response times across teams and locations, and track your improvement over time. Every work order becomes a data point that helps you repair faster next time.