📊 Diagram · Template · Healthcare QI

Root Cause Analysis Diagram

The fishbone (Ishikawa) diagram is the most widely used root cause analysis diagram in healthcare QI. It organises potential causes into categories, making the invisible visible. This page shows three diagram types, how to use each, and two NHS worked examples — including what a completed diagram looks like and how to move from causes to actions.

This page: ready-to-use RCA diagrams — blank fishbone template, completed NHS examples, 5 Whys diagram, Moonshot Process.
For the concepts: Root Cause Analysis — methods, frameworks and psychological why

Related: Never Events Bootstrap CUSUM analysis  ·  Joiner levels of fix  ·  5 Whys  ·  Evaporating Cloud
What you’ll be able to do after this page
▶ Open the StepChange Analyzer

Next: Interpret resultsWhat to do next

StepChangeAnalysis.com  ·  Concepts series  ·  June 2026
☰  Contents

The fishbone (Ishikawa) diagram

The fishbone diagram — also called the Ishikawa diagram or cause-and-effect diagram — was developed by Kaoru Ishikawa at Kawasaki in the 1960s. It maps potential causes of a problem into structured categories, revealing the range of factors that could be contributing. The diagram looks like a fish skeleton: the problem (effect) is the head; the bones are categories of causes; the sub-bones are specific causes within each category.

In healthcare QI the standard six categories are People, Process, Equipment, Environment, Materials, and Management (sometimes called 6Ms: Man, Method, Machine, Milieu, Material, Measurement). The categories are starting points, not constraints — adapt them to your specific problem.


Blank fishbone template

Fishbone (Ishikawa) diagram — blank template
Blank fishbone root cause analysis diagram template with six categories Problem (effect) write here People skills · training · fatigue Process method · steps · protocol Equipment tools · devices · systems Environment culture · place · workload Materials drugs · supplies · information Management decisions · policy · oversight causes

Each bone = a category of causes. Sub-branches (dashed) = specific causes within that category. The problem (effect) goes in the head box on the right.

How to use this template
  1. Write the problem clearly in the head box — be specific. “Delayed discharge” is better than “flow problems.” “Wrong-route medication administered to patient X on ward Y” is better than “medication error.”
  2. Brainstorm causes in each category — write on the sub-branches. Include everything that might be contributing, not just the obvious ones.
  3. Ask “why?” at least twice for each cause — the first answer is usually a symptom. The second or third answer starts to reach the system.
  4. Circle the root causes — the causes that, if addressed, would prevent recurrence. These are usually in Management, Environment, or Process — not in People.
  5. Verify with data — the diagram generates hypotheses. Bootstrap CUSUM on the outcome metric after the fix confirms whether a hypothesis was correct.

NHS example 1 — wrong-route medication error (never event)

Fishbone example — wrong-route medication administration (Never Event)
Completed fishbone diagram for wrong-route medication never event Wrong-route medication administered People Locum unfamiliar with ward Tired end of shift Process No independent check required Verbal handover only Equipment Similar connector design No NRFit connector fitted Environment High pressure, busy ward Poor lighting in bay Management NRFit rollout incomplete No mandatory double-check policy causes

The root causes (RC) are in Management — incomplete NRFit rollout and absence of a mandatory double-check policy. People and Environment contributed but are not root causes: addressing them alone would not prevent recurrence.

Why “locum unfamiliar with ward” is not a root cause

The most common mistake in RCA is stopping at the person. “Locum unfamiliar with ward” describes a contributing factor but not the root cause. The root cause question is: what system feature allowed a locum unfamiliar with the ward to administer medication via the wrong route without any check catching it?

The answer is in Management: NRFit connectors — which make wrong-route administration physically impossible — had not been fully rolled out, and no mandatory independent double-check policy existed. Address the system, and it doesn’t matter whether the nurse is familiar with the ward or not.

See Never Events: Wrong-Route Medication for the full Bootstrap CUSUM analysis of whether NHS Never Event programmes have produced structural change.


NHS example 2 — delayed discharge

Fishbone example — patient discharge delayed beyond clinical readiness
Completed fishbone diagram for delayed hospital discharge Discharge delayed beyond clinical date People Family not involved in planning No named discharge coordinator Process No EDD set at admission Ward round after midday System boundary Social care package not arranged No Transfer of Care Hub Environment Beds needed, pressure to fill Red/Green system not in use Management No criteria-led discharge policy Discharge not owned at exec level causes

Two constraint types visible: internal (Process, Management) and external (System boundary — social care). Addressing only internal causes will improve flow but not eliminate the external constraint. See Gloucestershire Bright Spot for how both were addressed simultaneously.


The 5 Whys diagram

The 5 Whys is a linear cause chain: you ask “why?” repeatedly until you reach the root cause. It is simpler than the fishbone and works well for straightforward single-cause problems. It works less well for complex multi-cause problems where several independent chains lead to the same outcome — a fishbone handles these better.

5 Whys diagram — wrong-route medication example
5 Whys diagram showing chain from problem to root cause Problem Wrong-route medication Why? Spinal and IV connectors look the same Why? NRFit connectors not fitted on this ward Why? Rollout programme not completed trust-wide Why? Root cause No executive accountable for NRFit completion

The 5 Whys traces one causal chain. The root cause is a management system failure (no executive accountable), not a human error.


Fault tree diagram

The fault tree diagram works top-down: starting from the failure event, it maps the combinations of conditions that could produce it using AND/OR logic gates. It is most useful for safety-critical systems where you need to identify all possible pathways to a failure, not just the most likely one. In healthcare, it is used in FMEA (Failure Mode and Effects Analysis) and safety case development.

For most QI improvement work, the fishbone or 5 Whys is more practical. Use fault tree analysis when you need to be exhaustive about failure pathways rather than when you need to identify the single most important cause.


Which diagram to use when

Situation Best diagram Why
Complex problem with multiple possible causes (most QI problems) Fishbone Organises brainstorming across categories; prevents fixating on one cause
Single clear cause chain to trace 5 Whys Faster; drives deeper on one chain; good for simpler problems or initial scoping
Safety-critical system; need to be exhaustive Fault tree Maps all failure pathways; AND/OR logic; used in FMEA
Want to show causal loops and feedback (systemic problems) Causal Loop Diagram Shows reinforcing loops; better for recurring system problems. See Causal Loop Diagrams
Contradictions keeping the problem in place Evaporating Cloud Reveals the assumption sustaining the conflict. See Evaporating Cloud

Moving from diagram to system cause

The most important rule in any RCA diagram: keep asking why until you reach a system cause, not a person cause. Most diagrams stop too early — at the action of an individual rather than at the system feature that allowed or produced that action.

A system cause is one that:

People causes (tiredness, distraction, unfamiliarity) describe human vulnerability to a system design failure. They are real contributing factors. They are not root causes — because addressing them without changing the system means the next person in the same situation faces the same risk.


From cause map to structural fix — the Moonshot Process

Drawing the fishbone diagram is Step 2 of improvement — seeing the structure. But most QI programmes stop there: the diagram is produced, the causes are listed, an action plan is written, the same event recurs. The reason is that RCA rarely reaches the structural assumption that is keeping the problem in place.

The Moonshot Process takes the fishbone forward into structural change. The name matters: calling it “Super RCA” makes people hear audit, blame, and paperwork. Calling it the Moonshot Process makes people hear future, growth, and strategy. The process is identical — the framing shifts the psychological weight from looking backward to moving forward.

It uses the same language QI practitioners already know — 5 Whys, Joiner levels, common and special cause variation — and adds one step most improvement processes miss: explicitly identifying and dissolving the contradiction that makes the structural fix seem impossible.

When to trigger a full structural RCA

Not every event requires a full structural investigation. Deming warned that confusing common and special cause is itself the root cause of most failed management. The rule of three provides a practical trigger:

Why three? A special cause happens once. When it happens three times, it is no longer an accident — it is a feature of the system design. You now have enough data to show management that this is a structural failure, not bad luck.

The three-phase process

The structural RCA process — three phases, ten steps
Three-phase structural RCA process: Phase 1 Diagnostic (5 steps), Phase 2 Resolution Contradiction Engine (4 steps), Phase 3 The Pivot (Moonshot)

Phase 1 finds the structural cause. Phase 2 dissolves the contradiction keeping it in place. Phase 3 reclaims the capacity that was being consumed by the problem and redirects it toward the improvement goal.

Phase Step What you do QI tool
Phase 1
The Diagnostic
1 Define the unacceptable event in one data-backed sentence (e.g. “17.5 wrong-route medication errors per year for a decade”) Bootstrap CUSUM to confirm the rate is stable — common cause, not improving
2 Map patterns — list recurring symptoms, run 5 Whys to find the immediate mechanical trigger 5 Whys, fishbone diagram
3 Audit the structure — look past the trigger to the rules, workflows, and physical environments that allow it to happen Process mapping, Going to the Gemba
4 Triage by Joiner level — classify all potential fixes. Levels 1–2 are containment. Levels 3–4 are structural design. The higher the level, the more effective — and the more politically difficult. Joiner levels of fix
5 Expose the hidden belief — identify the assumption that justifies the current structure (e.g. “safety is achieved through nurse vigilance”) Pre-mortem, constraint interview
Phase 2
The Resolution
6 Identify the contradiction — express the trade-off that makes the structural fix seem impossible Evaporating Cloud
7 Dissolve via TRIZ separation — do not compromise. Redesign the system to separate the conflicting goals in space, time, condition, or system level TRIZ separation principles
8 Verify for evolution — ensure the solution allows future improvement. If it locks the system down so much that improvement becomes impossible, iterate. PDSA cycle
9 Sunset the backdoor — identify the old dangerous path and explicitly migrate away from it. The sunset is not a wall; it is a bridge to a safer architecture. Change management, policy
Phase 3
The Pivot
10 Reclaim and redeploy — the capacity that was being consumed by the recurring problem is now available. Redirect it toward the improvement goal. Bootstrap CUSUM pre-committed prediction to confirm the change point

Worked example — wrong-route medication Never Events

17.5 wrong-route medication errors per year for a decade. Classified as “wholly preventable.” Yet they kept recurring. The reason: the process stopped at the human (the nurse who made the error) rather than the system (the connector design that made the error possible). Here is what the full structural RCA looks like:

Wrong-route medication — event to structural fix
Worked example: wrong-route medication Never Event traced from the event through pattern, structure, mindset to Joiner level fixes including ENFit connector mandate

The left chain shows what the 5 Whys reveals. The right chain shows the Joiner level of fix appropriate at each level. The structural fix (Level 3: ENFit connector mandate) makes the error physically impossible — without relying on human vigilance.

Why “retrain the nurse” is not a root cause fix

The most common RCA failure is stopping at the person. Retraining addresses a Level 1 symptom. The structural question is: what system feature allowed a trained nurse to make this error without any physical barrier preventing it?

The answer is in the equipment design: oral medication syringes (Luer connectors) were physically compatible with IV lines. The connector design made the error possible. No amount of retraining changes the physics. The Level 3 fix — ENFit non-interchangeable connectors — makes wrong-route administration physically impossible. The fix is in the design, not the person.

The test: “If I am still relying on human willpower or vigilance to avoid the error, have I actually solved it?” If the answer is yes, you are still in the danger zone. Iterate on the design.

The Iceberg Why questions — what to ask at each level

The “5” in 5 Whys is a rule of thumb, not a formula. The real question is: have you reached the Mindset level? Here are the questions to ask at each level of the Iceberg.

Why 1 — Questions to uncover the Pattern

You are proving to leadership that this is not an isolated accident. You are looking for history, frequency, and correlation.

Whys 2 and 3 — Questions to uncover the Structure

You are looking for the physical, digital, and procedural rules that made the error possible. You are actively taking the blame off the human.

Whys 4 and 5 — Questions to uncover the Mindset

The hardest level to reach because people are afraid to talk about it. You know you have reached it when answers shift from tools and processes to incentives, budgets, and fears.

Applied to the Never Events example:

Identifying and dissolving the contradiction

The Never Event contradiction — and three ways to dissolve it
The Never Event contradiction: versatility vs route-exclusivity, dissolved by TRIZ separation in Space (ENFit), Condition (colour coding) and System (remove Luer supply chain)

Structure: universal Luer connectors. Mindset: “vigilance is our safety.” Conflict: versatility vs route-exclusivity. Three TRIZ separations dissolve it without compromise.

Two key principles to prevent recurrence

Two key principles for structural safety
Two key principles: 1. Fix systems not people - blameless culture. 2. Defence in depth - limit the blast radius. Human error is a symptom never a root cause.

Principle 1: Human error is a symptom, never a root cause. Design environments where the wrong choice is hard and the right choice is easy. Principle 2: All humans and processes will eventually fail — aim for safe failure, not perfect prevention.

The sunset and migration — what it looks like in practice

From the legacy path to the platform path
Legacy path: manual decanting and universal Luer syringes. Sunset and migrate to standardised path: ready-to-administer oral packages and ENFit connection making error physically impossible. The Moonshot: zero-harm delivery plus reclaimed staff time.

The sunset is not a wall — it is a bridge. The old dangerous path is explicitly replaced with a superior safe path. The capacity previously spent managing the risk is reclaimed and redeployed.

Closing the loop with Bootstrap CUSUM

The three-phase process produces a structural change. Bootstrap CUSUM applied to the outcome metric — wrong-route medication events per month — confirms whether the change point appeared after the ENFit rollout, and whether it is sustained.

The pre-committed prediction: “Wrong-route medication events will fall to zero within 12 months of full ENFit connector rollout across all wards.” If the Bootstrap CUSUM flat line persists after rollout, either the rollout is incomplete, or the connector was not the binding constraint. Both are valuable learning.

See Never Events: Wrong-Route Medication for the Bootstrap CUSUM analysis of NHS Never Event data.


Verifying with data — closing the loop

The fishbone diagram generates hypotheses about what caused the problem. It does not prove which cause was the binding one. Only data can do that.

After implementing the fix, Bootstrap CUSUM applied to the outcome metric confirms whether a structural change point occurred. If the fix addressed the right root cause, the metric changes. If it was a flat line, either the wrong cause was identified, the fix was insufficient, or the data period is too short.

This is the Step 4 of the five-step improvement framework: test honestly. The diagram is Step 2 (see the structure) and Step 3 (challenge the assumption). Bootstrap CUSUM closes the loop.

▶ Open the StepChange Analyzer

Next: Interpret resultsWhat to do next