🌿 Environment & Policy · London Air Quality Analysis

ULEZ Worked — But Not When Anyone Expected

In October 2018, eighteen months before the Ultra Low Emission Zone officially launched, Marylebone Road NO2 dropped 24.7% at 99.9% confidence. Rational drivers were already complying with a policy that hadn’t started yet. Bootstrap CUSUM on a decade of measured data finds three things neither side of the ULEZ debate predicted — and one finding that should change how every city in the world designs and evaluates clean air policy.

By Syd Stewart  ·  StepChangeAnalysis.com  ·  May 2026  ·  Data: Defra AURN, Marylebone Road (MY1), Monthly Mean NO2, 2015–2026
Method: Bootstrap CUSUM  ·  Open the StepChange Analyzer
📋 Article Summary  ·  ⇣ Download Executive Summary PDF  ·  ⇣ Download the data CSV
The anticipatory effect
The ULEZ signal appeared 18 months before launch. Bootstrap CUSUM dates the first structural change to October 2018 — a 24.7% drop at 99.9% confidence — driven by drivers and fleet operators upgrading vehicles in advance of the April 2019 launch. Evaluating ULEZ by comparing before and after April 2019 misses the effect entirely.
COVID was the biggest change
The largest single structural improvement in a decade of Marylebone Road data was COVID: −34.6% at 100% confidence from February 2020. The benefit was real. The cost — hundreds of thousands of deaths and economic damage on a wartime scale — was not in the data.
The London-wide expansion worked
The August 2023 London-wide expansion produced a detectable structural change point in March 2024 — seven months later, within the expected policy lag. At 100% confidence, NO2 fell from 43.00 to 31.78 µg/m³ — the first time Marylebone Road’s stage mean has been below the 40 µg/m³ legal limit.
Legal limit met; health standard not
31.78 µg/m³ is below the UK legal limit of 40 µg/m³ — a milestone for a site in breach of legal standards for decades. But the WHO revised its guideline to 10 µg/m³ in 2021. The Stage 4 mean is still more than three times the health standard.
Method — Bootstrap CUSUM: Model-free step-change detection applied directly to measured data, no counterfactual assumptions. A structural change is confirmed when the cumulative sum of deviations from the mean crosses the bootstrap confidence threshold — stating precisely when the process changed, by how much, and at what confidence.  ·  Data: Defra AURN, Marylebone Road (MY1), monthly mean NO2  ·  N=134 monthly observations  ·  Conf=99.7%  ·  Loops=1000  ·  Data file: ulez-no2-monthly.csv
📊 New to Bootstrap CUSUM?

Same Data, Three Charts, Three Very Different Stories explains what the green CUSUM line means and why it detects structural change that other charts miss — including a step-by-step guide to reading the chart. Takes 5 minutes and makes every chart in this article easier to read.

Read above first   📚 Glossary — CUSUM, Deming, Meadows, Joiner, PDSA and more
☰  Table of Contents — click to expand or collapse

The most contested policy in recent London history

The ULEZ debate has been conducted almost entirely without reference to what the measured data actually shows. Supporters cite modelled estimates of pollution reduction. Critics cite modelled estimates of economic harm. Both sides argue from assumptions about what would have happened without the scheme. Neither side has asked the most important question: when did London’s roadside NO2 actually, structurally change — and is that date consistent with the policy causing it? Bootstrap CUSUM answers that question directly from a decade of measured data. The answer is not what either side expected.

Both sides produce data. Both sides produce models. Neither side has asked the most important question in a rigorous way: when did London’s roadside air quality actually, structurally change — and is that date consistent with ULEZ causing it?

Bootstrap CUSUM is designed to answer exactly that question. It applies a model-free, assumption-free statistical algorithm directly to the measured NO2 data and asks: when did the process structurally change? The answer comes with a confidence level and a precise date. No counterfactual modelling. No assumptions about what would have happened. Just the measured data and a statistical verdict.

🕑 Key dates: policy and CUSUM change points

Oct 2017 T-charge launched. £10 daily charge for the most polluting vehicles in central London. First significant vehicle emission charge.
Oct 2018 CUSUM Stage 2 change point. NO2 drops -24.7% at 99.9% confidence. Anticipatory ULEZ compliance — drivers upgrading vehicles 18 months before the April 2019 launch.
Apr 2019 Central London ULEZ launched. Euro 4 petrol / Euro 6 diesel standard or £12.50 per day. The CUSUM change point predates this by 18 months.
Feb 2020 CUSUM Stage 3 change point. COVID begins reducing traffic. NO2 drops -34.6% at 100% confidence — the largest single structural change in the dataset.
Oct 2021 Inner London ULEZ expansion. Zone extended to North and South Circular roads. No additional CUSUM change point detected at this date.
Aug 2023 London-wide ULEZ expansion. All 33 boroughs. The most politically contested phase.
Mar 2024 CUSUM Stage 4 change point. NO2 drops -26.1% at 100% confidence. Seven months after the London-wide expansion — within the expected policy lag window.

The data: Marylebone Road, 2015–2026

Marylebone Road is one of the most heavily monitored air quality sites in the UK. It sits in the heart of the original central London ULEZ zone, carries some of the highest traffic volumes in London, and has been continuously monitored by the Defra Automatic Urban and Rural Network since the 1990s. It is the site most directly affected by the April 2019 ULEZ launch — and the site where any ULEZ signal should be most clearly visible.

Monthly mean NO2 readings from April 2015 to May 2026 give 134 observations. Settings: 99.7% confidence, Turn Length 5, Loops 1000. Blank days (monitoring outages) were removed before aggregating to monthly means.

📊 Reproducing this analysis: Data downloaded from the Defra UK AIR Data Selector. Go to uk-air.defra.gov.uk/data/data_selector_service, select Search Hourly Networks, site: London Marylebone Road (MY1), pollutant: NO2, statistics: daily mean, 2015–2026. Aggregate to monthly means in Excel, removing blank days. Upload to StepChangeAnalysis.com with Y = NO2 column.

📊 A note on missing readings: Automatic monitoring stations occasionally go offline for maintenance, calibration, or technical failure, leaving days with no recorded value. Before calculating monthly means, the daily data was filtered in Excel to exclude all blank days — only days with valid NO2 readings were included in the pivot table that calculated each monthly average. This means each monthly mean reflects only verified measurements, with no zeros or blanks distorting the average. The CUSUM result is robust: the four stages and their confidence levels are consistent across the dataset.

Bootstrap CUSUM — Marylebone Road NO2 2015-2026 — 4 structural stages
Marylebone Road monthly mean NO2 (µg/m³), April 2015–May 2026. Bootstrap CUSUM, 99.7% confidence, N=134 monthly observations, Loops=1000. Four structural stages. Stage 2 (Oct 2018): pre-ULEZ compliance. Stage 3 (Feb 2020): COVID. Stage 4 (Mar 2024): London-wide ULEZ effect.

Four stages: what the CUSUM actually shows

StagePeriodMean NO2ChangeConfInterpretation
1 Apr 2015–Oct 2018 87.21 µg/m³ Baseline Baseline Pre-policy baseline. More than double the 40 µg/m³ legal limit. The T-charge launched Oct 2017 but its effect does not yet appear as a structural change.
2 Oct 2018–Feb 2020 65.71 µg/m³ −24.7% 99.9% Anticipatory ULEZ compliance. The change point is 18 months before the April 2019 ULEZ launch. Rational actors responded to the announced future policy by upgrading vehicles in advance. This is the policy lag in reverse — the signal appearing before the official start date.
3 Feb 2020–Mar 2024 43.00 µg/m³ −34.6% 100% COVID structural break. The largest single change in the dataset. Traffic collapsed; NO2 followed. Still above the 40 µg/m³ legal limit. The inner London expansion (Oct 2021) falls within this stage — no additional change point detected.
4 Mar 2024–present 31.78 µg/m³ −26.1% 100% London-wide ULEZ expansion effect. Change point March 2024 — seven months after the August 2023 London-wide expansion. Within the expected 6–12 month policy lag. First time Marylebone Road mean falls below the 40 µg/m³ legal limit. Still above the WHO guideline of 10 µg/m³.
Stage summary table — Marylebone Road NO2 4 stages
Stage summary: four structural stages with precise dates, means, confidence levels and percentage changes. Two stages are negative (deterioration prevented or noise). Two are positive structural improvements attributable to policy and COVID respectively.

The three findings that matter

Finding 1: ULEZ produced a detectable effect — but it appeared 18 months before launch. The Stage 2 change point is October 2018, not April 2019. This is the anticipatory compliance effect — drivers and fleet operators responding to an announced future policy by upgrading vehicles in advance. This is rational behaviour and a genuine ULEZ signal. It also means that evaluating ULEZ solely by comparing before and after April 2019 misses the effect entirely.
Finding 2: COVID was the largest single structural change — by a significant margin. The Stage 3 change point (February 2020, -34.6%, 100% confidence) dwarfs every other structural change in the dataset. It is worth pausing on what that represents. The largest single improvement in Marylebone Road air quality in a decade was a byproduct of a global catastrophe that killed hundreds of thousands in the UK alone and caused economic damage on a scale not seen since the Second World War. The data shows the benefit. The cost was incalculable.
Finding 3: The London-wide ULEZ expansion (Aug 2023) produced a genuine detectable structural change. Stage 4 change point is March 2024 — seven months after the London-wide expansion. This is within the 6–12 month policy lag we would expect. At 100% confidence, this is not a marginal finding. The mean fell from 43.00 to 31.78 µg/m³ — the first time Marylebone Road has been below the 40 µg/m³ legal limit as a stage mean. The London-wide expansion worked, at this site, within the expected timeframe.

📚 How does this compare with peer-reviewed research?

A study from the University of Birmingham (Tong et al., npj Clean Air, October 2025) found no significant city-wide impact on NO2 or NOx following the 2023 London-wide ULEZ expansion. The authors attribute this to the high compliance rate already achieved by that point — by the time the expansion launched, only 7.4% of vehicles across London were non-compliant, falling to 4.2% within three months. With so few non-compliant vehicles remaining, there was little room for further aggregate improvement.

This finding is not inconsistent with the Marylebone Stage 4 result. The Birmingham study measured city-wide NO2 averages across the whole of Greater London. This analysis measures a single specific site — Marylebone Road — which carries substantial through-traffic from outer London boroughs that were outside the zone until August 2023. City-wide averaging and single-site Bootstrap CUSUM are measuring different things. The Birmingham study confirms the 2023 expansion found little room for aggregate improvement across London as a whole. The Marylebone data suggests site-specific gains are still detectable where through-traffic composition changed materially.

The two approaches are complementary. Bootstrap CUSUM at individual monitoring sites can detect localised signals that city-wide averaging smooths away. Both findings together tell a more complete story: the 2023 expansion made little difference to aggregate London air quality because compliance was already high, but it produced measurable improvements at specific high-traffic arterial sites where outer London through-traffic had previously been exempt.

How did a London-wide expansion reduce NO2 at a central London site already in the zone?

This is the right question to ask. Marylebone Road has been inside the original ULEZ zone since April 2019. By August 2023, virtually every vehicle passing through it was already compliant. So how did expanding the zone to outer London reduce NO2 at Marylebone four years later?

The answer is traffic composition and origin. Marylebone Road is part of the A501 inner ring road — a major arterial route used by vehicles travelling across London from all directions. Before August 2023, a driver from Croydon, Bromley, or Enfield could drive a non-compliant vehicle right up to and through central London. After August 2023, every vehicle entering London from any direction had to be compliant or pay the charge. Four specific mechanisms explain the Stage 4 improvement:

📈 What Bootstrap CUSUM adds that counterfactual modelling cannot

The GLA and TfL use counterfactual modelling to estimate ULEZ’s contribution — comparing measured NO2 with modelled estimates of what NO2 would have been without the scheme. These are carefully produced estimates, but they are estimates of a hypothetical.

Bootstrap CUSUM does something different: it interrogates the measured data directly, with no model of what might have happened. It asks only: did the process structurally change, when, and how confident are we? The Marylebone analysis finds three structural changes, with high confidence, at dates consistent with the policy timeline. The two approaches are complementary — counterfactual modelling estimates the size of the ULEZ contribution, Bootstrap CUSUM confirms that structural changes occurred and dates them precisely.


Why did the improvement hold after COVID?

Stage 3 runs from February 2020 to March 2024 — a period of four years during which London traffic broadly recovered to pre-pandemic levels. NO2 did not recover. The stage mean of 43.00 µg/m³ is structurally lower than the Stage 2 mean of 65.71 µg/m³, even as traffic returned. Several factors combined to maintain the improvement:


What does the NO2 reduction mean for health?

NO2 is not merely a regulatory metric. It is a respiratory irritant that inflames the airways, reduces lung function, and increases susceptibility to respiratory infections. Long-term exposure is associated with increased risk of cardiovascular disease, asthma, and premature death. In Greater London in 2019, between 3,600 and 4,100 deaths per year were estimated to be associated with exposure to fine particulate matter and nitrogen dioxide combined.

The UK legal annual mean limit of 40 µg/m³ was set in 2000, based on World Health Organisation guidelines at the time. Stage 1 mean at Marylebone Road was 87.21 µg/m³ — more than double the legal limit. Stage 4 mean is 31.78 µg/m³ — the first stage to fall below the legal limit. That is a significant milestone for a site that has been in breach of legal air quality standards for decades.

However the WHO revised its guidelines in 2021, tightening the annual mean NO2 guideline from 40 µg/m³ to just 10 µg/m³ — reflecting accumulated evidence that health effects occur at much lower concentrations than previously thought. The Stage 4 mean of 31.78 µg/m³ is still more than three times the current WHO guideline. The legal limit has been met. The health standard has not.

🌟 How was the legal limit set?

The UK legal annual mean limit of 40 µg/m³ derives from the EU Air Quality Directive (1999/30/EC), which was based on WHO guidelines as they stood in the late 1990s. The limit was adopted into UK law and has not been updated since, despite the WHO revising its guideline downward to 10 µg/m³ in 2021. This means the UK legal limit is now four times higher than the level the WHO considers safe based on current evidence. Meeting the legal standard — as Marylebone Road’s Stage 4 mean now does — is a genuine achievement. It is not the same as meeting the health standard.


What this means for other cities

London is the most cited example of a large-scale urban ULEZ. Cities around the world — from Paris to Seoul to New York — are considering similar schemes. The Marylebone Bootstrap CUSUM analysis has several specific implications for how those cities should design and evaluate their programmes.

First, the anticipatory compliance effect is real and substantial. A 24.7% improvement appeared 18 months before the scheme launched. Cities should begin monitoring immediately on announcement — the pre-launch period contains a genuine policy signal that post-launch analysis will miss if the baseline is set at the launch date.

Second, the policy lag for city-wide schemes is 6–12 months. The London-wide expansion of August 2023 produced a detectable change point in March 2024 — seven months later. This lag is not a failure of the scheme; it is the time required for the vehicle fleet to turn over and for the new compliance standard to work through normal driving patterns. Bootstrap CUSUM provides an objective way to measure this lag precisely: rather than declaring a policy successful or failed at launch, you set a minimum observation period of 12 months and let the CUSUM tell you when — and if — a structural change has occurred.

Third, evaluate at the monitoring site level, not just at the city-wide average level. Bootstrap CUSUM applied to monitoring sites within the affected zone gives a more sensitive and accurate assessment of policy impact than averaging across the whole city.


Three charts, three perspectives on the same data

The Bootstrap CUSUM step-change analysis is the primary tool in this article — but the same Marylebone Road dataset produces three different and complementary perspectives when viewed through different chart types. This mirrors the theme explored in the Three Charts article elsewhere on this site.

The Run Chart: confirming the shift below the median

Marylebone Road NO2 Run Chart 2015-2026
Run Chart showing sustained shifts above and below the overall median (48.5 µg/m³). Red dots indicate 6 or more consecutive months above the median — the pre-2020 baseline. Blue dots indicate 6 or more consecutive months below the median — the post-COVID sustained improvement. The run chart confirms the Bootstrap CUSUM finding visually without any statistical modelling.

The run chart is the simplest of the three. It plots each monthly reading against the overall median and flags runs of 6 or more consecutive points on the same side — a signal that the process has shifted. The sustained red run pre-2020 and the sustained blue run post-2020 visually confirm what the Bootstrap CUSUM found statistically: the process shifted structurally around 2020 and has not returned.

The X-mR Control Chart: why a single process mean is misleading

Marylebone Road NO2 X-mR Control Chart 2015-2026
Shewhart X-mR control chart. Control limits (UNPL 73.9, LNPL 40.8 µg/m³) are calculated from the overall process mean across all 134 observations. The upper and lower charts show individual values and moving ranges respectively. Most pre-2020 readings breach the upper natural process limit — confirming those observations are not common cause variation around the overall mean but a structurally different process.

The X-mR chart applies a single set of control limits calculated from the overall mean of all 134 observations. The result reveals why a single process mean is inappropriate for this data: nearly all pre-2020 readings breach the upper natural process limit (73.9 µg/m³), and many post-2020 readings approach the lower limit (40.8 µg/m³). This is not a process in statistical control — it is a process that has changed fundamentally. The X-mR chart signals that something is wrong with treating this as a single stable process. The Bootstrap CUSUM answers the follow-up question: when did it change, by how much, and how confident are we?

The Bootstrap CUSUM Step Change Analysis: dating the changes precisely

Bootstrap CUSUM — Marylebone Road NO2 2015-2026 — 4 structural stages
Bootstrap CUSUM step-change analysis. Four structural stages at 99.7% confidence with 3SD limits shown. The stage boundaries are dated precisely and each carries a confidence level. This is the only chart of the three that can answer: when exactly did the process change, is that change statistically distinguishable from normal variation, and how large was it?

The three charts answer three different questions. The run chart asks: has the process shifted above or below its median? The X-mR chart asks: are individual observations within the expected range of a stable process? The Bootstrap CUSUM asks: when did the process structurally change, by how much, and with what confidence? For policy evaluation — where the question is whether a specific intervention at a specific date produced a genuine structural change — Bootstrap CUSUM is the right tool. The other two charts provide confirmation and context.


Try it yourself: the data is public, the tool is free

The Defra UK AIR Data Selector contains daily NO2 data for every automatic monitoring site in England going back decades. Marylebone Road (site code MY1) is one of the most complete and longest-running series. Downloading the daily data, aggregating to monthly means, removing blank monitoring days, and uploading to StepChangeAnalysis.com reproduces this analysis exactly in approximately twenty minutes.

FileDescriptionRowsDownload
ulez-no2-monthly.csv Marylebone Road monthly mean NO2 (µg/m³), Apr 2015–May 2026, blank days removed 134 ⇣ Download

📊 Data note: Marylebone Road (site MY1), Defra AURN Automatic London Network, daily mean NO2 aggregated to monthly means, April 2015–May 2026, N=134. Blank monitoring days removed before aggregation. Source: uk-air.defra.gov.uk/data/data_selector_service. WHO annual mean guideline: 10 µg/m³. UK legal limit: 40 µg/m³.

Reproduce this analysis on your own data

Upload any environmental or policy time series as a CSV and apply Bootstrap CUSUM step-change analysis. Free, browser-based, no data leaves your computer.

📊 Open the Free Tool
📈 Part of the StepChange improvement concepts library

This analysis sits within a broader framework for understanding why improvement programmes succeed or fail. Start with Why Nothing Changes for the full picture, or go to Start Here for a guided introduction to the method.