Number of Releases - the context metric
Why the number of releases is the common denominator for every QA metric. How to normalize, the link to Deployment Frequency (DORA), and why 3 bugs across 2 releases is a crisis while 3 across 15 is a win. Article 6 of 9.
Series: QA Leadership · Article 6 of 9
Picture two teams. Both report: "This quarter we had 3 bugs in production." One deserves a bonus for it, the other should stop shipping immediately and run a deep retrospective. Where is the difference? In the number of releases - the one metric neither of them put in the report.
The same absolute number can describe two completely different realities. That is exactly why Number of Releases is not a metric you simply present - it is the foundation through which you judge every other data point.
It is the most overlooked of the five metrics in this series, because on the surface it seems trivial (“Just count how many releases we shipped”). Its role, however, is critical. Without it, indicators like DDR, Escaped Bugs or Issues per Release are merely dry numbers, stripped of any real scale.
The metric that does not shine on its own - but lights up the rest
Picture the other four metrics in the series as satellites. Each one orbits a single reference point - the number of releases. Without that center, each one drifts without context.
= escaped per release
= maturity per release
to detect / escape
= prediction stability
How to normalize every metric in the series
Normalization is simply dividing the absolute number by the number of releases. But the effect is transformative - it turns a number that swings with your pace of work into a quality indicator independent of that pace.
| Metric | Absolute | Normalized | What you gain |
|---|---|---|---|
| Escaped Bugs | 12 / quarter | ÷ releases | Comparability across quarters with a different cadence |
| Issues found | 96 / quarter | ÷ releases | Code-maturity trend independent of deployment count |
| QA time | 320h / quarter | ÷ releases | Cost of quality per release - an argument for budget |
| Hotfixes | 8 / quarter | ÷ releases | Stability of the release process, not a raw failure count |
A concrete example - the same team, two quarters
Watch how normalization completely flips the conclusion. Without it, Q4 looks worse than Q3. With it - you see a clear improvement.
Reading the absolute numbers: “The number of production bugs rose from 7 to 12 - our quality is dropping.” Reading them after normalization: “Bugs per release fell from 1.4 to 0.8. We doubled our delivery pace and the quality of our software clearly improved.” The second conclusion is the true one. The first is a business trap.
Number of Releases vs Deployment Frequency
The number of releases is a very close cousin of the industry’s most famous delivery-pace metric - Deployment Frequency from the DORA research. It is the first of the four key DevOps metrics and a direct indicator of how often your organization actually delivers value to users.
Keep in mind that the latest DORA report moved away from the classic four-tier split in favor of seven archetypes, but the data in the classic layout still makes an excellent reference point. Only about 16% of teams can deploy changes “on demand”, while 24% do so less than once a month. The maturity gap is enormous.
Why does this matter for QA? Because delivery pace and quality are not opposites - that is one of the most important findings from years of DORA research. The fastest teams are also the most stable. More frequent, smaller releases mean a smaller blast radius for every change, easier diagnosis and faster rollback. The number of releases is not just the denominator for your metrics - it is a signal of how mature the whole process is.
Deployment ≠ Release - and why it changes the counting
Before you start collecting data, you have to be clear: are you counting deployments or releases? They are not the same thing, and it trips up even experienced teams - especially those working with feature flags.
For this series’ QA metrics, we count what reaches the user, that is releases in the business sense. An escaped bug is a problem the customer felt - so the denominator has to be the number of moments at which anything could have reached the customer. If your team separates deployment from release with feature flags, decide clearly: is an escaped bug counted from the moment the code is deployed, or from the moment the flag is turned on? Consistency in this definition is critical - just as with escaped bugs in article 3.
Normalization calculator
Enter the absolute numbers and the number of releases - the calculator will show the normalized values with a verdict for each metric.
How to start counting - and do it right
It is the easiest metric in the whole series to collect - but it has a few definitional traps worth settling from the start.
Three pitfalls when using the number of releases
The number of releases in a conversation with the business
Why this “trivial” metric is the foundation
- A common denominator for all the other metrics in the series
- Protection against wrong conclusions drawn from absolute numbers
- Comparability across quarters with a different cadence
- A bridge to Deployment Frequency - the language of DORA and the boardroom
- Proof that pace and quality can rise at the same time
- A goal in itself - more does not always mean better
- A measure of release size (the count alone ignores scale)
- A tool for ranking different teams
- Sufficient on its own - it only shines together with the other metrics
In the next article
Five metrics behind us. Article seven ties them all into a single decision indicator - the Release Confidence Score. Three calculation models, from a simple traffic light to a weighted model, a step-by-step rollout and examples from practice. This is the moment the whole series starts working as a system - a single number that answers the business’s most important question: can we release?
- 01The complete guide readDiagnosis, three pillars, five metrics, the QA → KPI mapping model
- 02Formula, thresholds, historical data, seasonality, pitfalls
- 03Taxonomy, data collection, the cost of each type, how to report
- 04Issues per Release readRollout from scratch, the link to the development process, the EM conversation
- 05Pinpointing problems, not just watching trends
- 06Number of Releases you are hereThe context metric, normalization, the link to Deployment Frequency
- 07Release Confidence Score step by stepThree calculation models, rollout, examples from practice
- 08Storytelling with metrics - building a narrativeHow to turn a table of numbers into a business argument
- 093 anti-patterns that destroy QA credibilityToo many metrics, no context, jargon - and how to avoid each