How we work out who diagnosed it
Once responders establish an incident’s cause, we look back over everything that happened before the cause was understood — channel messages, incident calls, and the investigation’s own findings — and reconstruct the diagnostic chain: the small number of steps that were genuinely key to reaching the answer. Exploration that didn’t pan out doesn’t count, and neither does anything after the diagnosis — confirmation and remediation are important work, but they aren’t diagnosis. Each step is attributed to whoever first surfaced it. Attribution comes from the evidence behind the step: the message, call moment, or investigation finding that delivered the result. If a step’s results arrived in an investigation finding, the step is the investigation’s; if they arrived in a responder’s message, it’s the responders’; a genuine collaboration counts toward both. The balance of steps places each incident in one of four bands:| Band | What it means |
|---|---|
| Autonomous | Investigations reached the correct diagnosis independent of responder input. |
| Led by Investigations | The incident was diagnosed mainly by Investigations, with a small amount of assistance from responders. |
| Led by responders | The incident was diagnosed mainly by responders, with a small amount of assistance from Investigations. |
| Manual | Responders reached the correct diagnosis independent of Investigations input. |
An example
Here’s how an example incident breaks down:The investigation isolates the failing dependency
The investigation recovers the underlying error
Diagnosed: a responder connects it to a config change
Time to diagnose
For each incident that reached a diagnosis, we measure the time from the start of the investigation to the moment the cause was first understood — the specific message or call moment where the answer landed, located after the fact from the incident’s own record. The dashboard shows the median time to diagnose for each band, so you can compare how quickly incidents reach a diagnosis depending on who drove them there. We also calculate how much faster incidents are diagnosed when Investigations drove the diagnosis: the median time to diagnose for incidents in the Autonomous and Led by Investigations bands, compared against the median for the rest (Led by responders and Manual).Where you see it
The breakdown appears on the Investigations page in your dashboard, once around ten investigations in the selected period have reached a diagnosis we can attribute. The percentages are shares of those diagnosed, attributable incidents — an incident whose cause was never established, or whose diagnostic steps couldn’t be attributed to either side, isn’t part of the split.FAQs
Why isn't every incident shown in the breakdown?
Why isn't every incident shown in the breakdown?
What if responders re-did work the investigation had already done?
What if responders re-did work the investigation had already done?
Does Autonomous mean nobody was involved?
Does Autonomous mean nobody was involved?
Does Manual mean Investigations didn't run?
Does Manual mean Investigations didn't run?
How are steps split when both sides worked together?
How are steps split when both sides worked together?
Does a faster time to diagnose for Investigations-driven incidents prove they'd speed up every incident?
Does a faster time to diagnose for Investigations-driven incidents prove they'd speed up every incident?