How we measure engagement
We give every investigation a single engagement score, computed once its incident closes. It’s deterministic — there’s no separate AI judgment involved. We tally the interactions we already record between responders and the investigation, weight them by how much they signal real use, and normalize for the number of responders.What we count
Not every interaction says the same thing. Skimming a link is weaker evidence than merging the code change an investigation proposed. So we group signals into three strengths, from strongest to weakest.| Engagement signal | What it means | Examples |
|---|---|---|
| Strong | A responder acted on it | Acted on a heads-up, merged a code change, merged a duplicate incident it flagged, steered the investigation, held a sustained conversation with @incident |
| Medium | A responder worked with it | Responded to a heads-up, rated a message useful, asked @incident a question |
| Weak | A responder took it in | Clicked through to something it surfaced, rated a message somewhat useful |
- Strong signals count for the most. These are the moments where an investigation changed what a responder did — the clearest evidence it earned its place. A conversation with
@incidentcounts for more the longer it runs: a sustained exchange weighs more than a single question. - We only count real engagement. A heads-up that was ignored, or a message nobody reacted to, contributes nothing.
Normalizing for incident size
A raw tally would make every large incident look more engaged than every small one, simply because more people were in the channel. That’s not what we want to measure. So we scale the score by the number of responders on the incident — but gently, so it grows slower than headcount. A small, tightly-focused incident where two responders both acted on the investigation can score as high as a large one where a handful of a much bigger crowd did. This keeps engagement comparable across incidents of very different sizes, so a trend in the score reflects how responders are using investigations rather than how big their incidents happened to be.Engaging responders
Alongside the score, we track how many of an incident’s responders engaged — the distinct people behind an attributable signal, as a share of everyone who responded. This separates two very different situations that can produce the same score: one responder leaning on the investigation heavily, versus the whole team each using it a little. Broad engagement across a team is a stronger sign that investigations have become part of how people respond, rather than something one person relies on.Seeing engagement in your dashboard
You can view engagement metrics for your account on the investigations homepage in your dashboard, alongside accuracy. To dig into specific incidents, use the investigations list there to find investigations with particularly high or low engagement — a quick way to see what’s landing with your responders and what isn’t.How we use engagement scores
Every closed incident’s investigation is scored this way, so we have a continuous picture of how investigations are being used across your account rather than a one-off sample. We use these scores in a few ways:- Spotting where value is or isn’t landing. Accuracy tells us whether investigations are right; engagement tells us whether that’s translating into use. High accuracy with low engagement points at a delivery problem — the right answer surfaced in a way that didn’t land — rather than a reasoning one.
- Driving improvements. Engagement points at how investigations show up for responders — the timing and clarity of heads-up messages, how findings are surfaced in the channel, and how actionable their suggestions are.
- Specific to your teams. Because it’s built from your responders’ real interactions, engagement reflects how your teams work with investigations, not a generic average.
FAQs
How is engagement different from accuracy?
How is engagement different from accuracy?
Accuracy measures whether an investigation was right, graded against the cause your responders established.
Engagement measures whether responders used it. An investigation can be accurate but ignored, or partly right but
genuinely useful — the two are measured separately and both matter. See Measuring
accuracy.
Does engagement use AI to judge interactions?
Does engagement use AI to judge interactions?
No. The engagement score is deterministic — it’s a weighted tally of interactions we already record (heads-up
reactions, feedback, steers, chatbot threads, link clicks, merged code changes), normalized for incident size.
There’s no separate model judgment involved.
Why normalize by the number of responders?
Why normalize by the number of responders?
Without it, big incidents would always look more engaged than small ones, purely because more people were present.
Scaling by responder count — gently, so it grows slower than headcount — keeps a small, high-engagement incident
comparable to a large one, so the score reflects how investigations are used rather than how big the incident was.
Does a low engagement score mean the investigation was bad?
Does a low engagement score mean the investigation was bad?
Not on its own. A low score can mean the investigation wasn’t useful — but paired with high accuracy, it more often
points at how findings were surfaced, or an incident that resolved before responders needed to lean on it. It’s a
signal to look at, not a verdict.
Related
Measuring accuracy
How we grade investigations against what really caused your incidents, and why accuracy comes first.
Incident channel experience
Where responders meet investigations — heads-up messages, asking questions, and steering.
How investigations work
The process behind a result, and how an investigation builds conviction in real time.