← Back

Scoring every signal against a base rate, first

From Anières

The first honest thing a monitor can do is admit how often it would be wrong by default. That number is the anchor everything else corrects.

The first thing a new monitor should log is its base rate: how often it would flag by pure chance on a random subject with the same shape. Everything the monitor says afterwards is a correction to that number, and the reason the standing report reads honestly is that the anchor is written down before any signal is scored against it.

We anchor every signal on a base rate first. How often, in general, does a director change presage a filing problem. How often, in general, does a new address in a given country presage a residency issue. How often, in general, does a name land near a sanctioned entity by pure coincidence. The number is usually humbler than the analyst's first instinct.

From that anchor, the system updates; independent corroboration nudges the number up. A source with a known weakness in the country nudges it down. A prior finding on the same person in the same direction nudges it up, but less than the raw count would suggest, because the priors are not independent of the new signal.

The whole thing is boring; it is also the difference between a report a client can act on and a report a client has to relitigate. When confidence is anchored honestly, the report can say the number out loud without either team wincing.

The most honest thing an analytical system can do at the start of any question is state how often it would be wrong by default, if it did no work at all. That number is the base rate, and every subsequent piece of evidence should nudge the estimate away from it in proportion to how much information the evidence actually carries.

For a general reader, this is the whole of calibration in one sentence: start from the base rate, move a little for each new fact, and be suspicious of any process that jumps to certainty from a small number of inputs. Well-calibrated confidence is unglamorous and reliably better than the confident-sounding alternative.

Written alongside work at Anières: exposure mapping, cross-reference, and standing-report systems for private clients.