How is this calculated?

Most of these charts use two stats tricks: Bayesian shrinkage so rare matchups don't dominate the leaderboards, and Wilson confidence intervals so we can show honest error bars on small samples. Cells with fewer than 5 games are hidden; cells with 5–9 games are shown with low color saturation as a hint that the prior is doing most of the work.

Ancient One difficulty

We compute the loss rate (defeats divided by total games) for each Ancient One that has at least 5 games logged. The error bars are 95% Wilson score intervals — well-behaved on small samples, unlike the naïve normal approximation. The list is sorted hardest (highest loss rate) first.

Investigator × Ancient One

For each (investigator, Ancient One) pair, the raw win rate is just wins / games. The shrunk win rate uses a Beta(α, β) prior centered on the global win rate, with prior strength of 10 — i.e. each cell is pulled toward the mean as if 10 extra games at the global average had been added. Cells with fewer than 5 games are omitted entirely.

Shrinkage, visually

The leaderboards and tier list use shrunk win rates, computed against a community-mean prior with strength 10. The plot below shows what that actually does to each investigator's number. Investigators with thousands of games barely move; those with only the 30-game floor get yanked toward the middle.

Shrinkage in action

Each dot is an investigator. The diagonal would mean 'no shrinkage'; the horizontal line at the community mean shows the pull. Small-sample investigators (smaller dots) get yanked toward the middle.

N= 96,488 HOW?

Investigator

Raw

Shrunk

Agatha

949

73.1%

73.0%

Luke

1284

71.7%

71.6%

Sefina

636

70.6%

70.4%

Pete

1097

70.0%

69.9%

Carson

515

69.9%

69.7%

Preston

693

69.4%

69.3%

Daniela

754

69.1%

69.0%

Jenny

1772

68.7%

Mary

1376

68.4%

68.3%

Minh

1279

68.3%

68.2%

Ursula

2627

68.0%

Bob

1027

68.0%

67.9%

Roland

828

67.6%

67.5%

Daisy

2337

67.0%

66.9%

Rex

1123

66.9%

66.8%

Carolyn

715

66.7%

66.6%

Gloria

752

66.4%

66.3%

Mateo

727

66.3%

66.2%

Kate

1014

65.8%

65.7%

Wendy

1195

65.1%

Monterey

1546

64.2%

Darrell

651

64.2%

64.1%

Mandy

1296

64.1%

Vincent

619

63.5%

63.4%

William

885

63.3%

63.2%

"Skids"

2029

63.2%

Dexter

1349

62.7%

Calvin

589

62.5%

62.4%

George

1325

62.3%

Rita

879

62.2%

Marie

1466

62.2%

Jacqueline

4432

62.0%

Michael

1016

62.0%

Harvey

863

61.8%

61.7%

Patrice

1607

61.5%

Amanda

563

61.5%

61.4%

Agnes

2118

61.2%

Charlie

4853

61.2%

Tommy

1474

59.8%

Joe

1018

59.6%

Wilson

1390

58.4%

Zoey

1926

58.1%

Hank

1230

57.5%

Finn

1260

55.9%

Tony

1200

55.2%

Lily

4750

54.5%

Diana

4186

54.4%

54.5%

Jim

3215

54.2%

Lola

2828

54.1%

54.2%

Norman

3364

53.9%

Trish

3858

53.6%

Akachi

3849

53.4%

Silas

3364

51.8%

Leo

3632

51.7%

Mark

3158

48.4%

Calibration check

A good shrunk estimate should match observed reality in aggregate: if we bucket cells by their predicted (shrunk) win rate, the observed (raw) average inside each bucket should fall close to the diagonal. Systematic deviation would indicate the prior is biased.

Calibration of shrunk win rates

Within each bucket of predicted (shrunk) win rate, we plot the actual observed win rate. Points on the diagonal = well-calibrated. Systematic deviation = the model over- or under-predicts.

N= 96,488 HOW?

Predicted bucket

Observed

35%

1010

35.9%

45%

18036

45.3%

55%

28947

54.5%

65%

33034

65.4%

75%

14524

75.5%

85%

937

86.4%

Doom-track distribution

For each Ancient One we build a histogram of the final doom-track value across all games where it was reported. The ridgeline view normalizes each AO's histogram so the densities are comparable across rows. Bimodal shapes (mass near 0 and near 15) indicate a "swingy" AO where games end decisively in either direction; smooth unimodal shapes indicate predictable pacing.

Source

The underlying spreadsheet is maintained by the Eldritch Horror community. We fetch the raw submissions tab once a day, normalize, and rebuild. See the repository for the pipeline source.