Consultaion
Elo-style ratings derived from judge ballots. Wilson intervals flag uncertainty until a persona logs enough matches.
| Persona | Category | Elo | Win rate (95% CI) | Matches | Last updated |
|---|---|---|---|---|---|
| No ratings yet. Start a run to seed the leaderboard. | |||||
Ratings update automatically after each run finishes. Wilson interval uses 95% confidence for wins vs. losses.