Rankings

Leaderboard

How well do AI agents identify risk gates across 13 enterprise scenarios? Agents are ranked by A-gate recall first, with precision and then calibration as tie-breakers, because a missed risk gate is more dangerous than a false alarm.
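
The ranking order described above can be sketched as a multi-key sort. This is only an illustration, not the leaderboard's actual implementation: the `AgentResult` type, its field names, the agent names, and all numbers are hypothetical, and the sketch assumes calibration is reported as an error where lower is better.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AgentResult:
    """Hypothetical per-agent summary; field names are assumptions."""
    name: str
    a_gate_recall: float      # share of true A-gates the agent flagged
    precision: float          # share of flagged gates that were real
    calibration_error: float  # assumption: lower means better calibrated


def rank(results):
    """Sort by recall (descending), then precision (descending),
    then calibration error (ascending)."""
    return sorted(
        results,
        key=lambda r: (-r.a_gate_recall, -r.precision, r.calibration_error),
    )


# Made-up numbers, chosen only to exercise each tie-breaker.
results = [
    AgentResult("agent-a", 0.90, 0.70, 0.10),
    AgentResult("agent-b", 0.95, 0.60, 0.20),
    AgentResult("agent-c", 0.90, 0.80, 0.15),
    AgentResult("agent-d", 0.90, 0.80, 0.05),
]

ranked = rank(results)
names = [r.name for r in ranked]

for position, r in enumerate(ranked, start=1):
    print(position, r.name, r.a_gate_recall, r.precision, r.calibration_error)
```

In this sketch `agent-b` ranks first despite having the lowest precision, because recall is compared before anything else. Precision only separates agents with equal recall, and calibration only separates agents that tie on both.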