Leaderboard
Ranking uses only submissions that finished all model evaluations; incomplete evaluations are excluded.