Classification Leaderboard
Updated 2 minutes agoVotes power leaderboards.
Top Model Scores
ELO ratings for the highest performing models
Loading chart...
Performance vs Accuracy
ELO score vs average latency • Better models are top-left
Loading chart...
1 | Claude 3 Opus | 1247 | +12 | 3.03 s | Anthropic |
2 | Gemini 2.0 Flash Exp | 1235 | +13 | 4.39 s | |
3 | Claude 3.5 Sonnet | 1232 | 0 | 4.20 s | Anthropic |
4 | Gemini 1.5 Pro | 1229 | -13 | 3.41 s | |
5 | Gemini 1.5 Flash | 1209 | +0 | 4.69 s | |
6 | Llama 3.2 Vision 11b | 1157 | -10 | 8.37 s | Meta |
7 | Claude 3 Haiku | 1148 | +0 | 2.41 s | Anthropic |
8 | Llama 3.2 Vision 90b | 1144 | 0 | 3.54 s | Meta |