Captioning Model Rankings
Updated 8 minutes agoVotes power rankings.
Top Model Scores
ELO ratings for the highest performing models
Loading chart...
Performance vs Accuracy
ELO score vs average latency • Better models are top-left
Loading chart...
1 | multimodal | 1236 | 4 | 19.57 s | ||
2 | multimodal | 1224 | 3 | 5.55 s | ||
3 | multimodal | 1224 | 5 | 7.97 s | Anthropic | |
4 | multimodal | 1222 | 5 | 3.51 s | ||
5 | multimodal | 1222 | 4 | 11.04 s | Meta | |
6 | multimodal | 1212 | 3 | 7.88 s | ||
7 | multimodal | 1212 | 3 | 6.74 s | ||
8 | multimodal | 1212 | 3 | 17.31 s | Anthropic | |
9 | multimodal | 1211 | 3 | 17.62 s | Qwen | |
10 | multimodal | 1211 | 3 | 1.94 s | Meta | |
11 | multimodal | 1211 | 5 | 13.45 s | Anthropic | |
12 | multimodal | 1201 | 4 | 1.88 s | ||
13 | multimodal | 1200 | 4 | 5.74 s | Meta | |
14 | multimodal | 1200 | 3 | 15.74 s | xAI | |
15 | multimodal | 1200 | 3 | 15.05 s | Mistral | |
16 | multimodal | 1200 | 3 | 5.23 s | Mistral | |
17 | multimodal | 1200 | 3 | 5.82 s | OpenAI | |
18 | multimodal | 1200 | 3 | 5.87 s | OpenAI | |
19 | multimodal | 1199 | 3 | 18.96 s | OpenAI | |
20 | multimodal | 1198 | 3 | 8.98 s | OpenAI | |
21 | multimodal | 1198 | 3 | 4.62 s | OpenAI | |
22 | multimodal | 1191 | 3 | 18.65 s | OpenAI | |
23 | vlm | 1188 | 3 | 4.55 s | Microsoft | |
24 | multimodal | 1188 | 3 | 4.01 s | OpenAI | |
25 | multimodal | 1187 | 3 | 8.37 s | Anthropic | |
26 | multimodal | 1178 | 3 | 3.93 s | Mistral | |
27 | multimodal | 1177 | 3 | 4.59 s | Meta | |
28 | vlm | 1177 | 3 | 4.25 s | Qwen | |
29 | multimodal | 1165 | 5 | 8.59 s | Anthropic | |
30 | multimodal | 1146 | 5 | 3.16 s | Anthropic |