Captioning Model Rankings

Updated 2 minutes ago
Votes power rankings.
Top Model Scores

ELO ratings for the highest performing models

Loading chart...
Performance vs Accuracy

ELO score vs average latency • Better models are top-left

Loading chart...
Action
1
multimodal1278521.10 sGoogle
2
multimodal1246311.54 sQwen
3
multimodal1245510.85 sGoogle
4
multimodal1245312.32 sQwen
5
multimodal1236512.21 sAnthropic
6
multimodal122338.91 sGoogle
7
multimodal1223228.26 sQwen
8
multimodal1222322.27 sQwen
9
multimodal1221310.40 sGoogle
10
multimodal121357.16 sGoogle
11
OpenAI
multimodal1212510.50 sOpenAI
12
multimodal121255.13 sAnthropic
13
multimodal121239.46 sGoogle
14
multimodal1211417.18 sGoogle
15
multimodal121136.57 sQwen
16
Grok
multimodal1203314.47 sxAI
17
multimodal120133.75 sMeta
18
OpenAI
multimodal120029.80 sOpenAI
19
multimodal120055.36 sAnthropic
20
multimodal1200315.05 sMistral
21
multimodal1200510.91 sAnthropic
22
multimodal1200512.71 sOpenAI
23
multimodal1199470.55 sGoogle
24
multimodal119846.71 sGoogle
25
OpenAI
multimodal1192518.48 sOpenAI
26
multimodal119056.20 sAnthropic
27
multimodal119034.09 sMistral
28
multimodal118933.35 sQwen
29
multimodal1188417.97 sQwen
30
multimodal1188323.40 sQwen
31
multimodal118836.00 sMistral
32
multimodal1188314.42 sQwen
33
OpenAI
multimodal118859.79 sOpenAI
34
multimodal118658.95 sAnthropic
35
multimodal1184411.87 sMeta
36
multimodal117952.10 sGoogle
37
multimodal117856.34 sAnthropic
38
multimodal116835.63 sMeta
39
multimodal115436.34 sQwen
40
multimodal114435.94 sMicrosoft
41
multimodal1133517.11 sOpenAI