Captioning Model Rankings

Updated 6 minutes ago
Rankings are powered by votes.
Top Model Scores

ELO ratings for the highest performing models
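As a rough illustration of how vote-driven ELO ratings can be maintained, the sketch below applies the standard Elo update to a single head-to-head vote between two models. The page does not state which rating variant, K-factor, or starting rating it uses, so `k=32.0`, the 400-point scale, and the 1200 starting ratings in the usage line are conventional assumptions, and the function names are hypothetical.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A beats B under the standard Elo logistic model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a: float, rating_b: float, score_a: float, k: float = 32.0):
    """Return the new (A, B) ratings after one vote.

    score_a is 1.0 if A won the vote, 0.0 if B won, 0.5 for a tie.
    k is an assumed K-factor, not a value taken from the leaderboard.
    """
    exp_a = expected_score(rating_a, rating_b)
    delta = k * (score_a - exp_a)
    # The update is zero-sum: whatever A gains, B loses.
    return rating_a + delta, rating_b - delta


# Two models start level; A wins one vote.
new_a, new_b = update_elo(1200.0, 1200.0, score_a=1.0)
```

With equal starting ratings the expected score is 0.5, so a single win moves the winner up by half the K-factor and the loser down by the same amount.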

Performance vs Accuracy

ELO score vs average latency • Better models are top-left
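One way to make "top-left" concrete is a Pareto front: a model is on the front if no other model has both a higher (or equal) ELO and a lower (or equal) latency, with at least one of the two strictly better. The sketch below is a minimal version of that idea. The `Point` type, the `pareto_front` helper, and the placeholder model names and values are all illustrative assumptions, not part of the leaderboard.

```python
from typing import List, NamedTuple


class Point(NamedTuple):
    name: str         # placeholder model name (hypothetical)
    elo: float        # higher is better
    latency_s: float  # average latency in seconds, lower is better


def pareto_front(points: List[Point]) -> List[Point]:
    """Return the points not dominated on (high ELO, low latency)."""
    front = []
    for p in points:
        dominated = any(
            q.elo >= p.elo
            and q.latency_s <= p.latency_s
            and (q.elo > p.elo or q.latency_s < p.latency_s)
            for q in points
        )
        if not dominated:
            front.append(p)
    # Sort left to right along the latency axis.
    return sorted(front, key=lambda p: p.latency_s)


# Illustrative values only.
models = [
    Point("model-a", 1236, 19.57),
    Point("model-b", 1224, 5.55),
    Point("model-c", 1211, 1.94),
    Point("model-d", 1200, 15.74),
]
front_names = [p.name for p in pareto_front(models)]
```

Here `model-d` drops out because `model-b` is both higher rated and faster; the remaining three each trade some ELO for latency or the reverse.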

Rank  Type        ELO   (unlabeled)  Avg latency  Provider
1     multimodal  1236  4            19.57 s      Google
2     multimodal  1224  3            5.55 s       Google
3     multimodal  1224  5            7.97 s       Anthropic
4     multimodal  1222  4            11.04 s      Meta
5     multimodal  1212  3            7.88 s       Google
6     multimodal  1212  3            6.74 s       Google
7     multimodal  1212  3            17.31 s      Anthropic
8     multimodal  1211  3            17.62 s      Qwen
9     multimodal  1211  3            1.94 s       Meta
10    multimodal  1211  5            3.44 s       Google
11    multimodal  1211  5            13.45 s      Anthropic
12    multimodal  1201  4            1.88 s       Google
13    multimodal  1200  4            5.74 s       Meta
14    multimodal  1200  3            15.74 s      xAI (Grok)
15    multimodal  1200  3            15.05 s      Mistral
16    multimodal  1200  3            5.23 s       Mistral
17    multimodal  1200  3            5.82 s       OpenAI
18    multimodal  1200  3            5.87 s       OpenAI
19    multimodal  1199  3            18.96 s      OpenAI
20    multimodal  1198  3            8.98 s       OpenAI
21    multimodal  1198  3            4.62 s       OpenAI
22    multimodal  1191  3            18.65 s      OpenAI
23    multimodal  1188  3            6.28 s       Meta
24    multimodal  1188  3            4.01 s       OpenAI
25    vlm         1188  3            4.55 s       Microsoft
26    multimodal  1187  3            8.37 s       Anthropic
27    multimodal  1178  3            3.93 s       Mistral
28    vlm         1177  3            4.25 s       Qwen
29    multimodal  1165  5            8.59 s       Anthropic
30    multimodal  1146  5            3.16 s       Anthropic