Open Prompt Leaderboard

Updated 7 minutes ago
Votes power leaderboards.
Top Model Scores

ELO ratings for the highest performing models
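The page does not say how votes are turned into ratings. As a rough illustration only, the sketch below applies the standard Elo update to a single head-to-head vote. The function names and the K-factor of 24 are assumptions made for this example, not values taken from the leaderboard.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Standard Elo expected score of A against B (between 0 and 1)."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(winner: float, loser: float, k: float = 24.0) -> tuple[float, float]:
    """Return the new (winner, loser) ratings after one vote.

    k is a hypothetical K-factor chosen for illustration.
    """
    gain = k * (1.0 - expected_score(winner, loser))
    return winner + gain, loser - gain


# Two models that both start at 1200: the winner gains what the loser gives up.
print(update_elo(1200, 1200))  # (1212.0, 1188.0)
```

Under this scheme an upset (a lower-rated model beating a higher-rated one) moves both ratings by more than an expected win does.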

Performance vs Accuracy

ELO score vs average latency • Better models are top-left

Rank  Organization  Model                   ELO   Change + latency
1     Google        Gemini 2.5 Flash        1235  +123.33 s
2     Anthropic     Claude 3 Opus           1235  +115.81 s
3     Google        Gemini 1.5 Pro          1224  +128.28 s
4     Google        Gemini 1.5 Flash        1224  +122.51 s
5     Meta          Llama 4 Maverick        1212  +121.95 s
6     Meta          Llama 3.2 Vision 11b    1212  +125.31 s
6     OpenAI        GPT-4.1 mini            1212  +121.66 s
7     OpenAI        GPT-5                   1212  +1217.22 s
8     Meta          Llama 4 Scout           1211  -12.38 s
9     Google        Gemini 2.5 Pro          1211  -1216.62 s
10    Google        Gemini 2.0 Flash Exp    1201  +134.25 s
11    Mistral       Mistral Medium 3.1      1200  -122.37 s
12    OpenAI        GPT-5 Mini              1200  +1216.98 s
13    Google        Gemma 3 4B              1200  +1211.90 s
14    Anthropic     Claude 4 Sonnet         1200  011.22 s
14    Google        Gemma 3 27B             1200  02.99 s
14    OpenAI        GPT-4.1 nano            1200  03.17 s
14    OpenAI        GPT-4o                  1200  03.92 s
14    Anthropic     Claude 3.7 Sonnet       1200  05.38 s
15    xAI (Grok)    Grok 4                  1200  08.81 s
16    Anthropic     Claude 4 Opus           1200  014.36 s
17    Mistral       Mistral Small 3.1 24B   1199  -133.65 s
18    Anthropic     Claude 3.5 Haiku        1190  +04.15 s
19    Qwen          Qwen2.5-VL-7B-Instruct  1189  +13.14 s
20    Unknown org   kimi-vl-a3b-thinking    1188  -122.69 s
20    OpenAI        GPT-4o mini             1188  -121.93 s
20    xAI (Grok)    Grok 2 Vision 1212      1188  -121.48 s
20    Anthropic     Claude 3.5 Sonnet       1188  -123.74 s
20    Qwen          Qwen VL Max             1188  -126.83 s
21    OpenAI        GPT-4.1                 1187  -135.84 s
22    OpenAI        GPT-5 Nano              1178  -1120.51 s
23    Google        Gemma 3 12B             1176  -123.56 s
24    Anthropic     Claude 3 Haiku          1176  -121.72 s
25    Mistral       Pixtral 12B             1176  -124.55 s