AI Vision Model Rankings
Updated 5 minutes ago
Explore top-performing models across computer vision tasks. Compare accuracy, speed, and user votes to find the best AI models.
User votes power the rankings.
Overall Model Rankings
Average performance across all supported vision tasks
| Rank | Model | Score | Tasks | Avg Latency |
|---|---|---|---|---|
| 1 | Seg Preview | 1369 | 1 | 5.40 s |
| 2 | Gemini 2.5 Flash | 1249 | 4 | 5.34 s |
| 3 | Gemini 2.5 Pro | 1249 | 4 | 16.52 s |
| 4 | YOLO World | 1236 | 1 | 2.89 s |
| 5 | Gemini 2.0 Flash Exp | 1214 | 5 | 3.64 s |
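
The page describes the overall score as average performance across a model's supported tasks but does not give the exact formula. The sketch below assumes an unweighted mean of per-task scores. The function name `overall_score` and the `example_model` scores are hypothetical, for illustration only. The Seg Preview entry uses its listed object detection score.

```python
from statistics import mean


def overall_score(task_scores: dict[str, float]) -> float:
    """Unweighted mean of per-task scores (an assumed formula, not confirmed by the page)."""
    if not task_scores:
        raise ValueError("model has no scored tasks")
    return mean(task_scores.values())


# Seg Preview is listed with a single task, so its overall score
# is simply that task's score.
seg_preview = {"object_detection": 1369}

# Hypothetical model with made-up scores, for illustration only.
example_model = {"classification": 1200, "ocr": 1240, "captioning": 1220}

print(overall_score(seg_preview))
print(overall_score(example_model))
```

Under this assumption, a model that supports one task keeps that task's score as its overall score, which matches the Seg Preview and YOLO World rows.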
Object Detection Model Rankings
Models that detect and localize objects in images.
| Rank | Model | Score | Tasks | Avg Latency |
|---|---|---|---|---|
| 1 | Seg Preview | 1369 | 1 | 5.40 s |
| 2 | Gemini 2.5 Flash | 1320 | 4 | 9.16 s |
| 3 | Gemini 2.5 Pro | 1312 | 4 | 16.77 s |
| 4 | Florence-2 | 1252 | 3 | 4.22 s |
| 5 | YOLO World | 1236 | 1 | 2.89 s |
Classification Model Rankings
Models that classify images into categories.
| Rank | Model | Score | Tasks | Avg Latency |
|---|---|---|---|---|
| 1 | Claude 3 Opus | 1258 | 4 | 3.00 s |
| 2 | Gemini 2.0 Flash Exp | 1220 | 5 | 4.18 s |
| 3 | Gemini 2.5 Flash | 1213 | 4 | 5.05 s |
| 4 | Claude 3.7 Sonnet | 1210 | 5 | 4.47 s |
| 5 | Gemini 2.5 Flash Lite | 1203 | 4 | 3.02 s |
OCR Model Rankings
Models that extract text from images.
| Rank | Model | Score | Tasks | Avg Latency |
|---|---|---|---|---|
| 1 | Gemini 2.5 Flash Lite | 1239 | 4 | 2.89 s |
| 2 | GPT-4o mini | 1229 | 3 | 7.14 s |
| 3 | Mistral Medium 3.1 | 1224 | 3 | 15.02 s |
| 4 | Claude 4 Opus | 1223 | 5 | 6.31 s |
| 5 | Claude 3 Haiku | 1222 | 5 | 2.39 s |
Captioning Model Rankings
Models that generate descriptive captions for images.
| Rank | Model | Score | Tasks | Avg Latency |
|---|---|---|---|---|
| 1 | Gemini 2.5 Pro | 1236 | 4 | 19.57 s |
| 2 | Gemma 3 4B | 1224 | 3 | 5.55 s |
| 3 | Claude 3.7 Sonnet | 1224 | 5 | 7.97 s |
| 4 | Llama 3.2 Vision 11b | 1222 | 4 | 11.04 s |
| 5 | Gemma 3 12B | 1212 | 3 | 7.88 s |
Open Prompt Model Rankings
Models that interpret free-form prompts on images.
| Rank | Model | Score | Tasks | Avg Latency |
|---|---|---|---|---|
| 1 | Gemini 2.5 Flash | 1244 | 4 | 3.92 s |
| 2 | Gemini 2.5 Pro | 1235 | 4 | 15.21 s |
| 3 | Claude 3 Opus | 1234 | 4 | 5.13 s |
| 4 | Llama 3.2 Vision 90b | 1215 | 4 | 5.56 s |
| 5 | Llama 4 Maverick | 1212 | 3 | 3.56 s |