This model is deprecated
Claude Opus 4 was deprecated on Jun 15, 2026 and can no longer be run here. Its evaluation results and details remain available for reference. Try Claude Opus 4.8 instead.
Claude Opus 4, released by Anthropic in May 2025, is the flagship model of the Claude 4 family, built for complex, long-horizon reasoning and advanced coding workflows. It is multimodal, supporting text (including voice), images, and tool use, and operates as a hybrid reasoning model: it can deliver quick answers in fast mode or switch to extended thinking for deeper, multi-step problem solving. With a context window of roughly 200,000 tokens and a training cutoff around March 2025, it is suited to large documents, long conversations, and sophisticated agentic tasks.
Positioned at the high end of Anthropic’s offerings, Opus 4 posted state-of-the-art results at release on coding benchmarks such as SWE-Bench (72.5%) and Terminal-Bench (43.2%). It is best suited for research, enterprise automation, and software development at scale. The model is classified at Anthropic’s ASL-3 safety level, denoting advanced oversight and safety features.
Usage
Past 30 Days: Not available
Not in Playground
| Category | Passed | Score |
|---|---|---|
| Document Understanding | 8 / 9 | 88.9% |
| Defect Detection | 10 / 15 | 66.7% |
| Object Understanding | 9 / 14 | 64.3% |
| Spatial Understanding | 11 / 19 | 57.9% |
| Object Counting | 0 / 10 | 0% |
Scores based on a single evaluation run · Methodology
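The per-category percentages above are plain pass rates (passed divided by attempted). As a rough check, the sketch below recomputes them from the counts in the table and also derives two possible overall figures: a pooled rate over all tasks and an unweighted mean of the category rates. Which of the two the site uses is an assumption here, not something the page states; the variable and function names are illustrative only.

```python
# (passed, total) per category, copied from the table above.
results = {
    "Document Understanding": (8, 9),
    "Defect Detection": (10, 15),
    "Object Understanding": (9, 14),
    "Spatial Understanding": (11, 19),
    "Object Counting": (0, 10),
}


def pass_rate(passed: int, total: int) -> float:
    """Return the pass rate as a percentage."""
    return 100.0 * passed / total


for name, (passed, total) in results.items():
    print(f"{name}: {pass_rate(passed, total):.1f}%")

# Pooled rate: every task counts equally, so large categories weigh more.
total_passed = sum(p for p, _ in results.values())
total_tasks = sum(t for _, t in results.values())
pooled = pass_rate(total_passed, total_tasks)

# Macro average: every category counts equally (an alternative convention).
macro = sum(pass_rate(p, t) for p, t in results.values()) / len(results)

print(f"Pooled: {total_passed} / {total_tasks} = {pooled:.1f}%")
print(f"Macro average: {macro:.1f}%")
```

The pooled figure (38 of 67 tasks) rounds to 56.7%, which agrees with the score listed for Claude Opus 4 in the comparison table, so pooling appears to be the convention in use. Because the scores come from a single evaluation run, small differences between models should be read with caution.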
Claude Opus 4 costs $15.00 per 1M input tokens and $75.00 per 1M output tokens.
Pricing updated Jun 28, 2026
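To make the per-token rates concrete, here is a minimal sketch of how a request cost could be estimated from them. The rates are the ones quoted above; the token counts in the example call are hypothetical, and the function name is illustrative rather than part of any API.

```python
# Listed rates for Claude Opus 4, in USD per one million tokens.
INPUT_USD_PER_MTOK = 15.00
OUTPUT_USD_PER_MTOK = 75.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts."""
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


# Hypothetical request: 2,000 input tokens and 500 output tokens.
example = estimate_cost_usd(2_000, 500)
print(f"${example:.4f}")
```

Output tokens cost five times as much as input tokens at these rates, so response length tends to dominate the bill for tasks with short prompts.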
Estimated cost per task vs. Visual Understanding score, for this model and others ranked near it. Upper-left is the sweet spot (high quality, low cost).
8 of 9 models plotted · 1 not yet evaluated
| Model | Score | Median tokens | Est. cost / task |
|---|---|---|---|
| Claude Opus 4.6 | 64.2% | 2.3K | $0.014 |
| Claude Sonnet 4.5 | 59.7% | 2.3K | $0.0092 |
| Claude Opus 4.1 | 59.7% | 2.1K | $0.040 |
| GPT-5 Nano | 58.2% | 2.7K | $0.0003 |
| Qwen3.5 397B A17B | 58.2% | 1.5K | $0.0006 |
| Claude Opus 4 (this model) | 56.7% | — | — |
| Gemini 2.5 Flash | 55.2% | 476 | $0.0005 |
| Gemini 2.5 Flash-Lite | 53.7% | 301 | $0.0000 |
| Kimi K2.5 | 35.8% | 2.7K | $0.0021 |
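One way to read the "sweet spot" idea is as a Pareto check: a model is worth a closer look if no other model is at least as accurate and at least as cheap. The sketch below applies that rule to the scores and estimated costs in the table. The dominance rule is an assumption made for illustration, not the site's own ranking method; costs are used as displayed (so Gemini 2.5 Flash-Lite appears as 0.0000 after rounding), and Claude Opus 4 is left out because no cost is listed for it.

```python
# (model, score in %, estimated cost per task in USD), copied from the table.
models = [
    ("Claude Opus 4.6", 64.2, 0.014),
    ("Claude Sonnet 4.5", 59.7, 0.0092),
    ("Claude Opus 4.1", 59.7, 0.040),
    ("GPT-5 Nano", 58.2, 0.0003),
    ("Qwen3.5 397B A17B", 58.2, 0.0006),
    ("Gemini 2.5 Flash", 55.2, 0.0005),
    ("Gemini 2.5 Flash-Lite", 53.7, 0.0000),
    ("Kimi K2.5", 35.8, 0.0021),
]


def dominates(a, b) -> bool:
    """True if a is no worse than b on both axes and strictly better on one."""
    _, score_a, cost_a = a
    _, score_b, cost_b = b
    no_worse = score_a >= score_b and cost_a <= cost_b
    strictly_better = score_a > score_b or cost_a < cost_b
    return no_worse and strictly_better


# Keep the models that no other model dominates.
frontier = [m[0] for m in models if not any(dominates(o, m) for o in models)]
print(frontier)
```

With the figures as displayed, four models remain on the frontier: Claude Opus 4.6, Claude Sonnet 4.5, GPT-5 Nano, and Gemini 2.5 Flash-Lite. Each of the others is matched or beaten on score by a cheaper model.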
License terms and commercial-use guidance for Claude Opus 4.
License information is provided as a guide and is not legal advice.