This model is deprecated

Claude Opus 4 was deprecated on Jun 15, 2026 and can no longer be run here. Its evaluation results and details remain available for reference. Try Claude Opus 4.8 instead.

Claude Opus 4 Overview

Claude 4 Opus, released by Anthropic in May 2025, is the flagship model of the Claude 4 family, built for complex, long-horizon reasoning and advanced coding workflows. It is multimodal, supporting text (including voice), images, and tool use, and operates as a hybrid reasoning model—able to deliver quick answers in fast mode or switch to extended thinking for deeper, multi-step problem solving. With a ~200,000-token context window and a training cutoff around March 2025, it is optimized for handling large documents, long conversations, and sophisticated agentic tasks.

Positioned at the high end of Anthropic’s offerings, Opus 4 achieves state-of-the-art results on coding benchmarks like SWE-Bench (72.5%) and Terminal-Bench (43.2%). It is best suited for research, enterprise automation, and software development at scale. The model is classified at Anthropic’s ASL-3 safety level, denoting advanced oversight and safety features.

Claude Opus 4 Details & Performance

Details

Resources

Vision Tasks

Vision LanguageObject DetectionClassificationOCRVisual Question AnsweringCaptioning

Features

Foundation VisionLLMs with Vision CapabilitiesMultimodal Vision

Usage

Past 30 Days

Not available

Not in Playground

Performance

Avg. Latency

Arena Rankings

Claude Opus 4 Vision Evals

Visual Understanding

72 models · 67 tasks
HighestLowest
This model#52 of 7256.72% pass rate · better than 26%
Score56.72%pass rate across 67 tasks
Speed19.74savg response per task
Cost$15.00 in · $75.00 out / 1M
Tokenstokens unavailable
Score key:≥75%40–74%<40%
CategoryPassedScore
Document Understanding8 / 9
88.9%
Defect Detection10 / 15
66.7%
Object Understanding9 / 14
64.3%
Spatial Understanding11 / 19
57.9%
Object Counting0 / 10
0%

Scores based on a single evaluation run · Methodology

View all Vision Evals →

Claude Opus 4 Pricing

Claude Opus 4 costs $15.00 per 1M input tokens and $75.00 per 1M output tokens.

Input$15.00 / 1M tokens
Output$75.00 / 1M tokens
Cached input$1.50 / 1M tokens

Pricing updated Jun 28, 2026

Price vs. performance

Estimated cost per task vs. Visual Understanding score, for this model and others ranked near it. Upper-left is the sweet spot (high quality, low cost).

8 of 9 models plotted · 1 not yet evaluated

ModelScoreMedian tokensEst. cost / taskCompare
AnthropicClaude Opus 4.6 64.2%2.3K$0.014Compare
AnthropicClaude Sonnet 4.559.7%2.3K$0.0092Compare
AnthropicClaude Opus 4.159.7%2.1K$0.040Compare
OpenAIGPT-5 Nano58.2%2.7K$0.0003Compare
QwenQwen3.5 397B A17B58.2%1.5K$0.0006Compare
AnthropicClaude Opus 4(this model)56.7%
GoogleGemini 2.5 Flash55.2%476$0.0005Compare
GoogleGemini 2.5 Flash-Lite53.7%301$0.0000Compare
MoonshotAIKimi K2.535.8%2.7K$0.0021Compare

Alternatives to Claude Opus 4

Other models worth comparing for similar use cases.

Google
Gemini 3.1 Pro
Gemini 3.1 Pro is a proprietary multimodal model from Google’s Gemini 3 series, released in early 2026 and designed for advanced reasoning across large multimodal datasets. It accepts text, images, audio, video, and documents, supporting up to a 1-million-token input context with up to 64k output tokens. Compared with Gemini 3 Pro, it improves long-context synthesis and multi-step reasoning, enabling more reliable analysis of large documents, datasets, and software codebases.The model also advances visual understanding and grounding, allowing it to interpret UI screenshots, diagrams, and real-world scenes while referencing specific regions within images or video. These capabilities make Gemini 3.1 Pro well suited for multimodal workflows involving document processing, interface analysis, robotics research, and complex visual reasoning.
OpenAI
GPT-5
GPT-5, released by OpenAI in August 2025, is a multimodal large language model that advances beyond the GPT-4 family with a new “unified system” architecture. This design allows the model to dynamically choose between fast responses and extended reasoning depending on task complexity. It supports text, code, and images, alongside stronger tool use and agentic workflows, making it more adaptable for real-world problem solving. While its exact context window size is not disclosed, GPT-5 is optimized for long-horizon reasoning and multi-step tool chaining, indicating substantially expanded capacity over its predecessors.The release introduced specialized variants: GPT-5 Pro, offering extended reasoning for complex workflows, and GPT-5 Codex, optimized for advanced coding tasks such as large-scale refactoring and code review. GPT-5 shows benchmark gains in coding, biomedical reasoning, multimodal analysis, and scientific tasks. Developers also gain new controls, such as verbosity and personalization parameters, for greater steerability. With these improvements, GPT-5 positions itself as OpenAI’s most capable and versatile model, suited for enterprise automation, research, healthcare, and sophisticated coding environments.

Claude Opus 4 License

Proprietary

License terms and commercial-use guidance for Claude Opus 4.

License information is provided as a guide and is not legal advice.