GPT-5.5 is a multimodal large language model released by OpenAI on April 23, 2026, engineered for autonomous, multi-step knowledge work and agentic workflows. It accepts text, images, and code as input, featuring enhanced spatial reasoning and visual grounding to support its computer use capabilities for operating software and navigating UI elements. Built to execute complex workflows end-to-end, the model interprets loosely defined tasks, selects appropriate tools, and performs self-verification with minimal user intervention. It is available in a standard version, a Thinking mode for extended reasoning budgets, and a Pro variant that uses parallel test-time compute for maximum precision on complex tasks.
Co-optimized with NVIDIA for GB200 NVL72 infrastructure, GPT-5.5 delivers per-token latency comparable to its predecessor GPT-5.4 while maintaining a 1-million-token context window. Despite increased capability, the model achieves greater token efficiency in coding and data analysis workflows, often completing tasks with fewer total tokens than previous versions. OpenAI reports a 60% reduction in hallucination rate compared to GPT-5.4, improving reliability for accuracy-sensitive applications. API access is available via the Responses and Chat Completions endpoints at $5 per million input tokens and $30 per million output tokens, double the unit price of GPT-5.4.
Drag and drop an image here, or click to browse
—
Usage
Past 30 Days| Category | Passed | Score |
|---|---|---|
| Object Understanding | 13 / 14 | 92.9% |
| Document Understanding | 8 / 9 | 88.9% |
| Defect Detection | 13 / 15 | 86.7% |
| Spatial Understanding | 15 / 19 | 78.9% |
| Object Counting | 3 / 10 | 30% |
| Category | Passed | Score |
|---|---|---|
| License Plate Recognition | 28 / 30 | 93.3% |
| VQA & Extraction | 52 / 60 | 86.7% |
| Text Recognition | 25 / 30 | 83.3% |
| Focused Scene OCR | 77 / 99 | 77.8% |
| Handwritten Math | 4 / 10 | 40% |
Scores based on a single evaluation run · Methodology
View all Vision Evals →GPT-5.5 costs $5.00 per 1M input tokens and $30.00 per 1M output tokens.
Pricing updated Jun 28, 2026
Estimated cost per task vs. Visual Understanding score, for this model and others ranked near it. Upper-left is the sweet spot (high quality, low cost).
8 of 8 models plotted
| Model | Score | Median tokens | Est. cost / task | Compare |
|---|---|---|---|---|
| Gemini 3.5 Flash | 79.1% | 1.4K | $0.0043 | Compare |
| GPT-5.4 | 77.6% | 1.7K | $0.0052 | Compare |
| GPT-5.5(this model) | 77.6% | 1.7K | $0.011 | — |
| Qwen3.5 122B A10B | 76.1% | 1.2K | $0.0003 | Compare |
| Gemini 3.1 Pro | 75.8% | 1.1K | $0.0024 | Compare |
| Gemini 3 Flash | 74.6% | 1.4K | $0.0014 | Compare |
| GPT-5 Mini | 73.1% | 1.8K | $0.0006 | Compare |
| Qwen3.5 27B | 71.6% | 1.2K | $0.0002 | Compare |
Other models worth comparing for similar use cases.
License terms and commercial-use guidance for GPT-5.5.
License information is provided as a guide and is not legal advice.