Gemma 3 27B, announced on March 12, 2025, is the largest open-weight model in Google DeepMind’s Gemma 3 family. With around 27 billion parameters, it is multimodal—accepting both text and images as input and producing text outputs. It supports a 128,000-token context window and typically generates up to ~8,192 tokens, enabling it to process multi-page documents, extended conversations, or large batches of images in a single prompt.
The model is instruction-tuned in its “-it” variants for chat, reasoning, and summarization use cases, and it supports structured outputs and function calling. It is multilingual, covering over 140 languages. Deployment is flexible: the full BF16 model requires ~46 GB of VRAM, but quantization-aware training (QAT) versions in 8-bit or 4-bit reduce the footprint significantly, allowing more accessible use outside large-scale clusters. While it delivers stronger reasoning and multimodal performance than smaller Gemma models, it remains lighter and more open than proprietary systems, making it well-suited for research, development, and fine-tuned applications.
Drag and drop an image here, or click to browse
Captioning will run automatically
—
Usage
Past 30 Days| Category | Passed | Score |
|---|---|---|
| Document Understanding | 7 / 9 | 77.8% |
| Object Understanding | 10 / 14 | 71.4% |
| Spatial Understanding | 12 / 19 | 63.2% |
| Defect Detection | 9 / 15 | 60% |
| Object Counting | 1 / 10 | 10% |
Scores based on single evaluation run · Methodology
View all Vision Evals →Gemma 3 27B costs $0.080 per 1M input tokens and $0.160 per 1M output tokens.
Pricing updated Jun 22, 2026
Other models worth comparing for similar use cases.
License terms and commercial-use guidance for Gemma 3 27B.
License information is provided as a guide and is not legal advice.