Llama 4 Scout, released on April 5, 2025, is one of Meta AI’s first Llama 4 multimodal models, alongside Maverick. It accepts text and image inputs and produces text outputs, with a knowledge cutoff of August 2024. Scout is notable for its extremely large context window of 10 million tokens, which makes it well suited to analyzing very long documents, extended conversations, or large codebases.
Architecturally, Scout is a Mixture-of-Experts (MoE) model with 16 experts that activates ~17B of its ~109B total parameters per token, balancing capacity with efficiency. It officially supports 12 languages (including English, Arabic, French, Hindi, and Spanish) and offers multimodal reasoning over images (captioning, Q&A, recognition). Meta highlights that Scout can run on a single Nvidia H100 GPU, making it more accessible than larger-scale Llama 4 models. However, its output token limit is far smaller than its 10M-token input window, image input support is still constrained, and license restrictions apply to large-scale commercial deployments.
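To put the parameter counts in context, here is a rough back-of-the-envelope sketch in Python. It computes the share of parameters that is active at once and a weights-only memory estimate at a few numeric precisions. The 80 GB H100 capacity and the bytes-per-parameter table are assumptions made for illustration, not figures from this page, and the estimate ignores the KV cache, activations, and runtime overhead.

```python
# Parameter counts quoted above (approximate).
TOTAL_PARAMS = 109e9   # ~109B total parameters
ACTIVE_PARAMS = 17e9   # ~17B active parameters

# Assumption: an H100 with 80 GB of memory.
H100_MEMORY_GB = 80

# Assumption: storage cost per parameter at common precisions.
BYTES_PER_PARAM = {"bf16": 2.0, "int8": 1.0, "int4": 0.5}


def active_fraction() -> float:
    """Share of the total parameters that is active at once."""
    return ACTIVE_PARAMS / TOTAL_PARAMS


def weight_memory_gb(precision: str) -> float:
    """Weights-only memory estimate in GB (1 GB = 1e9 bytes)."""
    return TOTAL_PARAMS * BYTES_PER_PARAM[precision] / 1e9


if __name__ == "__main__":
    print(f"Active share of parameters: {active_fraction():.1%}")
    for precision in BYTES_PER_PARAM:
        gb = weight_memory_gb(precision)
        verdict = "fits" if gb <= H100_MEMORY_GB else "does not fit"
        print(f"{precision}: ~{gb:.1f} GB of weights, {verdict} in {H100_MEMORY_GB} GB")
```

Under these assumptions, all 109B parameters still have to be held in memory even though only about 17B are used at a time, so whether the model fits on a single GPU depends mainly on the precision at which the weights are stored.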
Llama 4 Scout costs $0.100 per 1M input tokens and $0.300 per 1M output tokens.
Pricing updated Jun 28, 2026
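As a quick illustration of how these per-token rates translate into request costs, the following Python sketch estimates the price of a single call. The function name and the example token counts are hypothetical, and the rates are simply the ones listed above, which may change.

```python
# Listed rates in USD per 1M tokens (taken from the pricing line above).
INPUT_USD_PER_MILLION = 0.100
OUTPUT_USD_PER_MILLION = 0.300


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: a full 10M-token context with a 2,000-token reply.
    cost = estimate_cost_usd(10_000_000, 2_000)
    print(f"Estimated cost: ${cost:.4f}")
```

At these rates, a request that fills the entire 10M-token context costs about one dollar in input tokens alone, so very long prompts dominate the bill even though output tokens are priced three times higher.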
License terms and commercial-use guidance for Llama 4 Scout.
License information is provided as a guide and is not legal advice.