Llama 4 Scout, released on April 5, 2025, is one of Meta AI’s first Llama 4 multimodal models, alongside Maverick. It accepts text and image inputs and produces text outputs, with a knowledge cutoff of August 2024. Scout is notable for its 10-million-token context window, which makes it well suited to analyzing very long documents, extended conversations, or large codebases.
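To give a feel for what a 10-million-token window means in practice, the sketch below estimates whether a set of documents would fit. It is a rough illustration only: the 4-characters-per-token ratio is a common rule of thumb for English text rather than Scout's actual tokenizer, and `estimate_tokens`, `fits_in_context`, and the `reserved_for_output` default are hypothetical names and values chosen for this example.

```python
# Context window size taken from the text above.
CONTEXT_WINDOW_TOKENS = 10_000_000

# Assumption: ~4 characters per token, a rough heuristic for English text.
# A real tokenizer will give different counts, especially for code or
# non-English languages.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Return a rough token estimate using ceiling division on characters."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(texts, reserved_for_output: int = 8_192) -> bool:
    """Check whether the texts, plus room reserved for the reply, fit.

    reserved_for_output is a placeholder value, not a documented limit.
    """
    prompt_tokens = sum(estimate_tokens(t) for t in texts)
    return prompt_tokens + reserved_for_output <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    docs = ["lorem ipsum " * 1_000, "def f(x):\n    return x\n" * 500]
    total = sum(estimate_tokens(d) for d in docs)
    print(f"estimated prompt tokens: {total}")
    print(f"fits in context: {fits_in_context(docs)}")
```

For anything beyond a quick sanity check, counting tokens with the model's own tokenizer is the safer choice.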
Architecturally, Scout uses a Mixture-of-Experts (MoE) design with 16 experts, activating ~17B parameters per token out of ~109B total parameters, which balances capacity with efficiency. It officially supports 12 languages (including English, Arabic, French, Hindi, and Spanish) and offers multimodal reasoning over images (captioning, Q&A, recognition). Meta highlights that Scout can run on a single Nvidia H100 GPU when quantized to Int4, making it more accessible than larger-scale Llama 4 models. However, its output token limit is far smaller than its 10M input window, image input support is still constrained, and the Llama 4 Community License requires a separate license from Meta for products with more than 700 million monthly active users.
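The parameter figures above lend themselves to a quick back-of-the-envelope calculation. The sketch below works out what share of the weights is active per token and how much memory the full set of weights would need at a few precisions. It counts weights only (no KV cache, activations, or runtime overhead), and the 80 GB H100 capacity and the bytes-per-parameter table are assumptions made for illustration, as are the function names.

```python
# Figures from the text: ~109B total parameters, ~17B active per token.
TOTAL_PARAMS = 109e9
ACTIVE_PARAMS = 17e9

# Assumption: the 80 GB H100 variant.
H100_MEMORY_GB = 80

# Assumption: idealized storage cost per parameter at each precision.
BYTES_PER_PARAM = {"bf16": 2.0, "int8": 1.0, "int4": 0.5}


def active_fraction(active: float, total: float) -> float:
    """Share of parameters used for a single token in the MoE model."""
    return active / total


def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Memory needed for the weights alone, in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


if __name__ == "__main__":
    share = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
    print(f"active share per token: {share:.1%}")
    for precision, nbytes in BYTES_PER_PARAM.items():
        gb = weight_memory_gb(TOTAL_PARAMS, nbytes)
        verdict = "fits" if gb < H100_MEMORY_GB else "does not fit"
        print(f"{precision}: {gb:.1f} GB of weights, {verdict} in {H100_MEMORY_GB} GB")
```

Under these assumptions only the 4-bit weights (about 54.5 GB) fit within a single 80 GB card. All 109B parameters must still be resident in memory even though only about 16% of them are used for any given token, which is why MoE reduces compute per token more than it reduces memory.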