YOLOv5 Overview

YOLOv5 is an object detection model developed by Ultralytics, released in June 2020 and distributed under the AGPL-3.0 license. It is implemented in PyTorch and offered a more accessible, better-documented YOLO implementation than earlier Darknet-based versions, with an integrated training and export pipeline that supports a wide range of deployment targets. YOLOv5 uses a CSP backbone, a PANet neck, and a single-stage detection head with anchor-based box regression.
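
To make "anchor-based regression" concrete, here is a minimal sketch of how one raw head output could be decoded into a box. It assumes the decode used in recent releases of the Ultralytics YOLOv5 repository (sigmoid-scaled center offsets and squared width/height scaling relative to an anchor). The function name `decode_anchor_box` and the example values are hypothetical and chosen only for illustration.

```python
import math


def sigmoid(x: float) -> float:
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def decode_anchor_box(raw, cell_xy, anchor_wh, stride):
    """Decode one raw prediction (tx, ty, tw, th) into (x, y, w, h) in pixels.

    raw       -- raw network outputs for a single anchor at a single grid cell
    cell_xy   -- integer (column, row) index of the grid cell
    anchor_wh -- anchor width and height in pixels (illustrative values)
    stride    -- downsampling factor of the feature map (e.g. 8, 16, 32)
    """
    tx, ty, tw, th = raw
    cx, cy = cell_xy
    aw, ah = anchor_wh
    # Center: the offset ranges over (-0.5, 1.5) cells around the cell origin.
    x = (2.0 * sigmoid(tx) - 0.5 + cx) * stride
    y = (2.0 * sigmoid(ty) - 0.5 + cy) * stride
    # Size: a bounded multiple (0 to 4x) of the anchor dimensions.
    w = (2.0 * sigmoid(tw)) ** 2 * aw
    h = (2.0 * sigmoid(th)) ** 2 * ah
    return x, y, w, h


# All-zero raw outputs decode to the cell center with the anchor's own size.
box = decode_anchor_box((0.0, 0.0, 0.0, 0.0), cell_xy=(3, 2), anchor_wh=(10.0, 13.0), stride=8)
print(box)  # (28.0, 20.0, 10.0, 13.0)
```

Because the width and height are bounded multiples of the anchor, the predicted size cannot grow without limit. Anchor-free heads such as the one in YOLOv8 drop the anchor priors and predict box extents directly.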

YOLOv5 is available in five sizes from Nano to Extra Large and supports export to ONNX, TensorRT, CoreML, and other formats. It is one of the most widely deployed object detection models in production environments and remains a common starting point for custom detection model training due to its documentation, community support, and compatibility with Roboflow Inference.

YOLOv5 Details & Performance

Vision Tasks

Object Detection

Features

Real-Time Vision

Alternatives to YOLOv5

Other models worth comparing for similar use cases.

YOLOv8
YOLOv8 is an object detection and multi-task vision model developed by Ultralytics, released in January 2023 under the AGPL-3.0 license. It succeeds YOLOv5 and introduces an anchor-free detection head, a new C2f module for improved gradient flow, and a decoupled head that separates classification and regression tasks. These changes improve both accuracy and training efficiency compared to earlier Ultralytics models. YOLOv8 supports object detection, instance segmentation, image classification, pose estimation, and oriented bounding box detection within a unified codebase. It is available in five sizes from Nano to Extra Large and exports to ONNX, TensorRT, CoreML, and other formats. YOLOv8 is one of the most widely adopted detection models in production and is directly supported by Roboflow Inference for custom model training and deployment.
YOLO11
YOLO11 is an object detection and multi-task vision model developed by Ultralytics, released in September 2024 under the AGPL-3.0 license. It is the latest generation in the Ultralytics YOLO series and supports object detection, instance segmentation, image classification, pose estimation, and oriented bounding box detection within a single unified framework. YOLO11 introduces architectural refinements that improve accuracy while reducing parameter count compared to YOLOv8 at equivalent model sizes. YOLO11 is available in five model sizes from Nano to Extra Large and is deployable through the Ultralytics Python package, Roboflow Inference, and export formats including ONNX, TensorRT, and CoreML. It supports fine-tuning on custom datasets through the standard Ultralytics training API.
YOLO26
YOLO26 is a real-time object detection model developed by Ultralytics, released in October 2025. It introduces a native end-to-end, NMS-free architecture that eliminates the Non-Maximum Suppression post-processing step, reducing CPU latency by up to 43% for the Nano variant compared to NMS-dependent versions. The model incorporates the MuSGD optimizer and ProgLoss with STAL for improved training stability and small-object detection, and removes Distribution Focal Loss to ensure maximum compatibility with ONNX and TensorRT export targets. YOLO26 supports object detection, instance segmentation, pose estimation, and oriented bounding box detection within a unified framework, with model sizes available from Nano to Extra Large. Its NMS-free design makes it particularly well suited for deployment scenarios where post-processing overhead is a bottleneck, such as embedded systems and real-time edge inference pipelines.
YOLOv9
YOLOv9 is a real-time object detection model developed by Chien-Yao Wang and Hong-Yuan Mark Liao at Academia Sinica, released in February 2024 under the GPL-3.0 license. It introduces Programmable Gradient Information (PGI), a mechanism that preserves complete input information through auxiliary reversible branches during training to address information loss in deep network layers. It also introduces the Generalized Efficient Layer Aggregation Network (GELAN), which achieves better parameter utilization than prior CSP-based designs. YOLOv9-C achieves 53.0% AP on COCO with 42% fewer parameters and 21% less computation than YOLOv7 AF at comparable accuracy. YOLOv9-E achieves 55.6% AP. The model is deployable through Roboflow Inference and supports fine-tuning via the standard training pipeline in the official repository.
YOLOv10
YOLOv10 is a real-time end-to-end object detection model developed by THU-MIG at Tsinghua University, released in May 2024 under the AGPL-3.0 license. It introduces consistent dual assignments during training, combining one-to-many and one-to-one label assignment strategies, to eliminate the need for non-maximum suppression at inference time while maintaining competitive accuracy. This end-to-end design reduces inference latency compared to NMS-dependent detectors at similar accuracy levels. YOLOv10-B achieves 52.7% AP on COCO with 46% lower latency than YOLOv9-C at comparable performance. The model is available in six sizes from Nano to Extra Large, built on the Ultralytics framework, and exportable to ONNX, TensorRT, and CoreML. YOLOv10 is suited for latency-sensitive deployment scenarios where post-processing overhead is a constraint.
RT-DETR
RT-DETR (Real-Time Detection Transformer) is an object detection model developed by Baidu, released in April 2023 under the Apache 2.0 license. Its authors present it as the first real-time end-to-end transformer-based object detector. It addresses the inference speed limitations of earlier DETR models through an efficient hybrid encoder that decouples intra-scale interaction and cross-scale fusion, so the model can process multi-scale features without the high computational overhead of standard transformer encoders. RT-DETR achieves 53.1% AP on COCO at 108 FPS on an NVIDIA T4 GPU for the RT-DETR-R50 variant, outperforming comparably sized YOLO detectors at similar speeds. It maintains end-to-end inference without non-maximum suppression, simplifying deployment pipelines. RT-DETR established the baseline for real-time transformer detection and has been extended by subsequent works including RF-DETR and RT-DETRv2.
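
Several of the alternatives above (YOLO26, YOLOv10, RT-DETR) are described as NMS-free. For context, the following is a minimal pure-Python sketch of the greedy non-maximum suppression step that NMS-dependent detectors such as YOLOv5 run after inference. The helper names `iou` and `greedy_nms`, the `(x1, y1, x2, y2)` box format, and the 0.45 threshold are assumptions for illustration, not any library's API. Production pipelines typically use an optimized, batched implementation.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(boxes, scores, iou_threshold=0.45):
    """Return indices of boxes to keep, highest score first.

    A box is kept only if it does not overlap any already-kept,
    higher-scoring box by more than iou_threshold.
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


# Two heavily overlapping detections and one separate detection.
boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
print(greedy_nms(boxes, scores))  # [0, 2]
```

In this sketch each candidate is compared against every box already kept, so the cost grows with the number of detections. That per-image post-processing is the overhead the NMS-free designs avoid.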

YOLOv5 License

AGPL-3.0

License terms and commercial-use guidance for YOLOv5.

License information is provided as a guide and is not legal advice.