DETR vs YOLOv4-tiny

Compare DETR and YOLOv4-tiny side-by-side.

Compare DETR vs YOLOv4-tiny live

Run the same image across every model that supports a task and compare their outputs side-by-side.

These models don't share enough common tasks for a side-by-side demo. See the comparison table below for their capabilities.

Models in this comparison

Meta

DETR vs YOLOv4-tiny: Overview

DETR

DETR (Detection Transformer) is an end-to-end object detection model developed by Facebook Research (Meta), released in May 2020. It is one of the first models to eliminate hand-crafted components such as anchor generation and non-maximum suppression by framing object detection as a direct set prediction problem, solved with a transformer encoder-decoder architecture built on top of a CNN backbone.

DETR achieves 42.0% AP on the COCO benchmark with a ResNet-50 backbone, performing comparably to a well-tuned Faster R-CNN at the time of release. Its attention-based design allows it to reason about global context and long-range dependencies within an image. DETR is primarily used as a research baseline and architectural reference, with subsequent works such as Deformable DETR and DINO building on its foundations to address its slower training convergence and limited small-object detection capability.

YOLOv4-tiny

YOLOv4-tiny is a lightweight variant of YOLOv4 developed by Academia Sinica, released in November 2020. It retains the core YOLOv4 design principles while significantly reducing the number of convolutional layers and feature map channels to produce a model suitable for inference on devices with limited compute, including embedded hardware and mobile CPUs. It uses a simplified CSP backbone with fewer layers and two detection scales rather than three.

YOLOv4-tiny is optimized for scenarios where inference speed is prioritized over peak accuracy, achieving substantially higher FPS than full YOLOv4 at the cost of reduced AP on standard benchmarks. It is commonly used in robotics, embedded vision systems, and applications where real-time detection is required without GPU acceleration.

DETR vs YOLOv4-tiny Comparison Table

PropertyDETRYOLOv4-tiny
OrganizationMetaAcademia Sinica
Categoryopenopen
Modalityvisionvision
Release DateMay 2020Nov 2020
Context Window
Parameters~41M
LicenseApache 2.0Custom
Vision Tasks
Object Detection
Model Features
Foundation Vision