DEIM is a training framework for DETR-based object detection models released in December 2024 by researchers at Intellindust AI Lab, City University of Hong Kong, Great Bay University, and Hefei Normal University. It enhances existing real-time DETR architectures by improving the matcher used during training, enabling faster convergence and higher accuracy without modifying the inference architecture or adding computational overhead at deployment time. DEIM introduces two core techniques: Dense One-to-One (O2O) matching, which increases the number of positive matches per target, and Matchability-Aware Loss (MAL), which down-weights low-quality matches generated by the dense strategy. The paper was accepted at CVPR 2025.
When integrated with RT-DETR and D-FINE, DEIM consistently improves performance while reducing training time by up to 50%. Applied to RT-DETRv2, it achieves 53.2% AP with a single day of training on an NVIDIA 4090 GPU. DEIM-enhanced models including DEIM-D-FINE-L and DEIM-D-FINE-X achieve 54.7% and 56.5% AP at 124 and 78 FPS respectively on an NVIDIA T4 GPU. DEIM is released under the Apache 2.0 license. A successor, DEIMv2, was released in September 2025, adding DINOv3-based backbones and introducing ultra-lightweight variants (Pico, Femto, and Atto) for edge deployment.
Other models worth comparing for similar use cases.
License terms and commercial-use guidance for DEIM.
License information is provided as a guide and is not legal advice.