RF-DETR vs SAM 3

Compare RF-DETR and SAM 3 side-by-side.

These models don't share enough common tasks for a side-by-side demo. See the comparison table below for their capabilities.

RF-DETR vs SAM 3: Overview

RF-DETR

RF-DETR is a real-time, transformer-based object detection model developed by Roboflow. Its code and weights were first released in March 2025 under the Apache 2.0 license. It is the first real-time model to exceed 60 AP on the Microsoft COCO benchmark. It is built on a DINOv2 vision transformer backbone and uses weight-sharing neural architecture search to identify accuracy-latency trade-offs. The full family spans six sizes, from Nano (30.5M parameters, 384×384 input) to 2XL (126.9M parameters, 880×880 input), and the accompanying research paper was accepted to ICLR 2026.

RF-DETR is designed for strong domain adaptability, achieving state-of-the-art performance on RF100-VL, a benchmark measuring generalization to real-world object detection tasks across diverse domains. It is deployable through Roboflow Inference and supports fine-tuning on custom datasets, making it well suited for domain-specific applications with limited training data.

SAM 3

Released on November 19th, 2025, Segment Anything 3 (SAM 3) is a zero-shot image segmentation model that “detects, segments, and tracks objects in images and videos based on concept prompts.” This model was developed by Meta as the third model in the Segment Anything series.

Unlike the previous SAM models (Segment Anything and Segment Anything 2), SAM 3 can be given a text prompt such as “shipping container” and will generate precise segmentation masks for all shipping containers in an image. Each mask corresponds to the location of an object found with the text prompt.
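A detection model reports each object as a bounding box, while a segmentation model such as SAM 3 reports it as a mask. One way to relate the two output types is to reduce a mask to the tightest box that contains it. The snippet below is a minimal sketch of that idea in plain Python. It assumes a mask is a list of rows of 0/1 values, and `mask_to_box` is a hypothetical helper written for illustration, not part of RF-DETR, SAM 3, or Roboflow Inference.

```python
from typing import List, Optional, Tuple

# (x_min, y_min, x_max, y_max) in inclusive pixel coordinates (an assumed convention)
Box = Tuple[int, int, int, int]


def mask_to_box(mask: List[List[int]]) -> Optional[Box]:
    """Return the tightest box around the nonzero pixels of a binary mask.

    Hypothetical helper for illustration. Returns None for an empty mask.
    """
    # Column indices of every nonzero pixel
    xs = [x for row in mask for x, value in enumerate(row) if value]
    # Row indices of every row that contains at least one nonzero pixel
    ys = [y for y, row in enumerate(mask) if any(row)]
    if not xs:
        return None
    return (min(xs), min(ys), max(xs), max(ys))


# Toy 4x5 mask standing in for a single object instance
toy_mask = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]

print(mask_to_box(toy_mask))  # (1, 1, 3, 2)
```

The reverse direction is not possible: a box does not record which pixels inside it belong to the object, which is why a mask carries more information than a box.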

RF-DETR vs SAM 3 Comparison Table

Property	RF-DETR	SAM 3
Organization	Roboflow	Meta
Category	open	closed
Modality	vision	multimodal
Release Date	Mar 2025	Nov 2025
Context Window	—	—
Parameters	30.5M-126.9M
License	Apache 2.0	Proprietary
Vision Tasks
Object Detection	Demo (COCO)	Demo
Instance Segmentation
Promptable Concept Segmentation		Demo
Video Object Tracking
Zero Shot Segmentation
Model Features
Foundation Vision
Real-Time Vision
Zero-shot Detection