C-RADIOv4 (Tech Report)

About

By leveraging multi-teacher distillation, agglomerative vision backbones provide a unified student model that retains and improves the distinct capabilities of multiple teachers. In this tech report, we describe the most recent release of the C-RADIO family of models, C-RADIOv4, which builds upon AM-RADIO/RADIOv2.5 in design and offers strong improvements on key downstream tasks at the same computational complexity. We release the -SO400M (412M parameters) and -H (631M parameters) model variants, both trained with an updated set of teachers: SigLIP2, DINOv3, and SAM3. In addition to improvements on core metrics and new capabilities from imitating SAM3, the C-RADIOv4 model family further improves any-resolution support, brings back the ViTDet option for drastically enhanced efficiency at high resolution, and comes with a permissive license.

Mike Ranzinger, Greg Heinrich, Collin McCarthy, Jan Kautz, Andrew Tao, Bryan Catanzaro, Pavlo Molchanov • 2026

Related benchmarks

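| Task                  | Dataset     | Metric          | Result | Rank |
|-----------------------|-------------|-----------------|--------|------|
| Semantic segmentation | ADE20K      | mIoU            | 55.2   | 936  |
| Image Classification  | ImageNet-1K | Top-1 Acc       | 86.59  | 836  |
| Instance Segmentation | SA-Co Gold  | Avg Performance | 44.7   | 10   |
| 3D Awareness          | SPair       | Accuracy        | 60.57  | 8    |
| 3D Awareness          | NAVI        | Accuracy        | 63.44  | 8    |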
