Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

nnU-Net Revisited: A Call for Rigorous Validation in 3D Medical Image Segmentation

About

The release of nnU-Net marked a paradigm shift in 3D medical image segmentation, demonstrating that a properly configured U-Net architecture could still achieve state-of-the-art results. Despite this, the pursuit of novel architectures, and the respective claims of superior performance over the U-Net baseline, continued. In this study, we demonstrate that many of these recent claims fail to hold up when scrutinized for common validation shortcomings, such as the use of inadequate baselines, insufficient datasets, and neglected computational resources. By meticulously avoiding these pitfalls, we conduct a thorough and comprehensive benchmarking of current segmentation methods including CNN-based, Transformer-based, and Mamba-based approaches. In contrast to current beliefs, we find that the recipe for state-of-the-art performance is 1) employing CNN-based U-Net models, including ResNet and ConvNeXt variants, 2) using the nnU-Net framework, and 3) scaling models to modern hardware resources. These results indicate an ongoing innovation bias towards novel architectures in the field and underscore the need for more stringent validation standards in the quest for scientific progress.

Fabian Isensee, Tassilo Wald, Constantin Ulrich, Michael Baumgartner, Saikat Roy, Klaus Maier-Hein, Paul F. Jaeger• 2024

Related benchmarks

TaskDatasetResultRank
3D Medical Image SegmentationMSWAL
DSC56.24
56
Multi-organ SegmentationBTCV (test)
Spl95.95
55
Abdominal Organ SegmentationBTCV (val)
Spleen Dice96.39
14
Medical Image SegmentationD1 Pediatric Organs in CT
DSC82.62
14
Medical Image SegmentationD6
DSC96.89
14
Medical Image SegmentationToothfairy D3
DSC88.55
14
Medical Image SegmentationD4
DSC65.67
14
Medical Image SegmentationD5 Pancreatic Tumor in MR
DSC69.11
14
Medical Image SegmentationD2
DSC87.03
14
Medical Image SegmentationMSD Hippocampus (test)
Dice (Ant.)89.82
12
Showing 10 of 13 rows

Other info

Code

Follow for update