Bonnet: Ultra-fast whole-body bone segmentation from CT scans
About
This work proposes Bonnet, an ultra-fast sparse-volume pipeline for whole-body bone segmentation from CT scans. Accurate bone segmentation is important for surgical planning and anatomical analysis, but existing 3D voxel-based models such as nnU-Net and STU-Net require heavy computation and often take several minutes per scan, which limits time-critical use. The proposed Bonnet addresses this by integrating a series of novel framework components including HU-based bone thresholding, patch-wise inference with a sparse spconv-based U-Net, and multi-window fusion into a full-volume prediction. Trained on TotalSegmentator and evaluated without additional tuning on RibSeg, CT-Pelvic1K, and CT-Spine1K, Bonnet achieves high Dice across ribs, pelvis, and spine while running in only 2.69 seconds per scan on an RTX A6000. Compared to strong voxel baselines, Bonnet attains a similar accuracy but reduces inference time by roughly 25x on the same hardware and tiling setup. The toolkit and pre-trained models will be released at https://github.com/HINTLab/Bonnet.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Bone Segmentation | TotalSegmentator 89 scans (test) | Dice (Rib)94.25 | 12 | |
| Bone Segmentation | TotalSegmentator (test) | Dice (Rib)94.25 | 12 | |
| Bone Segmentation | CT-Pelvic1K | Dice Coefficient96.8 | 12 | |
| Bone Segmentation | CT-Spine1K | Dice Coefficient93.63 | 12 | |
| Bone Segmentation | RibSeg | Dice85.91 | 12 |