MobileFetalCLIP: Selective Repulsive Knowledge Distillation for Mobile Fetal Ultrasound Analysis
About
Fetal ultrasound AI could transform prenatal care in low-resource settings, yet current foundation models exceed 300M visual parameters, precluding deployment on point-of-care devices. Standard knowledge distillation fails under such extreme capacity gaps (~26x), as compact students waste capacity mimicking architectural artifacts of oversized teachers. We introduce Selective Repulsive Knowledge Distillation, which decomposes contrastive KD into diagonal and off-diagonal components: matched pair alignment is preserved while the off-diagonal weight decays into negative values, repelling the student from the teacher's inter-class confusions and forcing discovery of architecturally native features. Our 11.4M parameter student surpasses the 304M-parameter FetalCLIP teacher on zero-shot HC18 biometry validity (88.6% vs. 83.5%) and brain sub-plane F1 (0.784 vs. 0.702), while running at 1.6 ms on iPhone 16 Pro, enabling real-time assistive AI on handheld ultrasound devices. Our code, models, and app are publicly available at https://github.com/numanai/MobileFetalCLIP.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Fetal Ultrasound Plane Classification | Planes DB | F1 Score (5 Planes)94.6 | 9 | |
| Biometric Validity | HC18 | HC1888.6 | 8 | |
| Image Classification | Planes DB 6-View (test) | F1 Score93 | 5 | |
| Image Classification | Planes DB Brain (test) | F1 Score79.9 | 5 | |
| Image Classification | CHD (test) | AUROC76.9 | 5 | |
| Standard Plane Identification | Fetal Ultrasound | iPhone 16 Pro Latency (ms)1.6 | 2 |