US-JEPA: A Joint Embedding Predictive Architecture for Medical Ultrasound
About
Ultrasound (US) imaging poses unique challenges for representation learning due to its inherently noisy acquisition process. The low signal-to-noise ratio and stochastic speckle patterns hinder standard self-supervised learning methods relying on a pixel-level reconstruction objective. Joint-Embedding Predictive Architectures (JEPAs) address this drawback by predicting masked latent representations rather than raw pixels. However, standard approaches depend on hyperparameter-brittle and computationally expensive online teachers updated via exponential moving average. We propose US-JEPA, a self-supervised framework that adopts the Static-teacher Asymmetric Latent Training (SALT) objective. By using a frozen, domain-specific teacher to provide stable latent targets, US-JEPA decouples student-teacher optimization and pushes the student to expand upon the semantic priors of the teacher. In addition, we provide the first rigorous comparison of all publicly available state-of-the-art ultrasound foundation models on UltraBench, a public dataset benchmark spanning multiple organs and pathological conditions. Under linear probing for diverse classification tasks, US-JEPA achieves performance competitive with or superior to domain-specific and universal vision foundation model baselines. Our results demonstrate that masked latent prediction provides a stable and efficient path toward robust ultrasound representations.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Ultrasound Image Classification | BUSBRA (test) | Macro F176 | 10 | |
| Ultrasound Image Classification | FATTY LIV. (test) | Macro F189.2 | 10 | |
| Ultrasound Image Classification | GBCU (test) | Macro F170.2 | 10 | |
| Ultrasound Image Classification | MMOTU (test) | Macro F152.2 | 10 | |
| Ultrasound Image Classification | POCUS (test) | Macro F193.1 | 10 | |
| Ultrasound Image Classification | AUL (test) | Macro F10.696 | 10 | |
| Ultrasound Image Classification | TN5000 (test) | Macro F1 Score73.1 | 10 | |
| Ultrasound Image Classification | BUTTERFLY (test) | Macro F191.5 | 10 |