| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Disease Classification | Stanford external (val) | AUROC 0.977 | 18 |
| Single-conditional Image Retrieval | Stanford40 | Action Accuracy 72.2 | 12 |
| Action Recognition | Stanford-40 | mAP 96.21 | 8 |
| Thyroid Ultrasound Segmentation | Stanford | DSC 97.63 | 7 |
| LVEF estimation | Stanford | MAE (Original) 3.97 | 5 |
| Monocular Depth Estimation | Stanford (test) | AbsRel 0.0666 | 5 |
| Point cloud registration | Stanford (30% overlapped) | Latency (s) 5.3 | 5 |
| Shape Reconstruction | Stanford | L1 Chamfer Distance (dCD) 0.33 | 4 |
| MCQ Diagnostic Accuracy | Stanford Multimodal (test) | Accuracy 73.7 | 3 |
| MCQ Diagnostic Accuracy | Stanford ECG (test) | Accuracy 88 | 3 |
| Fine-grained Image Classification | Stanford-40 (test) | Accuracy 98.8 | 2 |
| Battery cycle life prediction | Stanford (test) | Imp 9.26 | 1 |
| Visual Recognition | Stanford-40 | Top-1 Accuracy 97.83 | 1 |