Cohort-Based Active Modality Acquisition
About
Real-world multimodal machine learning often faces missing, costly-to-acquire modalities, raising the problem of which samples to prioritize for additional acquisition under a budget. Prior work mainly studies per-sample or training-time acquisition, while test-time, cohort-level acquisition is less explored. We propose Cohort-based Active Modality Acquisition (CAMA), a novel test-time, cohort-level modality acquisition setting, and introduce imputation-based acquisition strategies that estimate the expected utility of acquiring a missing modality, along with upper-bound heuristics for benchmarking. Experiments on datasets with up to 15 modalities demonstrate that our proposed imputation-based strategies guide the acquisition of an additional modality for selected samples more effectively than methods relying solely on pre-acquisition information, entropy-based guidance, or random selection. We showcase the real-world relevance and scalability of our method by demonstrating its ability to guide the acquisition of proteomics data for disease prediction in a large prospective cohort, the UK Biobank (UKB). Our work provides an effective approach for optimizing modality acquisition at the cohort level, enabling more efficient use of resources in constrained settings.
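To make the cohort-level setting concrete, the following is a minimal sketch of imputation-guided selection under a budget. It is not the paper's implementation: the scoring rule (mean absolute shift in predicted probability across imputed draws), the function names `expected_shift` and `select_for_acquisition`, and the toy numbers are all assumptions chosen for illustration.

```python
from statistics import mean


def expected_shift(p_uni, p_imputed):
    """Hypothetical utility proxy: mean absolute change in the predicted
    probability when the missing modality is replaced by imputed draws."""
    return mean(abs(p - p_uni) for p in p_imputed)


def select_for_acquisition(p_uni, p_imputed, budget):
    """Rank the whole cohort by the proxy score and keep the top `budget` samples."""
    scores = [expected_shift(u, imp) for u, imp in zip(p_uni, p_imputed)]
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:budget], scores


# Toy cohort (made-up values): prediction from the available modality only,
# and predictions after plugging in three imputed versions of the missing one.
p_uni = [0.50, 0.90, 0.20, 0.55]
p_imputed = [
    [0.80, 0.70, 0.90],
    [0.90, 0.88, 0.92],
    [0.25, 0.15, 0.20],
    [0.10, 0.20, 0.15],
]

chosen, scores = select_for_acquisition(p_uni, p_imputed, budget=2)
print(chosen)
```

In this sketch the budget is spent on the samples whose predictions are expected to move most once the extra modality is available; samples whose imputed predictions barely differ from the unimodal one are left as they are.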
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Active Modality Acquisition | MIMIC Symile G_full (test) | Fracture Score | 0.911 | 44 |
| Active Modality Acquisition | MOSEI Audio imputed by Text (test) | AUROC | 2.306 | 11 |
| Active Modality Acquisition | MOSEI Text imputed by Image | AUROC (G_full) | 0.9 | 11 |
| Active Modality Acquisition | MOSEI Text imputed by Image and Audio (test) | AUROC | 0.892 | 11 |
| Active Modality Acquisition (Text imputed by Audio) | MOSEI | AUROC | 3.157 | 11 |
| Active Modality Acquisition | MOSEI Audio imputed by Image | AUROC | 0.8 | 11 |
| Active Modality Acquisition | MOSEI | AUROC (G_full) | 0.855 | 11 |
| Active Modality Acquisition | MOSEI Audio imputed by Image and Text (test) | AUROC (G_full) | 0.857 | 11 |
| Active Modality Acquisition | UKB | AUROC | 0.641 | 11 |
| Active Modality Acquisition | Symile | AUROC (Cardiomegaly) | 0.747 | 11 |