BriMA: Bridged Modality Adaptation for Multi-Modal Continual Action Quality Assessment
About
Action Quality Assessment (AQA) aims to score how well an action is performed and is widely used in sports analysis, rehabilitation assessment, and human skill evaluation. Multi-modal AQA has recently achieved strong progress by leveraging complementary visual and kinematic cues, yet real-world deployments often suffer from non-stationary modality imbalance, where certain modalities become missing or intermittently available due to sensor failures or annotation gaps. Existing continual AQA methods overlook this issue and assume that all modalities remain complete and stable throughout training, which restricts their practicality. To address this challenge, we introduce Bridged Modality Adaptation (BriMA), an innovative approach to multi-modal continual AQA under modality-missing conditions. BriMA consists of a memory-guided bridging imputation module that reconstructs missing modalities using both task-agnostic and task-specific representations, and a modality-aware replay mechanism that prioritizes informative samples based on modality distortion and distribution drift. Experiments on three representative multi-modal AQA datasets (RG, Fis-V, and FS1000) show that BriMA consistently improves performance under different modality-missing conditions, achieving 6-8% higher correlation and 12-15% lower error on average. These results demonstrate a step toward robust multi-modal AQA systems under real-world deployment constraints.
Related benchmarks
| Task | Dataset | Result | Rank |
|---|---|---|---|
| Action Quality Assessment | FS1000 Modality Missing Rate β=10% (supplementary) | SRCC (Avg.): 0.756 | 13 |
| Action Quality Assessment | FS1000 Modality Missing Rate β=25% (supplementary) | SRCC (Avg.): 0.74 | 12 |
| Action Quality Assessment | FS1000 Modality Missing Rate β=50% (supplementary) | SRCC (Avg.): 0.698 | 12 |
| Action Quality Assessment | RG β = 10% | SRCC (Ball): 0.648 | 12 |
| Action Quality Assessment | RG β = 25% | SRCC (Ball): 0.592 | 12 |
| Action Quality Assessment | RG β = 50% | SRCC (Ball): 0.569 | 12 |
| Sentiment intensity prediction | CMU MOSI beta = 10% (test) | SRCC: 0.734 | 12 |
| Sentiment intensity prediction | CMU MOSI beta = 25% (test) | SRCC: 0.7 | 12 |
| Sentiment intensity prediction | CMU MOSI beta = 50% (test) | SRCC: 0.683 | 12 |
| Action Quality Assessment | RG Full Modality | -- | 1 |
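
The results in the table are reported as SRCC (Spearman's rank correlation coefficient) at a modality missing rate β. As a rough illustration of what these two quantities mean, the sketch below computes SRCC from predicted and ground-truth scores and draws a missing-modality mask for a given β. It is not the paper's evaluation code: the function names, the modality names `visual` and `kinematic`, and the masking protocol (exactly `round(beta * N)` samples each lose one randomly chosen modality) are assumptions made for illustration only.

```python
import math
import random


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def srcc(predicted, target):
    """Spearman correlation: Pearson correlation of the two rank vectors.

    Assumes neither input is constant (otherwise the variance is zero).
    """
    rp, rt = average_ranks(predicted), average_ranks(target)
    mp, mt = sum(rp) / len(rp), sum(rt) / len(rt)
    cov = sum((a - mp) * (b - mt) for a, b in zip(rp, rt))
    var_p = sum((a - mp) ** 2 for a in rp)
    var_t = sum((b - mt) ** 2 for b in rt)
    return cov / math.sqrt(var_p * var_t)


def sample_missing_mask(num_samples, modalities, beta, seed=0):
    """Hypothetical protocol: round(beta * N) samples each lose one modality.

    Returns one dict per sample mapping modality name -> True if available.
    """
    rng = random.Random(seed)
    affected = set(rng.sample(range(num_samples), round(beta * num_samples)))
    mask = []
    for i in range(num_samples):
        dropped = rng.choice(modalities) if i in affected else None
        mask.append({m: m != dropped for m in modalities})
    return mask


if __name__ == "__main__":
    predicted = [7.1, 8.4, 6.0, 9.2]
    target = [7.0, 8.0, 6.5, 9.5]
    print("SRCC:", srcc(predicted, target))
    print(sample_missing_mask(4, ["visual", "kinematic"], beta=0.5))
```

Because SRCC depends only on rank order, a model can score well on it while still being miscalibrated in absolute terms, which is presumably why the abstract reports an error metric alongside correlation.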