| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Emotion Recognition | M3ED (test) | Weighted F1 | 52.28 | 35 |
| Emotion Recognition | M3ED (val) | Weighted F1 | 58.7 | 35 |
| Visual-Inertial Odometry | M3ED | Accuracy (Hard Sequence) | 57 | 11 |
| Speech Emotion Recognition | M3ED | Macro F1 | 32.1 | 5 |
| Speech Emotion Recognition | M3ED Chinese (Zh) | Weighted Accuracy (WA) | 49.15 | 5 |
| Semantic Segmentation | M3ED Urban Day (test) | mIoU | 67.28 | 4 |
| Disparity Estimation | M3ED (test) | MAE | 0.813 | 3 |