| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Human Motion Composition | Babel | PJ1.09 | 13 | |
| Action Localization | BABEL Subset-2 v1.0 (test) | mAP@0.165.2 | 12 | |
| Action Localization | BABEL (test) | mAP@tIoU (Subset-1)53.9 | 12 | |
| Motion Generation | BABEL 2021 (test) | FID0.0004 | 10 | |
| Speech Recognition | BABEL Swahili sw (test) | WER21 | 7 | |
| Speech Recognition | BABEL Tagalog tl (test) | WER29.3 | 7 | |
| Action Localization | BABEL Subset-3 | mAP@0.148.5 | 6 | |
| Action Localization | BABEL Subset-1 | mAP@0.160.5 | 6 | |
| Action Localization | BABEL Subset-3 v1.0 (test) | mAP@0.142 | 6 | |
| Action Localization | BABEL Subset-1 v1.0 (test) | mAP@0.153.7 | 6 | |
| Speech Recognition | BABEL Georgian ka (test) | WER24.3 | 6 | |
| Speech Recognition | BABEL Assamese as (test) | WER39 | 6 | |
| Automatic Speech Recognition | Babel Average (test) | Absolute WER Improvement5.77 | 6 | |
| Automatic Speech Recognition | Babel Lithuanian - LIT IARPA program (test) | WER66 | 6 | |
| Automatic Speech Recognition | Babel Telugu - TEL (test) | WER82.4 | 6 | |
| Automatic Speech Recognition | Babel Kazakh - KAZ IARPA program (test) | WER71 | 6 | |
| Automatic Speech Recognition | Babel Cebuano - CEB IARPA program (test) | WER70.6 | 6 | |
| Automatic Speech Recognition | Babel Tok Pisin - TOK IARPA program (test) | WER54.3 | 6 | |
| Automatic Speech Recognition | Babel Kurmanji - KUR IARPA program (test) | WER77.7 | 6 | |
| Motion Generation (Segment) | BABEL | FID3.072 | 5 | |
| Temporal Motion Generation | BABEL (test) | R Precision62 | 5 | |
| Motion Transition Generation (70 frames) | BABEL 2021 (test) | FID0.0008 | 5 | |
| Human Motion Prediction | BABEL (test) | Accuracy49.6 | 5 | |
| Future Motion Prediction | BABEL (test) | ADEw1.1 | 5 | |
| Action-driven human motion prediction | BABEL | Accuracy55.37 | 5 |