| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Action Segmentation | Breakfast | F1@10 89.9 | 107 | |
| Temporal Action Segmentation | Breakfast | Accuracy 75.9 | 96 | |
| Action Segmentation | Breakfast | MoF 65.1 | 66 | |
| Action Anticipation | Breakfast | MoC Accuracy 32.27 | 64 | |
| Action segmentation | Breakfast (test) | MoF 60.6 | 31 | |
| Action Recognition | Breakfast | Top-1 Accuracy 97.9 | 28 | |
| Dense anticipation mean over classes | Breakfast (test) | Mean Error @ 10% Horizon 12.8 | 28 | |
| Action Segmentation | Breakfast 14 | MoF 70.2 | 26 | |
| Single-label activity classification | Breakfast | Accuracy 90.27 | 21 | |
| Temporal Action Segmentation | Breakfast 40 | F1@10 82.8 | 19 | |
| Action Alignment | Breakfast | IoD 66.2 | 18 | |
| Action Segmentation | Breakfast 10 tasks (test) | Acc 51.7 | 16 | |
| Human activity recognition | Breakfast | Accuracy 90.7 | 14 | |
| Temporal Video Segmentation | Breakfast | MoF 0.522 | 14 | |
| Long-form Video Classification | Breakfast | Top-1 Accuracy 95.2 | 14 | |
| Video Question Answering | Breakfast | Accuracy 98.5 | 13 | |
| Action Recognition | Breakfast (1357:335) | Accuracy 94.9 | 13 | |
| Video Understanding | Breakfast | Top-1 Acc 97.4 | 12 | |
| Next Action Anticipation | Breakfast (test) | Accuracy 64.7 | 11 | |
| Unsupervised Temporal Action Segmentation | Breakfast | MoF 56.1 | 10 | |
| Video Action Recognition | Breakfast | Top-1 Accuracy 76.6 | 10 | |
| Long-term Action Anticipation | Breakfast (test) | MoC (alpha=0.2, beta=0.1) 28.25 | 9 | |
| Action Alignment | Breakfast (test) | MoF 63 | 9 | |
| Action Segmentation | Breakfast (avg) | MoF 50.2 | 9 | |
| Action Segmentation | Breakfast 5 tasks (test) | Accuracy 65.1 | 8 | |