| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Visual Question Answering | HD-EPIC | Recipe Accuracy65 | 10 | |
| Audio-to-Text Retrieval | HD-EPIC | mAP32.2 | 8 | |
| Text-to-Audio Retrieval | HD-EPIC | mAP10.7 | 8 | |
| Parallel task execution prediction | HD-EPIC 2-Body Problem | Coverage1 | 8 | |
| Motion Generation | HD-EPIC curated P&R sequences | Prime Success66.31 | 8 | |
| Parallel task execution prediction | HD-EPIC 3-Body Problem | Coverage85.9 | 1 |