| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multi-Entity Segmentation | BEHAVE (test) | mIoU88.63 | 25 | |
| Human-Object Interaction Generation | BEHAVE (test) | FID0 | 17 | |
| Vertex-level human-scene contact prediction | BEHAVE (test) | Precision36 | 9 | |
| 3D Human-Object Interaction Generation | BEHAVE (test) | FID0.093 | 9 | |
| 3D human and object reconstruction | BEHAVE | CD Human4.59 | 9 | |
| Human Mesh Recovery | BEHAVE (Protocol 2) | MPJPE32.6 | 8 | |
| Human Mesh Recovery | BEHAVE (Protocol 1) | MPJPE48.9 | 8 | |
| Contact Estimation | BEHAVE (unseen) | Precision75.4 | 8 | |
| Joint Human and Object Reconstruction | BEHAVE (test) | CD (SMPL) (cm)5.241 | 8 | |
| 3D Human Reconstruction | BEHAVE | SMPL v2v Error (cm)4.99 | 8 | |
| 4D human-object interaction reconstruction | BEHAVE (test) | Chamfer Distance (Human)7.25 | 7 | |
| Human Mesh Recovery | BEHAVE | PA-MPJPE22.7 | 7 | |
| Joint Human-Object Tracking | BEHAVE extended (key frames) | SMPL Chamfer Distance5.24 | 6 | |
| 3D Object Reconstruction | BEHAVE (test) | Chamfer Distance (cm)4.66 | 6 | |
| 6-DoF Object Tracking | BEHAVE (test) | ADD-S25.71 | 6 | |
| Novel View Synthesis | BEHAVE Novel View | PSNR24.12 | 5 | |
| Video depth estimation | BEHAVE | Abs Rel0.033 | 5 | |
| HOI Video Generation | BEHAVE (test) | CLIPSIM0.3138 | 5 | |
| Human-Object Interaction Reconstruction | BEHAVE | Chamfer Distance6.295 | 5 | |
| Monocular Dynamic Scene Reconstruction | BEHAVE Average | PSNR31.51 | 4 | |
| Monocular Dynamic Scene Reconstruction | BEHAVE Trashbin_6 | PSNR31.62 | 4 | |
| Monocular Dynamic Scene Reconstruction | BEHAVE Backpack_6 | PSNR29.05 | 4 | |
| Monocular Dynamic Scene Reconstruction | BEHAVE Plasticcontainer_3 | PSNR29.38 | 4 | |
| Monocular Dynamic Scene Reconstruction | BEHAVE Backpack_3 | PSNR30.17 | 4 | |
| Monocular Dynamic Scene Reconstruction | BEHAVE Suitcase_2 | PSNR34.58 | 4 |