| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| In-the-wild model generalization | Human Bench Average | NSE Score 57.9 | 14 |
| In-the-wild model generalization | Human Bench Text-based Demo | NSE 23.4 | 14 |
| In-the-wild model generalization | Human Bench Vision-based Demo | NSE 15.5 | 14 |