| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Visual Reasoning | VPCT Ball Drop Maze and Counting | VPCT Accuracy96 | 5 | |
| VPCT | VPCT | Human Score4.56 | 5 | |
| Annotation Quality | VPCT Ball Drop and Maze Navigation | VPCT Score3.12 | 5 | |
| Annotation-text Alignment | VPCT Ball Drop Maze Navigation | VPCT Score100 | 5 |