| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| DBE-KT (test) | QG-SMS | AA66.66 | 10 | 4d ago | |
| EduAgent (test) | QG-SMS | Accuracy66.39 | 10 | 4d ago | |
| PsyCLIENT-CP +content | Accuracy (A)61.9 | 6 | 3d ago | ||
| PsyCLIENT-CP | DeepSeek-R1 | Accuracy (A)16.4 | 6 | 3d ago | |
| PsyCLIENT-CP +behavior | DeepSeek-R1 | Accuracy (A)26.1 | 6 | 3d ago | |
| PsyCLIENT-CP vanilla | Accuracy (A)83.1 | 6 | 3d ago | ||
| PsyCLIENT-CP Human Client | Accuracy (A)100 | 6 | 3d ago | ||
| Photorealistic discrimination dataset (test) | CLIP | Realistic Acc93.9 | 6 | 4d ago | |
| Discrimination task 256 shape-color combinations (test) | DINOv2 | Test Accuracy99.5 | 6 | 4d ago | |
| Discrimination task 256 shape-color combinations (train) | CLIP | Train Accuracy100 | 6 | 4d ago |