| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| HiToM | UserHarness | Accuracy87.08 | 64 | 6d ago | |
| ToMi | UserHarness | Accuracy100 | 55 | 6d ago | |
| BigToM | UserHarness | Accuracy98.67 | 48 | 6d ago | |
| TomBench OOD | GPT-4o | Emotion75.24 | 17 | 3mo ago | |
| FANToM | DITTO | Accuracy95 | 14 | 13d ago | |
| ToMBench | Accuracy81.8 | 9 | 3mo ago | ||
| ToMATO | Accuracy82.2 | 9 | 3mo ago | ||
| TOM-SB | ADA - ToM Only | ToM Accuracy (Trajectory)72 | 3 | 1mo ago |