Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Theory of Mind Question Answering on MMToM-QA
Loading...
88.3
Accuracy
PDDL-MIND
53.98
62.89
71.8
80.71
Apr 20, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
PDDL-MIND
Evaluator LLM=GPT-4o
2026.04
88.3
AutoToM
Evaluator LLM=GPT-4o
2026.04
83
Human
Evaluator LLM=GPT-4o
2026.04
82.5
TT
Evaluator LLM=GPT-4o
2026.04
69
BIP-ALM
Evaluator LLM=GPT-4o
2026.04
56.2
LIMP
Evaluator LLM=GPT-4o
2026.04
55.3
Feedback
Search any
task
Search any
task