Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Theory of Mind Question Answering on MuMa-ToM
Loading...
93.5
Accuracy
Human
31.516
47.608
63.7
79.792
Apr 20, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Human
Evaluator LLM=GPT-4o
2026.04
93.5
PDDL-MIND
Evaluator LLM=GPT-4o
2026.04
88.8
AutoToM
Evaluator LLM=GPT-4o
2026.04
81.4
LIMP
Evaluator LLM=GPT-4o
2026.04
76.6
BIP-ALM
Evaluator LLM=GPT-4o
2026.04
33.9
Feedback
Search any
task
Search any
task