
Theory of Mind reasoning on MMToM-QA Text-only

1Belief Inference 1.1

SymbolicToM

[Chart: scores over time, Jan 16, 2024 to Jun 2, 2025; y-axis ticks 0.5424, 0.6612, 0.78, 0.8988]
Updated 22d ago

Evaluation Results

Date     Belief Inference 1.1  Col 2  Col 3  Col 4  Col 5  Col 6  Col 7  Col 8  Col 9  Col 10
2024.01  1                     0.61   0.74   0.783  0.733  0.667  0      0.507  0.477  0.63
2025.06  1                     0.61   0.74   0.783  0.733  0.667  0      0.507  0.477  0.63
2024.01  0.97                  0.12   0.77   0.62   0.48   0.427  0.027  0.427  0.34   0.48
2025.06  0.97                  0.12   0.77   0.62   0.48   0.427  0.027  0.427  0.34   0.48
2024.01  0.96                  0.958  0.813  0.91   0.858  0.767  0.65   0.683  0.74   0.825
2024.01  0.96                  0.15   0.82   0.643  0.613  0.44   0.027  0.547  0.407  0.525
2025.06  0.96                  0.958  0.813  0.91   0.858  0.767  0.65   0.683  0.74   0.825
2025.06  0.96                  0.15   0.82   0.643  0.613  0.44   0.027  0.547  0.407  0.525
2025.06  0.92                  0.502  0.724  0.715  0.683  0.44   0.45   0.482  0.514  0.615
2025.06  0.901                 0.705  0.874  0.827  0.688  0.755  0.753  0.718  0.729  0.778
2024.01  0.89                  0.68   0.9    0.823  0.547  0.667  0.507  0.627  0.587  0.705
2024.01  0.88                  0.69   0.88   0.817  0.773  0.68   0.307  0.707  0.617  0.717
2025.06  0.88                  0.69   0.88   0.817  0.773  0.68   0.307  0.707  0.617  0.717
2025.06  0.831                 0.476  0.623  0.643  0.64   0.386  0.387  0.46   0.468  0.556
2024.01  0.81                  0.11   0.39   0.437  0.467  0.16   0.213  0.48   0.33   0.383
2024.01  0.64                  0.55   0.5    0.563  0.493  0.48   0.413  0.387  0.443  0.503
2024.01  0.56                  0.53   0.38   0.49   0.52   0.507  0.507  0.56   0.523  0.507