Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Theory of Mind reasoning on MMToM-QA Multimodal

95.8Belief Inference 1.1

Human

33.60849.75465.982.046Jan 16, 2024Apr 8, 2024Jul 1, 2024Sep 23, 2024Dec 16, 2024Mar 10, 2025Jun 2, 2025
Updated 22d ago

Evaluation Results

MethodLinks
2024.01
95.896.710097.59091.783.388.988.593
2025.06
95.896.710097.59091.783.388.988.593
2024.01
94135955.35626.745234.744
2025.06
94135955.35626.745234.744
2025.06
92.1769387.173.48075.578.776.981.3
2024.01
90698681.76878.75673.36975.3
2025.06
90698681.76878.75673.36975.3
2024.01
88688580.362.777.3728073.376.7
2025.06
88688580.362.777.3728073.376.7
2024.01
62523248.746.729.342.76044.746.7
2024.01
4614694365.322.740484443.5
2025.06
4614694365.322.740484443.5
2024.01
363852423641.330.745.338.340.2
2025.06
363852423641.330.745.338.340.2