Share your thoughts, 1 month free Claude Pro on usSee more

Belief reasoning on EgoToM

72Accuracy

Humans

Updated 4mo ago

Evaluation Results

Method	Links
Humans 2026.03		72
Humans 2026.03		71
Gemini-2.5-Flash 2026.03		46.7
Video-Llama2-72B 2026.03		46
LLaVA-Next-Video-7B 2026.03		45.3
LLaVA-Next-Video-7B 2026.03		45.3
GPT-4-Turbo 2026.03		45
Qwen2.5-VL-7B 2026.03		42
Qwen2.5-VL-7B 2026.03		40.6
LLaVA-Next-Video-7B 2026.03		39.2
LLaVA-Next-Video-7B 2026.03		39.2
CogVLM2 2026.03		39
LLaVA-Next-Video-7B 2026.03		38.9
Qwen2.5-VL-7B 2026.03		36
Qwen2.5-VL-7B 2026.03		35.6
Qwen2.5-VL-7B 2026.03		35.6
Qwen2.5-VL-7B 2026.03		24.3
LLaVA-Next-Video-7B 2026.03		20.6
GPT-4o 2026.03		20.4