Video Reasoning on MMMU Video

84.6 Accuracy

GPT-5-thinking

Updated 4mo ago

Evaluation Results

Method	Links	Accuracy
GPT-5-thinking 2025.11		84.6
Gemini 2.5 Pro 2025.11		83.6
OpenAI o3 2025.11		83.3
Seed 1.5VL 2025.11		81.4
Qwen3-VL-235B-Thinking 2025.11		80.0
VideoChat-M1 2025.11		80.0
Qwen3-VL-235B-Instruct 2025.11		74.7
Qwen3-VL-8B-Instruct 2025.11		65.3
GPT-4o 2025.11		61.2
Gemini 1.5 Pro 2025.11		53.9
VideoChat-R1.5-7B 2025.11		51.4
Aria-28B 2025.11		50.8
LLaVA-Video-72B 2025.11		49.7
LLaVA-ov-72B 2025.11		48.3
LLaVA-Video-7B 2025.11		36.1
LongVA-7B 2025.11		24.0