Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Long Video Reasoning on LongVideoReason (eval)
Loading...
80.3
Accuracy
VideoZoomer
58.46
64.13
69.8
75.47
Oct 23, 2025
Nov 2, 2025
Nov 13, 2025
Nov 24, 2025
Dec 4, 2025
Dec 15, 2025
Dec 26, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
VideoZoomer
Size=7B, Evaluation Pr...
2025.12
80.3
Video-R1
Size=7B, Evaluation Pr...
2025.12
72.8
Qwen2.5-VL
Size=7B, Evaluation Pr...
2025.12
70.8
VideoRFT-7B
Model=VideoRFT-7B
2025.10
69.4
Open-o3-Video-7B (Ours)
Model=Open-o3-Video-7B...
2025.10
69.4
VideoR1-7B
Model=VideoR1-7B
2025.10
68.9
LongVILA-R1
Size=7B, Evaluation Pr...
2025.12
67.9
Gemini-1.5-Pro
Size=-, Evaluation Pro...
2025.12
67.3
InternVL-2.5-8B
Model=InternVL-2.5-8B
2025.10
62
GPT-4o
Size=-, Evaluation Pro...
2025.12
60.7
VideoLLaMA3-7B
Model=VideoLLaMA3-7B
2025.10
59.8
Qwen2.5-VL-7B (Base)
Model=Qwen2.5-VL-7B (B...
2025.10
59.3
Feedback
Search any
task
Search any
task