Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Video Question Answering on TempCompass MC
Loading...
77.9
Accuracy
GPT-5.2
63.548
67.274
71
74.726
May 30, 2026
Accuracy
Speed
Order Score
Attribute Change Score
Updated 1d ago
Evaluation Results
Method
Method
Links
Accuracy
Speed
Order Score
Attribute Change Score
GPT-5.2
2026.05
77.9
73.5
74.8
84
Qwen3-VL-235B
Model Scale=235B
2026.05
76.1
62.5
82.8
83.3
Ours FT (4B)
Fine-tuning=Yes, Model...
2026.05
72
55.5
79.5
79.9
Baseline (Qwen3-VL-4B)
Fine-tuning=No, Model...
2026.05
70.1
49.5
74.8
78.1
GPT-4o
2026.05
64.1
46.1
55.3
76
Feedback
Search any
task
Search any
task