Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Video Reasoning on AoT Bench
Loading...
68.9
Accuracy
OmniJigsaw (CMM)
46.488
52.3065
58.125
63.9435
Apr 9, 2026
Accuracy
Updated 8d ago
Evaluation Results
Method
Method
Links
Accuracy
OmniJigsaw (CMM)
Inference Mode=w/ Audio
2026.04
68.9
OmniJigsaw (SMS)
Inference Mode=w/ Audio
2026.04
68.12
VideoJigsaw
Inference Mode=w/ Audio
2026.04
67.45
OmniJigsaw (JMI)
Inference Mode=w/ Audio
2026.04
66.83
OmniJigsaw (CMM)
Inference Mode=w/o Audio
2026.04
66.83
OmniJigsaw (SMS)
Inference Mode=w/o Audio
2026.04
66.39
VideoJigsaw
Inference Mode=w/o Audio
2026.04
66.22
OmniJigsaw (JMI)
Inference Mode=w/o Audio
2026.04
65.83
Qwen3-Omni-30B
Inference Mode=w/ Audio
2026.04
64.88
Qwen3-Omni-30B
Inference Mode=w/o Audio
2026.04
63.32
Video-R1
Inference Mode=w/o Audio
2026.04
52.6
Omni-R1
Inference Mode=w/ Audio
2026.04
52.09
Omni-R1
Inference Mode=w/o Audio
2026.04
51.03
HumanOmniV2
Inference Mode=w/ Audio
2026.04
48.58
HumanOmniV2
Inference Mode=w/o Audio
2026.04
47.35
Feedback
Search any
task
Search any
task