Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Behavioral Prediction on Woodward-like Behavioral Prediction Animate
Loading...
85
Accuracy
Qwen 3.5 Plus
33
46.5
60
73.5
Mar 5, 2025
Accuracy
Updated 2mo ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen 3.5 Plus
2025.03
85
GPT-5.2
2025.03
84
Humans
2025.03
75
Gemini 3.1 Pro
2025.03
63
Claude 3.5 Sonnet
2025.03
53
Qwen VL Max
2025.03
48
Gemini 3.1 Flash
2025.03
41
Claude Opus 4.6
2025.03
37
GPT-4o
2025.03
35
Feedback
Search any
task
Search any
task