Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Prevention on MCQ
Loading...
99.3
gpt-5.1 Score
DOPE-v2
65.292
74.121
82.95
91.779
Jan 18, 2026
gpt-5.1 Score
gpt-4o Score
sonnet Score
opus Score
Updated 4d ago
Evaluation Results
Method
Method
Links
gpt-5.1 Score
gpt-4o Score
sonnet Score
opus Score
DOPE-v2
configuration=ICW + Fo...
2026.01
99.3
98.7
91
91
DOPE-v1
configuration=ICW + Du...
2026.01
96.3
88
90.3
90.3
TRAPDOC
2026.01
89.9
80.4
74.8
74.8
code-glyph
2026.01
84
84.6
76.1
76.1
ICW
2026.01
66.6
60.2
66.5
66.5
Feedback
Search any
task
Search any
task