Prevention on T/F
[Chart: gpt-5.1 Score by date (labeled point: DOPE-v2, Jan 18, 2026). Metrics listed: gpt-5.1 Score, gpt-4o Score, sonnet Score, opus Score.]
Updated 4d ago
Evaluation Results
Method      Configuration              Date     gpt-5.1 Score  gpt-4o Score  sonnet Score  opus Score
DOPE-v2     configuration=ICW + Fo...  2026.01  100            96.7          90.2          90.2
DOPE-v1     configuration=ICW + Du...  2026.01  96.7           89.3          89.8          89.8
code-glyph  -                          2026.01  86.4           80.8          80.5          80.5
TRAPDOC     -                          2026.01  83.3           86.6          81.4          81.4
ICW         -                          2026.01  67.8           62            60.5          60.5