Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Data Analysis on DABStep easy
Loading...
83.33
Accuracy
Claude Sonnet 4.5
45.7756
55.5253
65.275
75.0247
Jan 22, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Claude Sonnet 4.5
Model Type=Proprietary
2026.01
83.33
Claude Sonnet 4
Model Type=Proprietary
2026.01
81.94
Kimi K2 Instruct
Model Type=Open-sourced
2026.01
77.78
Deepseek-v3.1
Model Type=Open-sourced
2026.01
76.39
GPT-5
Reasoning effort=mediu...
2026.01
75
Qwen3-Coder 480B
Model Type=Open-sourced
2026.01
75
GPT-5.1
Reasoning effort=high,...
2026.01
73.61
GPT-4o
Model Type=Proprietary
2026.01
73.61
Qwen3 235B Instruct
Model Type=Open-sourced
2026.01
73.61
GPT-5.1
Reasoning effort=none,...
2026.01
70.83
GPT-OSS-120B
Model Type=Open-sourced
2026.01
70.83
Qwen3-4B-Instruct
Model Type=Open-sourced
2026.01
58.33
Qwen2.5-7B-Instruct
Model Type=Open-sourced
2026.01
47.22
Feedback
Search any
task
Search any
task