Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Data Analysis on DAEval Verified
Loading...
92.82
Accuracy
Kimi K2 Instruct
48.8696
60.2798
71.69
83.1002
Jan 22, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Kimi K2 Instruct
Model Type=Open-sourced
2026.01
92.82
GPT-4o
Model Type=Proprietary
2026.01
92.26
Claude Sonnet 4.5
Model Type=Proprietary
2026.01
91.71
Claude Sonnet 4
Model Type=Proprietary
2026.01
90.91
Qwen3-Coder 480B
Model Type=Open-sourced
2026.01
90.61
GPT-5.1
Reasoning effort=high,...
2026.01
89.5
GPT-5
Reasoning effort=mediu...
2026.01
89.5
GPT-5.1
Reasoning effort=none,...
2026.01
87.85
Qwen3 235B Instruct
Model Type=Open-sourced
2026.01
85.08
GPT-OSS-120B
Model Type=Open-sourced
2026.01
84.53
Deepseek-v3.1
Model Type=Open-sourced
2026.01
82.32
Qwen3-4B-Instruct
Model Type=Open-sourced
2026.01
64.47
Qwen2.5-7B-Instruct
Model Type=Open-sourced
2026.01
50.56
Feedback
Search any
task
Search any
task