Chemistry on Chemistry tasks
[Chart: Score over time. Highlighted point: Claude-Sonnet-4.5-Think, Score 84.4, Jan 22, 2026. Series: Score, Performance Difference (Δ).]
Updated 4d ago
Evaluation Results
Method                    Generation Mode   Date      Score   Performance Difference (Δ)
Claude-Sonnet-4.5-Think   LLM-in...         2026.01   84.4    1.1
Claude-Sonnet-4.5-Think   Standa...         2026.01   83.3    -
GPT-5                     LLM-in...         2026.01   81.6    0.5
GPT-5                     Standa...         2026.01   81.1    -
DeepSeek-V3.2-Thinking    LLM-in...         2026.01   77.8    1.1
Kimi-K2-Thinking          LLM-in...         2026.01   77.6    3.2
DeepSeek-V3.2-Thinking    Standa...         2026.01   76.7    -
Kimi-K2-Thinking          Standa...         2026.01   74.4    -
MiniMax-M2                LLM-in...         2026.01   68.4    14.4
Qwen3-4B-Instruct-2507    Standa...         2026.01   56.4    -
MiniMax-M2                Standa...         2026.01   54      -
Qwen3-Coder-30B-A3B       LLM-in...         2026.01   54      5.3
Qwen3-4B-Instruct-2507    LLM-in...         2026.01   49.3    -7.1
Qwen3-Coder-30B-A3B       Standa...         2026.01   48.7    -
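
The Performance Difference (Δ) values in the results above appear to equal each model's score in the "LLM-in..." generation mode minus its score in the "Standa..." mode. The page does not state this definition, so treat it as an inference from the numbers. A minimal Python sketch that recomputes Δ under that assumption follows; the dictionary keys `llm_in` and `standard` are placeholder labels, because the real mode names are truncated in the source.

```python
# Scores copied from the results above. The mode names are truncated in the
# source ("LLM-in...", "Standa..."), so "llm_in" and "standard" are
# placeholder keys chosen for this sketch, not the real mode names.
scores = {
    "Claude-Sonnet-4.5-Think": {"llm_in": 84.4, "standard": 83.3},
    "GPT-5": {"llm_in": 81.6, "standard": 81.1},
    "DeepSeek-V3.2-Thinking": {"llm_in": 77.8, "standard": 76.7},
    "Kimi-K2-Thinking": {"llm_in": 77.6, "standard": 74.4},
    "MiniMax-M2": {"llm_in": 68.4, "standard": 54.0},
    "Qwen3-Coder-30B-A3B": {"llm_in": 54.0, "standard": 48.7},
    "Qwen3-4B-Instruct-2507": {"llm_in": 49.3, "standard": 56.4},
}


def performance_difference(llm_in: float, standard: float) -> float:
    """Assumed definition: score in the 'LLM-in...' mode minus the 'Standa...' score."""
    # Round to one decimal to match the precision shown on the page and to
    # hide binary floating-point noise from the subtraction.
    return round(llm_in - standard, 1)


deltas = {
    model: performance_difference(s["llm_in"], s["standard"])
    for model, s in scores.items()
}

# Print models from largest gain to largest loss.
for model, delta in sorted(deltas.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{model}: {delta:+.1f}")
```

Under this assumption the recomputed values match all seven Δ entries listed above, including the single negative one for Qwen3-4B-Instruct-2507.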