Biomedicine on Biomedicine
[Chart: Score and Performance Difference (Δ) by date. Top entry: GPT-5, Score 55.8, Jan 22, 2026.]
Updated 4d ago
Evaluation Results
| Method | Setting | Links | Score | Performance Difference (Δ) |
|---|---|---|---|---|
| GPT-5 | Generation Mode=Standa... | 2026.01 | 55.8 | - |
| GPT-5 | Generation Mode=LLM-in... | 2026.01 | 49 | -6.8 |
| DeepSeek-V3.2-Thinking | Generation Mode=LLM-in... | 2026.01 | 41.6 | 2.8 |
| Kimi-K2-Thinking | Generation Mode=Standa... | 2026.01 | 40.4 | - |
| DeepSeek-V3.2-Thinking | Generation Mode=Standa... | 2026.01 | 38.8 | - |
| Claude-Sonnet-4.5-Think | Generation Mode=LLM-in... | 2026.01 | 38 | 1 |
| Claude-Sonnet-4.5-Think | Generation Mode=Standa... | 2026.01 | 37 | - |
| Kimi-K2-Thinking | Generation Mode=LLM-in... | 2026.01 | 35.4 | -5 |
| MiniMax-M2 | Generation Mode=LLM-in... | 2026.01 | 28.2 | 2 |
| MiniMax-M2 | Generation Mode=Standa... | 2026.01 | 26.2 | - |
| Qwen3-Coder-30B-A3B | Generation Mode=LLM-in... | 2026.01 | 18.2 | 3.8 |
| Qwen3-Coder-30B-A3B | Generation Mode=Standa... | 2026.01 | 14.4 | - |
| Qwen3-4B-Instruct-2507 | Generation Mode=Standa... | 2026.01 | 10.2 | - |
| Qwen3-4B-Instruct-2507 | Generation Mode=LLM-in... | 2026.01 | 9.4 | -0.8 |
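
The page does not define the Performance Difference (Δ) column. The listed values are, however, consistent with one reading: for each model, Δ equals the "LLM-in..." score minus the "Standa..." score. The sketch below checks that arithmetic under this assumption. The `SCORES` dictionary and the `performance_difference` helper are illustrative names, not part of the leaderboard, and the mode labels are kept truncated exactly as shown.

```python
# Scores copied from the Evaluation Results rows. Mode labels are left
# truncated ("Standa...", "LLM-in...") because the page elides them.
SCORES = {
    "GPT-5": {"Standa...": 55.8, "LLM-in...": 49.0},
    "DeepSeek-V3.2-Thinking": {"Standa...": 38.8, "LLM-in...": 41.6},
    "Kimi-K2-Thinking": {"Standa...": 40.4, "LLM-in...": 35.4},
    "Claude-Sonnet-4.5-Think": {"Standa...": 37.0, "LLM-in...": 38.0},
    "MiniMax-M2": {"Standa...": 26.2, "LLM-in...": 28.2},
    "Qwen3-Coder-30B-A3B": {"Standa...": 14.4, "LLM-in...": 18.2},
    "Qwen3-4B-Instruct-2507": {"Standa...": 10.2, "LLM-in...": 9.4},
}


def performance_difference(model: str) -> float:
    """Assumed definition of Δ: "LLM-in..." score minus "Standa..." score.

    The result is rounded to one decimal place to match the table and to
    absorb floating-point noise from the subtraction.
    """
    modes = SCORES[model]
    return round(modes["LLM-in..."] - modes["Standa..."], 1)


if __name__ == "__main__":
    for model in SCORES:
        print(f"{model}: {performance_difference(model):+.1f}")
```

Under this reading, a negative Δ (GPT-5, Kimi-K2-Thinking, Qwen3-4B-Instruct-2507) means the model scored lower in the "LLM-in..." mode than in the "Standa..." mode.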