Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Pairwise Classification on PMR-Reddit Easy
Loading...
98
Accuracy
MedGemma-27BSFT
78.24
83.37
88.5
93.63
Jan 19, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
MedGemma-27BSFT
Size=27B, Method=Urgen...
2026.01
98
Qwen3-32BSFT
Size=32B, Method=Urgen...
2026.01
96
GPT-OSS
Size=120B, Type=Deep R...
2026.01
93
Reward-8Burgent
Size=8B, Method=Urgent...
2026.01
93
Reward-4Burgent
Size=4B, Method=Urgent...
2026.01
91
Qwen3-8BSFT
Size=8B, Method=UrgentSFT
2026.01
90
Qwen3-32B
Size=32B, Type=Instruct
2026.01
89
MedGemma-27b
Size=27B, Type=Instruct
2026.01
89
Qwen3-4B
Size=4B, Type=Instruct
2026.01
85
Qwen3-8B
Size=8B, Type=Instruct
2026.01
85
Qwen3-4BSFT
Size=4B, Method=UrgentSFT
2026.01
84
Qwen3-32B-R
Size=32B, Type=Deep Re...
2026.01
81
Reward-8Bbase
Size=8B, Type=Base Reward
2026.01
80
Reward-4Bbase
Size=4B, Type=Base Reward
2026.01
79
Feedback
Search any
task
Search any
task