Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Medical Search Relevance Assessment on LocalQSMed HARD v1.0
Loading...
0.6611
Accuracy
Qwen3-32B
0.046252
0.205876
0.3655
0.525124
Dec 3, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen3-32B
Loss Rate=0.00%
2025.12
0.6611
Qwen3-14B
Loss Rate=0.02%
2025.12
0.6106
DeepSeek-R1-Distill-Qwen-14B
Loss Rate=0.00%
2025.12
0.5758
Qwen3-8B
Loss Rate=0.00%
2025.12
0.549
Qwen3-4B
Loss Rate=0.00%
2025.12
0.5294
Llama-3.3-70B-Instruct
Loss Rate=46.86%
2025.12
0.3737
DeepSeek-R1-Distill-Llama-70B
Loss Rate=43.68%
2025.12
0.3598
Llama-3.1-8B
Loss Rate=63.55%
2025.12
0.3504
Llama-3.2-3B
Loss Rate=0.00%
2025.12
0.3358
Llama-3.2-1B
Loss Rate=63.67%
2025.12
0.2913
Qwen3-0.6B
Loss Rate=6.40%
2025.12
0.2784
DeepSeek-R1-Distill-Qwen-7B
Loss Rate=17.59%
2025.12
0.0699
Feedback
Search any
task
Search any
task