Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Medical Question Answering on CV-MedExQA ambiguous (test)
Loading...
73.09
Accuracy
AU-Probe
44.8748
52.1999
59.525
66.8501
Jan 24, 2026
Accuracy
Improvement
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Improvement
AU-Probe
Model=Qwen2.5-7B, Clar...
2026.01
73.09
8.94
AU-Probe
Model=Bio-Medical-Llam...
2026.01
69.57
10.32
AU-Probe
Model=Llama-3.1-8B, Cl...
2026.01
68.3
7.55
ASK4CONF
Model=Qwen2.5-7B, Clar...
2026.01
65.85
-
ASK4CONF
Model=Bio-Medical-Llam...
2026.01
65.21
-
No Clarification
Model=Qwen2.5-7B, Clar...
2026.01
64.15
-
ASK4CONF
Model=Llama-3.1-8B, Cl...
2026.01
61.7
-
No Clarification
Model=Llama-3.1-8B, Cl...
2026.01
60.74
-
No Clarification
Model=Bio-Medical-Llam...
2026.01
59.26
-
AU-Probe
Model=BioMistral-7B, C...
2026.01
57.87
11.91
ASK4CONF
Model=BioMistral-7B, C...
2026.01
52.77
-
No Clarification
Model=BioMistral-7B, C...
2026.01
45.96
-
Feedback
Search any
task
Search any
task