Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Uncertainty Quantification on MedMCQA (test)
Loading...
71.7
AUROC
SAR
64.42
66.31
68.2
70.09
Jul 3, 2023
AUROC
Updated 4d ago
Evaluation Results
Method
Method
Links
AUROC
SAR
Backbone=Vicuna-13b
2023.07
71.7
SAR
Backbone=LLaMA-2-13b-chat
2023.07
70.2
SE
Backbone=Vicuna-13b
2023.07
68.5
SE
Backbone=LLaMA-2-13b-chat
2023.07
65.5
LN-PE
Backbone=Vicuna-13b
2023.07
64.9
LN-PE
Backbone=LLaMA-2-13b-chat
2023.07
64.7
Feedback
Search any
task
Search any
task