Multiple Choice QA on PubMedQA (test)
Top result: 81.8 AUROC (Total variance, experts), Apr 2, 2026
[Chart: AUROC over time]
Updated 16d ago
Evaluation Results
| Method | Uses experts | Date | AUROC |
|---|---|---|---|
| Total variance, experts | ✓ (inference) | 2026.04 | 81.8 |
| Total variance, CAE | ✓ (training) | 2026.04 | 73.2 |
| Total variance, base | | 2026.04 | 70.9 |
| Entropy and MI | | 2026.04 | 70.7 |
| Prediction variance | | 2026.04 | 70.2 |
| Single model | | 2026.04 | 69.3 |
| MC Dropout | | 2026.04 | 67.1 |
| Test-time augmentations | | 2026.04 | 67 |
| HUQ | | 2026.04 | 66.2 |