Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Medical Question Answering on MIMIC-III (test)
Loading...
67.05
CUS Score
GPT-4 (Baseline)
21.3628
33.2239
45.085
56.9461
Nov 20, 2025
CUS Score
ZTI Score
Updated 18d ago
Evaluation Results
Method
Method
Links
CUS Score
ZTI Score
GPT-4 (Baseline)
Train Dataset=MedQA, I...
2025.11
67.05
44.5
GPT-4 + MedBayes-Lite
Train Dataset=MedQA, I...
2025.11
23.12
80.15
Feedback
Search any
task
Search any
task