Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Medbullets

Benchmarks

Task NameDataset NameSOTA ResultTrend
Medical Question AnsweringMedBullets
Accuracy84.2
65
Medical Question AnsweringMedBullets (test)
Accuracy82.79
18
Medical ReasoningMedBullets
Accuracy80.8
13
Medical ReasoningMedBullets
Token Cost (tokens/question)2,391
11
Medical Question AnsweringMedbullets op5
Accuracy53.9
8
Medical ReasoningMedbullets OOD (out-of-distribution)
Accuracy63.6
7
Medical ReasoningMedBullets 5-option multiple choice
Accuracy53.9
7
Medical ReasoningMedBullets 4-option multiple choice
Accuracy62.3
7
Question AnsweringMedbullets 5
Accuracy65.3
7
Medical Knowledge EvaluationMedbullets
Accuracy58.44
5
Medical Question AnsweringMedbullets op5
Pass@194.16
4
Medical Question AnsweringMedbullets op4
Pass@195.78
4
Showing 12 of 12 rows