
Medical Question Answering on MedQA US (4-option)

Accuracy: 90.2

GPT-4 (Medprompt)

[Chart: accuracy over time, Nov 28, 2023 to Oct 25, 2024]
Updated 2mo ago

Evaluation Results

Date      Accuracy
2023.11   90.2
2024.10   89
2024.10   89
2023.11   86.5
2023.11   81.4
2024.10   81
2024.10   78
2023.11   67.6