Share your thoughts, 1 month free Claude Pro on usSee more

Biomedical Question Answering on PubMedQA PQA-L In-Domain (test)

78Accuracy

Human (expert)

Updated 4mo ago

Evaluation Results

Method	Links
Human (expert) 2023.08		78
BioMedGPT-10B 2023.08		76.1
Llama2-Chat 2023.08		75.5
Llama 2023.08		73.4
InstructGPT 2023.08		73.2
PMC-Llama 2023.08		69.5
ChatGPT 2023.08		63.9
Human (pass) 2023.08		60
Llama2-Chat 2023.08		21.9
Llama 2023.08		5.2
Llama2 2023.08		3.7