
Biomedical Question Answering on Four biomedical QA datasets macro-averaged (test)

Faithfulness: 85.3

Med42-Llama3-8B

Updated 4mo ago

Evaluation Results

Method	Links	Faithfulness	(unlabeled)	(unlabeled)
Med42-Llama3-8B 2026.01		85.3	6.3	6.8
Med-Qwen2-7B 2026.01		81.4	8	7.7
Meditron3-8B 2026.01		71.5	7.6	8.2
PMC-LLaMA-13B 2026.01		60.1	10.7	11.3