
Biomedical QA datasets

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Biomedical Question Answering | Four biomedical QA datasets macro-averaged (test) | Faithfulness: 85.3 | 4 |