Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

BioASQ

Benchmarks

Task NameDataset NameSOTA ResultTrend
Question AnsweringBioASQ
Accuracy98.32
72
Medical Question AnsweringBioASQ
Accuracy80.74
38
Hallucination DetectionBioASQ
AUROC81.13
28
Selective PredictionBioASQ
E-AURC0.2744
28
Question AnsweringBioASQ (dev)
F1 Score77.8
28
Biomedical reasoningBioASQ out-of-domain
Accuracy91.87
25
Domain AdaptationBioASQ (test)
BBH54.89
20
Biomedical Multi-hop Question AnsweringBioASQ-B
EM40.6
18
Extractive Question AnsweringBioASQ (test)
EM47.27
16
Snippet RetrievalBIOASQ 7 (test batches 1-5)
MAP0.2518
16
Document RetrievalBIOASQ 7 (test batches 1-5)
MAP19.24
16
Question AnsweringBioASQ MRQA out-of-domain evaluation 2019 (test)
EM60.3
15
Reading ComprehensionBioASQ MRQA out-of-domain
EM67.62
14
Question AnsweringBioASQ factoid 7b (test)
SAcc47.4
13
Extractive Question AnsweringBioASQ MRQA
F1 Score91
12
Biomedical Question AnsweringBioASQ
Factoid Acc29
11
Question AnsweringBioASQ
SAME_CONCLUSION Score85.71
10
RetrievalBioASQ (test)
Top-2046
9
Biomedical Question AnsweringBioASQ (test)
ROUGE54.8
8
Question AnsweringBioASQ MRQA Out-of-domain
F1 Score49.37
8
Document ClassificationBioASQ
Macro F171.28
8
Medical Question AnsweringBioASQ (test)
ROUGE-128.55
8
Question Answering RetrievalBioASQ
nDCG@1076.9
8
Generative Question AnsweringBioASQ (test)
EM43.01
8
Question AnsweringBioASQ Task B 14 2026 challenge edition (sampled 1000 factoid questions)
ECE0.071
7
Showing 25 of 55 rows