PubMedQA

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Question Answering | PubMedQA | Accuracy: 83.6 | 145 |
| Question Answering | PubMedQA (test) | Accuracy: 81.8 | 81 |
| Medical Question Answering | PubMedQA | Accuracy: 81.4 | 45 |
| Question Answering | PubMedQA PQA-L (test) | Accuracy: 78.2 | 25 |
| Question Answering | PubMedQA | EM: 79.82 | 18 |
| Prompt Leakage Attack | PubMedQA | ASR (500): 14 | 16 |
| Question Answering | PubMedQA | Context Influence: 115.78 | 15 |
| Question Answering | PubMedQA (out-of-domain) | ROUGE-L: 11.7 | 14 |
| Biomedical Question Answering | PubMedQA PQA-L In-Domain (test) | Accuracy: 78 | 11 |
| Close-ended QA | PubMedQA | Accuracy: 85 | 10 |
| Medical Question Answering | PubMedQA Reasoning Required | Accuracy: 82 | 10 |
| Question Answering | PubMedQA | Accuracy: 78.6 | 9 |
| Evaluating Context Influence and Input Regurgitation | PubMedQA | Context Influence Score I(D; y_tilde): 97.95 | 9 |
| Retrieval-Augmented Generation | PubMedQA | Accuracy: 77.9 | 8 |
| Medical Question Answering | PubMedQA Synthetic NIID 1.0 (test) | Accuracy: 75.1 | 7 |
| Medical Question Answering | PubMedQA Synthetic IID 1.0 (test) | Accuracy: 75.1 | 7 |
| Pre-training data contamination detection | PubMedQA (PMQA) (test) | AUC: 0.54 | 7 |
| Question Answering | PubMedQA | Acc: 66.4 | 6 |
| Question Answering | PubMedQA | BLEU-1: 9.7 | 6 |
| Question Answering | PubmedQA | F1: 43.21 | 5 |
| Question Answering | PubMedQA English (test) | Accuracy: 74.64 | 5 |
| Question Answering | PubMedQA official (val) | F1 Score: 93.33 | 4 |
| Medical Question Answering | PubMedQA | Pass@1: 86 | 4 |
| Long-form QA | PubMedQA (test) | ROUGE-1: 37.49 | 4 |