Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Medical Question Answering on PubMedQA Reasoning Required
Loading...
82
Accuracy
GPT-4 (Medprompt)
55.0848
62.0724
69.06
76.0476
Nov 28, 2023
Apr 10, 2024
Aug 22, 2024
Jan 3, 2025
May 17, 2025
Sep 28, 2025
Feb 9, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
GPT-4 (Medprompt)
Prompting Strategy=Med...
2023.11
82
Med-PaLM 2
Strategy Selection=cho...
2023.11
81.8
Flan-PaLM 540B
Strategy Selection=cho...
2023.11
79
GPT-4
Prompting Strategy=5-s...
2023.11
75.2
TEXTRESNET
mode=deep residual tuning
2026.02
60.31
DSPy
optimizer=MIPRO
2026.02
60.26
HBC
mode=hierarchical imit...
2026.02
58.8
CoT
mode=unoptimized lower...
2026.02
57.34
TextGrad
summarization=false
2026.02
56.96
TextGrad
summarization=true
2026.02
56.12
Feedback
Search any
task
Search any
task