Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Long-form QA on PubMedQA (test)

37.49ROUGE-1

Fine-Tuned GPT-4o + MedBioRAG

25.249228.427131.60534.7829Dec 10, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
37.4914.7827.896.1137.02-3.89
2025.12
35.8213.5526.094.3435.33-9.23
2025.12
26.399.5517.472.7318.1-7.86
2025.12
25.729.0217.052.4817.04-9.04