Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Question Answering on PubMedQA Zero-Context Setting (test)
Loading...
64.77
Accuracy
Llama 3.1 8B Instruct
57.8644
59.6572
61.45
63.2428
Apr 2, 2026
Accuracy
F1 Score
Updated 16d ago
Evaluation Results
Method
Method
Links
Accuracy
F1 Score
Llama 3.1 8B Instruct
Model=Llama 3.1, Param...
2026.04
64.77
59.74
Llama 3.2 3B Instruct
Model=Llama 3.2, Param...
2026.04
64.41
58.97
Full FT 5ep 16k context
Fine-Tuning=Full, Epoc...
2026.04
60.62
57.24
LoRA FT 2ep 10k context
Fine-Tuning=LoRA, Epoc...
2026.04
58.13
57.23
Feedback
Search any
task
Search any
task