Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Question Answering on HealthQA
Loading...
79.24
Accuracy
GPT4o
54.332
60.7985
67.265
73.7315
May 17, 2026
Accuracy
Updated 15d ago
Evaluation Results
Method
Method
Links
Accuracy
GPT4o
Backbone Architecture=...
2026.05
79.24
AMATA 8B
Model Scale=8B, Backbo...
2026.05
78.74
GiGPO 8B
Model Scale=8B, Backbo...
2026.05
77.2
SPA-RL 8B
Model Scale=8B, Backbo...
2026.05
76.88
SMART 8B
Model Scale=8B, Backbo...
2026.05
75.99
SelfRag 8B
Model Scale=8B, Backbo...
2026.05
70.89
Llama-3-Ins.8B
Model Scale=8B, Backbo...
2026.05
64.39
RADIT 8B
Model Scale=8B, Backbo...
2026.05
55.29
Feedback
Search any
task
Search any
task