Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Biomedical Question Answering on PubMedQA
Loading...
0.6
Baseline Accuracy
ThinkSwitch
0.426667
0.471667
0.516667
0.561667
May 31, 2026
Baseline Accuracy
Best Accuracy
Final Accuracy
Net Gain
Updated 1d ago
Evaluation Results
Method
Method
Links
Baseline Accuracy
Best Accuracy
Final Accuracy
Net Gain
ThinkSwitch
Model Variant=thinking
2026.05
0.6
0.8333
0.8333
0.2333
ThinkSwitch
Model Variant=instruct
2026.05
0.4333
0.6333
0.6
0.1667
Feedback
Search any
task
Search any
task