Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Yes/No Question Answering on BioASQ Task B
Loading...
93.51
Accuracy
ParallaxRAG + Qwen3-Plus
89.9844
90.8997
91.815
92.7303
Oct 17, 2025
Accuracy
F1 (Yes)
F1 (No)
Macro F1
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
F1 (Yes)
F1 (No)
Macro F1
ParallaxRAG + Qwen3-Plus
Generator=Qwen3-Plus,...
2025.10
93.51
95.24
89.8
92.52
GPT-4o
Generator=GPT-4o, Retr...
2025.10
91.4
93.47
86.7
90.09
GPT4-Turbo
Generator=GPT4-Turbo,...
2025.10
91.29
93.57
86.16
89.86
Single-View + Qwen3-Plus
Generator=Qwen3-Plus,...
2025.10
91.12
92.67
83.74
88.21
Deepseek-R1-8B
Generator=Deepseek-R1-...
2025.10
90.77
93.12
85.58
89.35
Qwen3-Plus
Generator=Qwen3-Plus,...
2025.10
90.12
92.55
84.02
88.29
Feedback
Search any
task
Search any
task