Factoid Question Answering on BioASQ Task B
[Chart: Strict Score by date. Highlighted: GPT4-Turbo, 55.05 (Oct 17, 2025). Metrics: Strict Score, Lenient Score, MRR.]
Updated 4d ago
Evaluation Results
| Method | Configuration | Date | Strict Score | Lenient Score | MRR |
|---|---|---|---|---|---|
| GPT4-Turbo | Generator=GPT4-Turbo,... | 2025.10 | 55.05 | 65.15 | 60.1 |
| ParallaxRAG + Qwen3-Plus | Generator=Qwen3-Plus,... | 2025.10 | 42.1 | 82.22 | 86.67 |
| Single-View + Qwen3-Plus | Generator=Qwen3-Plus,... | 2025.10 | 37.73 | 72.14 | 84.62 |
| GPT-4o | Generator=GPT-4o, Retr... | 2025.10 | 34.62 | 34.62 | 34.62 |
| Qwen3-Plus | Generator=Qwen3-Plus,... | 2025.10 | 32.19 | 62.13 | 41.32 |
| Deepseek-R1-8B | Generator=Deepseek-R1-... | 2025.10 | 26.2 | 27.33 | 26.77 |
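
For readers unfamiliar with the three metrics, here is a minimal sketch of how Strict Score, Lenient Score, and MRR are commonly computed for factoid questions, where a system returns a ranked list of up to five candidate answers per question. This is an illustration under stated assumptions, not the evaluation code behind this leaderboard: the function names (`first_correct_rank`, `factoid_scores`), the lowercase/whitespace normalization, and the exact-match comparison against gold synonyms are all hypothetical choices. The sketch returns fractions in [0, 1]; the scores listed on this page appear to be percentages.

```python
from typing import Dict, Optional, Sequence


def _normalize(text: str) -> str:
    # Assumption: simple case-insensitive, whitespace-collapsed exact match.
    return " ".join(text.lower().split())


def first_correct_rank(
    predictions: Sequence[str],
    gold_synonyms: Sequence[str],
    max_answers: int = 5,
) -> Optional[int]:
    """Return the 1-based rank of the first correct answer, or None if absent.

    Only the first `max_answers` candidates are considered.
    """
    gold = {_normalize(g) for g in gold_synonyms}
    for rank, answer in enumerate(predictions[:max_answers], start=1):
        if _normalize(answer) in gold:
            return rank
    return None


def factoid_scores(ranks: Sequence[Optional[int]]) -> Dict[str, float]:
    """Aggregate per-question ranks into strict, lenient, and MRR scores."""
    n = len(ranks)
    if n == 0:
        raise ValueError("at least one question is required")
    # Strict: the top-ranked candidate is correct.
    strict = sum(1 for r in ranks if r == 1) / n
    # Lenient: any considered candidate is correct.
    lenient = sum(1 for r in ranks if r is not None) / n
    # MRR: mean of 1/rank, counting 0 when no candidate is correct.
    mrr = sum(1.0 / r for r in ranks if r is not None) / n
    return {"strict": strict, "lenient": lenient, "mrr": mrr}
```

A usage example: for four questions whose first correct answers appear at ranks 1, 2, none, and 1, `factoid_scores([1, 2, None, 1])` gives a strict score of 0.5, a lenient score of 0.75, and an MRR of 0.625. Under these particular definitions, MRR always lies between the strict and lenient scores; the leaderboard's own scoring may differ in details such as answer matching.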