Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
List Question Answering on BioASQ Task B
Loading...
71.16
Precision
ParallaxRAG + Qwen3-Plus
38.5664
47.0282
55.49
63.9518
Oct 17, 2025
Precision
Recall
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Precision
Recall
F1 Score
ParallaxRAG + Qwen3-Plus
Generator=Qwen3-Plus,...
2025.10
71.16
45.69
47.37
Single-View + Qwen3-Plus
Generator=Qwen3-Plus,...
2025.10
68.33
42.8
44.64
GPT4-Turbo
Generator=GPT4-Turbo,...
2025.10
57.88
48.57
50.51
Qwen3-Plus
Generator=Qwen3-Plus,...
2025.10
56.82
39.89
45.63
GPT-4o
Generator=GPT-4o, Retr...
2025.10
51.02
40.25
43.3
Deepseek-R1-8B
Generator=Deepseek-R1-...
2025.10
39.82
34.27
35.17
Feedback
Search any
task
Search any
task