Biomedical Question Answering on BioACE (automatic evaluation, N=50 runs)
[Chart: Precision by method. Top result: agent_faiss_deepseek, 94.96, Mar 18, 2026. Selectable metrics: Precision, Recall, Completeness, Correctness.]
Evaluation Results
Method                   Team       Date     Precision  Recall  Completeness  Correctness
agent_faiss_deepseek     dal        2026.03  94.96      38.42   82.94         64.32
rrf_llama70b_no-val      dal        2026.03  94.68      37.86   89.99         67.26
hltbio-lg.fsrrf          hltbio     2026.03  93.22      40.27   72.95         68.04
hltbio-lg.fsrrfprf       hltbio     2026.03  92.52      37.82   63.77         69.67
pubmedbert_medcpt_gpt4o  GEHC-HTIC  2026.03  89.02      35.42   59.83         67.56
task_b_baseline          Baseline   2026.03  82.23      32.5    49.73         60