Clinical End-to-End Performance on the Translational Medicine Scenario Family (30 queries)
[Chart: Positive Rate and Negative Rate. Highlighted result: BIORESEARCHER, Positive Rate 74.7 (May 7, 2026).]
Updated 26d ago
Evaluation Results
| Method | Details | Date | Positive Rate | Negative Rate |
| --- | --- | --- | --- | --- |
| BIORESEARCHER | core model=GPT-5.4, re... | 2026.05 | 74.7 | 96.8 |
| OpenAI Deep Research | | 2026.05 | 68.9 | 81.1 |
| CellType | core model=Claude Opus... | 2026.05 | 61.7 | 83.3 |
| Medea | core model=GPT-5.4, re... | 2026.05 | 33.7 | 93.3 |