Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Clinical order generation on ClinicalBench
Loading...
37.54
Precision
MedResearcher-R1
21.888
25.9515
30.015
34.0785
May 31, 2026
Precision
Recall
F1 Score
Updated 1d ago
Evaluation Results
Method
Method
Links
Precision
Recall
F1 Score
MedResearcher-R1
Category=Agentic Reaso...
2026.05
37.54
33.59
31
CAREAgent
Category=Agentic Reaso...
2026.05
32.52
40.01
31.86
MDAgents
Category=Multi-Agent M...
2026.05
32.38
36.13
29.77
ReflecTool
Category=Single-Agent...
2026.05
31.39
28.69
26.81
Tongyi DeepResearch
Category=Agentic Reaso...
2026.05
31.1
38.94
30.31
AgentClinic
Category=Multi-Agent M...
2026.05
26
14.48
16.53
ReAct
Category=Single-Agent...
2026.05
24.31
11.56
13.78
MedAgents
Category=Multi-Agent M...
2026.05
22.49
26.33
21.22
Feedback
Search any
task
Search any
task