Agentic Task Solving on EHRCon
Best result: RetroAgent, 79.1 Pass@3 (May 8, 2026)
Metrics: Pass@3, Pass@5
Updated 23d ago
Evaluation Results
Method      Variant    Date      Pass@3   Pass@5
RetroAgent             2026.05   79.1     85.2
GiGPO                  2026.05   78.1     83.3
ReACT                  2026.05   77.2     83.3
A³          σ-Reveal   2026.05   77.2     79.6
A³          Vanilla    2026.05   75       79.6
HGPO                   2026.05   71.1     75.9
GSPO                   2026.05   68.9     75.9
rStar                  2026.05   63.3     70.4
LATS                   2026.05   61.7     70.4