Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AgentClinic

Benchmarks

Task NameDataset NameSOTA ResultTrend
Medical DiagnosisAgentClinic OOD original (test)
Similarity (Sim)0.684
20
Interactive Medical DiagnosisAgentClinic MedQA
ICR53.9
12
Agentic medical interactionAgentClinic MedQA
Accuracy65.8
6
Showing 3 of 3 rows