Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Medical Dialogue on MTMedDialog Sample 15 cases per department
Loading...
69.32
Overall Accuracy
TheraAgent
55.176
58.848
62.52
66.192
May 7, 2026
Overall Accuracy
Neurology Accuracy
Respiratory Accuracy
Endocrinology Accuracy
Gastroenterology Accuracy
Updated 26d ago
Evaluation Results
Method
Method
Links
Overall Accuracy
Neurology Accuracy
Respiratory Accuracy
Endocrinology Accuracy
Gastroenterology Accuracy
TheraAgent
2026.05
69.32
75.2
73.44
76.27
58
Claude-4-Sonnet
2026.05
63.92
63.2
67.68
63.2
58.13
Gemini-2.5-Pro
2026.05
61.63
66.4
65.45
64.93
52.79
DeepSeek-R1
2026.05
57.6
51.2
62
60.27
50
Kimi-K2
2026.05
55.72
55.2
59.76
55.2
49.33
Feedback
Search any
task
Search any
task