
ER-Reason

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| final diagnosis prediction | ER-Reason | Accuracy: 26.7 | 56 |
| Clinical Reasoning | ER-Reason | Accuracy: 34.4 | 26 |
| Diagnostic Reasoning | ER-Reason | Final Accuracy: 72.14 | 9 |
| LLM Self-Consistency Certification | ER-REASON | Bonferroni Score: 92.1 | 5 |