| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| final diagnosis prediction | ER-Reason | Accuracy26.7 | 56 | |
| Clinical Reasoning | ER-Reason | Accuracy34.4 | 26 | |
| Diagnostic Reasoning | ER-Reason | Final Accuracy72.14 | 9 | |
| LLM Self-Consistency Certification | ER-REASON | Bonferroni Score92.1 | 5 |