| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Overall | StethoBench (test) | BERTScore 71.8 | 8 |
| Location | StethoBench (test) | BERTScore 72 | 8 |
| Comparison | StethoBench (test) | BERTScore 70.7 | 8 |
| Differential Diagnosis (DDx) | StethoBench (test) | BERTScore 67.7 | 8 |
| Reasoning | StethoBench (test) | BERTScore 71.4 | 8 |
| Reporting | StethoBench (test) | BERTScore 72.8 | 8 |
| Detection | StethoBench (test) | BERTScore 70.4 | 8 |
| Classification | StethoBench (test) | BERTScore 75.5 | 8 |
| Overall | StethoBench | ROUGE-1 46.5 | 8 |
| Location | StethoBench | ROUGE-1 44.4 | 8 |
| Comparison | StethoBench | ROUGE-1 54.5 | 8 |
| Differential Diagnosis (DDx) | StethoBench | ROUGE-1 35.4 | 8 |
| Reasoning | StethoBench | ROUGE-1 48.8 | 8 |
| Reporting | StethoBench | ROUGE-1 50 | 8 |
| Detection | StethoBench | ROUGE-1 43.4 | 8 |
| Classification | StethoBench | ROUGE-1 50.2 | 8 |