| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Skin Disease Classification | DDI (Out-of-domain) | F1 Score60.13 | 12 | |
| Drug-Drug Interaction prediction | DDI dataset (5-fold cross-val) | AUC95.7 | 9 | |
| LLM-as-a-Judge | DDI (test) | EM (Δ)59.03 | 8 | |
| Medical Image Classification | DDI (test) | Accuracy82.58 | 8 | |
| Skin lesion classification | DDI v1 (test) | Accuracy79 | 7 | |
| Biomedical Interaction Prediction | DDI (test) | AUPRC0.897 | 7 | |
| Skin disease classification | DDI In-Domain | Avg Accuracy87.4 | 4 | |
| Relation Extraction | DDI (test) | Macro F184.1 | 3 | |
| Relation Extraction | DDI | F1 Score44.89 | 1 |