DDI

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| High-stakes specialized classification | DDI (test) | Macro-F1 61.12 | 37 |
| Skin Disease Classification | DDI (Out-of-domain) | F1 Score 60.13 | 12 |
| Drug-Drug Interaction prediction | DDI dataset (5-fold cross-val) | AUC 95.7 | 9 |
| LLM-as-a-Judge | DDI (test) | EM (Δ) 59.03 | 8 |
| Medical Image Classification | DDI (test) | Accuracy 82.58 | 8 |
| Skin lesion classification | DDI v1 (test) | Accuracy 79 | 7 |
| Biomedical Interaction Prediction | DDI (test) | AUPRC 0.897 | 7 |
| Drug-Drug Interaction Prediction | DDI Seen-Drug Setting | Accuracy (ACC) 95.69 | 6 |
| Drug-Drug Interaction Prediction | DDI dataset (Both-unseen) | ACC 58.12 | 6 |
| Relation Classification | DDI 13 | F1 Score 87.6 | 6 |
| Relation Extraction | DDI 13 | Precision 87 | 6 |
| High-stakes specialized classification | DDI (val) | Macro F1 58.27 | 4 |
| Skin disease classification | DDI In-Domain | Avg Accuracy 87.4 | 4 |
| Relation Extraction | DDI (test) | Macro F1 84.1 | 3 |
| Relation Extraction | DDI | F1 Score 44.89 | 1 |