DDI

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Skin Disease Classification | DDI (Out-of-domain) | F1 Score 60.13 | 12 |
| Drug-Drug Interaction prediction | DDI dataset (5-fold cross-val) | AUC 95.7 | 9 |
| LLM-as-a-Judge | DDI (test) | EM (Δ) 59.03 | 8 |
| Medical Image Classification | DDI (test) | Accuracy 82.58 | 8 |
| Skin lesion classification | DDI v1 (test) | Accuracy 79 | 7 |
| Biomedical Interaction Prediction | DDI (test) | AUPRC 0.897 | 7 |
| Relation Classification | DDI 13 | F1 Score 87.6 | 6 |
| Relation Extraction | DDI 13 | Precision 87 | 6 |
| Skin disease classification | DDI In-Domain | Avg Accuracy 87.4 | 4 |
| Relation Extraction | DDI (test) | Macro F1 84.1 | 3 |
| Relation Extraction | DDI | F1 Score 44.89 | 1 |