
RELBENCH

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Entity Regression | RelBench v1.0 (test) | CTR (Avito Ad) 18.4 | 45 |
| Entity classification | RelBench rel-avito user-visits | AUC 66.8 | 36 |
| Entity classification | RelBench rel-stack user-badge | AUC 89 | 27 |
| Binary Classification | RelBench 1.0 (test) | Relational Amazon User Churn 72.03 | 26 |
| Entity Classification | RELBENCH rel-amazon item-churn (test) | AUROC 83.37 | 23 |
| Entity Classification | RELBENCH rel-avito user-clicks (test) | AUROC 71.31 | 22 |
| Entity Classification | RELBENCH rel-f1 driver-top3 (test) | AUROC 91.4 | 19 |
| Entity Classification | RELBENCH rel-f1 driver-dnf (test) | AUROC 0.812 | 19 |
| Entity Classification | RELBENCH rel-avito user-visits (test) | AUROC 0.6701 | 19 |
| Entity Classification | RELBENCH rel-stack user-badge (test) | AUROC 0.9 | 18 |
| item-sales | RELBENCH rel-hm (test) | MAE 0.052 | 16 |
| driver-position | RELBENCH rel-f1 (test) | MAE 3.539 | 16 |
| user-attendance | RELBENCH rel-event (test) | MAE 0.2423 | 16 |
| ad-ctr | RELBENCH rel-avito (test) | MAE 0.0362 | 16 |
| user-ltv | RELBENCH rel-amazon (test) | MAE 14.087 | 16 |
| Regression | RelBench v2 (test) | MAE (RateBeer User-Count) 6.021 | 13 |
| Entity classification | RelBench rel-avito user-clicks | AUC 69.04 | 12 |
| Entity Regression | RelBench V1 | F1 Positive Error (MAE) 2.747 | 11 |
| Entity Classification | RelBench V1 | DNF Score 82.41 | 11 |
| Entity Classification | RELBENCH rel-event (test) | AUROC 0.8549 | 10 |
| Entity Classification | RELBENCH rel-event user-repeat (test) | AUROC 79.26 | 10 |
| site-success | RELBENCH rel-trial (test) | MAE 0.397 | 9 |
| study-adverse | RELBENCH rel-trial (test) | MAE 43.682 | 9 |
| post-votes | RELBENCH rel-stack (test) | MAE 0.062 | 9 |
| item-ltv | RELBENCH rel-amazon (test) | MAE 48.224 | 9 |
Showing 25 of 75 rows