ML

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Click-Through Rate Prediction | ML 1M | AUC 0.9087 | 46
Dimension Selection | ML-1M | AUC 80.99 | 31
Sequential Recommendation | ML-20M | NDCG@10 0.78 | 26
Recommendation | Ml 1M (test) | Recall 21.2 | 24
Rubric satisfaction evaluation | ML | Claude-4 Sonnet Score 36.7 | 24
Sequential Recommendation | ML-100K | NDCG@20 13.29 | 22
Negative Constraint Recommendation | ML 1M | Recall@10 0.2076 | 22
Generative Recommendation | ML OOD 10M | Hit Rate @10 61 | 18
Recommendation | ML-100K | HR@5 46.15 | 18
Collaborative Filtering | ML-20M large (test) | Recall@20 0.403 | 17
Sequential Recommendation | ML-10M | HR@5 75.91 | 15
Collaborative Filtering | ML-20M strong generalization | AOA Recall@20 0.3956 | 14
Multi-objective Re-ranking | ML-1M | HR@5 81.24 | 13
Collaborative Filtering | ML 10M | Recall@10 19.34 | 12
Model Extraction | ML-1M (test) | N@10 62.6 | 12
Recommendation | ML-100K | NDCG@1 19.21 | 11
CTR Prediction | ML-1M | AUC 0.8194 | 11
Sequential Recommendation | ML 1M (test) | NDCG@10 20.007 | 11
Sequential Recommendation | ML 32M | HR@5 18.02 | 10
Sequential Recommendation | ML-1M Head | NDCG@10 4.46 | 10
Recommendation | ML-10M (test) | RMSE 0.777 | 10
Federated Recommendation and Attribute Unlearning | ML-100K Age attribute 1.0 (leave-one-out) | HR@10 67.03 | 9
User behavior simulation | ML-100K | Precision 71.99 | 9
Collaborative Filtering | ML-1M | HR@10 31.79 | 9
Recommendation System Efficiency | ML 1M (overall) | Training Time (m) 2.06 | 9
Showing 25 of 56 rows