Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ML

Benchmarks

Task NameDataset NameSOTA ResultTrend
Click-Through Rate PredictionML 1M
AUC0.9087
46
RecommendationMl 1M (test)
Recall21.2
24
Rubric satisfaction evaluationML
Claude-4 Sonnet Score36.7
24
Negative Constraint RecommendationML 1M
Recall@100.2076
22
RecommendationML-100K
HR@546.15
18
Sequential RecommendationML-20M
Memory (MB)2,315
18
Collaborative FilteringML-20M large (test)
Recall@200.403
17
Collaborative FilteringML-20M strong generalization
AOA Recall@200.3956
14
Model ExtractionML-1M (test)
N@1062.6
12
RecommendationML-100K
NDCG@119.21
11
CTR PredictionML-1M
AUC0.8194
11
RecommendationML-10M (test)
RMSE0.777
10
Federated Recommendation and Attribute UnlearningML-100K Age attribute 1.0 (leave-one-out)
HR@1067.03
9
User behavior simulationML-100K
Precision71.99
9
Collaborative FilteringML-1M
HR@1031.79
9
Recommendation System EfficiencyML 1M (overall)
Training Time (m)2.06
9
Item Response Theory AssessmentML-1M
AUC0.701
9
Generative RecommendationML 20M
NDCG@100.1233
8
Positive Constraint RecommendationML1M
Recall@1073
8
Collaborative FilteringML-10M
HR@100.3676
8
Recommendation System EfficiencyML 10M (overall)
Training Time (h)3.06
8
Sequential RecommendationML-1M implicit feedback
HR@519.44
8
RecommendationML-20M (test)
R@1096.2
8
Future item recommendationML-100K
Recall14.9
8
RankingML-20M (test)
Recall@2039.5
8
Showing 25 of 44 rows