Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Yelp

Benchmarks

Task NameDataset NameSOTA ResultTrend
Sequential RecommendationYelp
Recall@100.0781
120
RecommendationYelp 2018 (test)
Recall@207.83
101
RecommendationYelp (test)
NDCG@209.82
82
Synthetic Text EvaluationYelp non-IID
MAUVE Score0.3751
64
Sequential RecommendationYelp (Overall)
Hit Rate @100.6692
63
Text ClassificationYelp (test)
Accuracy94.8
55
RecommendationYelp 2018
Recall@2019.69
53
Adversarial AttackYelp
ASR39.8
49
Sentiment ClassificationYelp (test)
Accuracy96.4
46
Adversarial Attack on Neural Contextual BanditsYelp
Regret36
42
Collaborative FilteringYelp 2018
NDCG@205.75
42
Review Sentiment ClassificationYelp 2014 (test)
Accuracy68.6
41
Sequential RecommendationYelp (Tail)
Hit Rate@1026.93
39
Sentiment ClassificationYelp Polarity (test)
Error Rate1.81
37
Text classificationYelp (5-fold cross-validation)
Accuracy71.7
36
RecommendationYelp
NDCG@107.79
35
Collaborative FilteringYelp 2018 (test)
Recall@207.43
35
Language ModelingYelp (test)
PPL4.708
35
OOD DetectionYelp (test)
AUROC97.59
34
Sentiment ClassificationYelp5 (test)
Accuracy98.5
34
Text ClassificationYelp.P (test)
Accuracy98.63
34
Multi-class text classificationYelp
Micro-F161.9
33
Sentiment AnalysisYelp '13 (test)
Accuracy68.3
33
RecommendationYelp Set-up (S)
Recall@108.33
32
RecommendationYelp
NDCG@100.119
32
Showing 25 of 246 rows
...