Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Amazon Reviews

Benchmarks

Task NameDataset NameSOTA ResultTrend
Indirect Prompt InjectionAmazon Reviews
ASR99.8
47
Sequential RecommendationAmazon Reviews 8 Domains
NDCG@10 (Avg)89.4
36
Sentiment AnalysisAmazon Reviews (test)
Average Accuracy91.74
24
Review RankingAmazon Reviews 2023 (test)
N@1 (All_Beauty)0.713
19
Sentiment AnalysisAmazon Reviews
F1 Score58.9
16
Membership Inference AttackAmazon Reviews
AUC0.901
14
Selective ClassificationAmazon Reviews Covariate Shift
AURC22.2
13
Selective ClassificationAmazon Reviews (In-Distribution)
AURC20.6
13
Sequential RecommendationAmazon Reviews Sports (test)
HR@10.0162
11
Sequential RecommendationAmazon Reviews Toys (test)
HR@10.0334
11
Sequential RecommendationAmazon Reviews Beauty (test)
HR@13.29
11
Sentiment ClassificationAmazon Reviews
Accuracy85.7
10
Sentiment Controlled Text GenerationAmazon reviews
PPL (Pos.)11.99
10
Sentiment AnalysisAmazon Reviews (Out-of-domain)
Accuracy84.7
10
Conversational RecommendationAmazon Reviews Game 2023 (test)
SR43
10
Conversational RecommendationAmazon Book Reviews 2023 (test)
SR63
10
Text ClassificationAmazon Reviews
Accuracy (Books)85.32
9
Sentiment AnalysisAmazon reviews (test)
Accuracy98
8
Sentiment ClassificationAmazon reviews Last Tasks (Final task of sequence)
Accuracy87.99
8
Sentiment ClassificationAmazon reviews All Tasks Average over 24
Accuracy85.24
8
Cross-domain recommendationAmazon Reviews Music → Books
MAE0.8298
6
Cross-domain recommendationAmazon Reviews Music → Movies
MAE0.802
6
Cross-domain recommendationAmazon Reviews Movies → Books
MAE0.7916
6
Cross-domain recommendationAmazon Reviews Movies → Music
MAE0.7768
6
Cross-domain recommendationAmazon Reviews Books → Movies
MAE0.8205
6
Showing 25 of 41 rows