Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Amazon Reviews

Benchmarks

Task NameDataset NameSOTA ResultTrend
Indirect Prompt InjectionAmazon Reviews
ASR99.8
47
Sequential RecommendationAmazon Reviews 8 Domains
NDCG@10 (Avg)89.4
36
Sentiment AnalysisAmazon Reviews (test)
Average Accuracy91.74
24
Review RankingAmazon Reviews 2023 (test)
N@1 (All_Beauty)0.713
19
Sentiment AnalysisAmazon Reviews
F1 Score58.9
16
Membership Inference AttackAmazon Reviews
AUC0.901
14
Selective ClassificationAmazon Reviews Covariate Shift
AURC22.2
13
Selective ClassificationAmazon Reviews (In-Distribution)
AURC20.6
13
Sequential RecommendationAmazon Reviews Sports (test)
HR@10.0162
11
Sequential RecommendationAmazon Reviews Toys (test)
HR@10.0334
11
Sequential RecommendationAmazon Reviews Beauty (test)
HR@13.29
11
Sentiment ClassificationAmazon Reviews
Accuracy85.7
10
Sentiment Controlled Text GenerationAmazon reviews
PPL (Pos.)11.99
10
Sentiment AnalysisAmazon Reviews (Out-of-domain)
Accuracy84.7
10
Conversational RecommendationAmazon Reviews Game 2023 (test)
SR43
10
Conversational RecommendationAmazon Book Reviews 2023 (test)
SR63
10
Sentiment AnalysisAmazon reviews (test)
Accuracy98
8
Sentiment ClassificationAmazon reviews Last Tasks (Final task of sequence)
Accuracy87.99
8
Sentiment ClassificationAmazon reviews All Tasks Average over 24
Accuracy85.24
8
Review Rating ClassificationAmazon Reviews en ja zh
Acc (de)0.4998
6
Review Rating ClassificationAmazon Reviews en, es, fr
Accuracy (de)50.99
6
RecommendationAmazon Reviews Electronics averaged across Env-1, Env-2, Env-3 (test)
NDCG@100.297
5
RecommendationAmazon Reviews 2023
HV0.16
4
Suitability Score PredictionAmazon Reviews
MAE1.078
4
Persona-based SummarizationAmazon Reviews
RefBS-R0.722
4
Showing 25 of 32 rows