| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Recommendation | Amazon Review Toys (test) | AUC0.7985 | 30 | |
| Recommendation | Amazon Review Beauty (test) | AUC0.8189 | 30 | |
| Recommendation | Amazon Review Sports (test) | AUC82.4 | 30 | |
| CTR prediction | Amazon-Review Automotive (test) | AUC0.654 | 20 | |
| CTR prediction | Amazon-Review Beauty (test) | AUC0.634 | 20 | |
| CTR prediction | Amazon-Review Appliances (test) | AUC0.688 | 20 | |
| Text Classification | Amazon Review 1000 labels (test) | Top-1 Error Rate39.17 | 19 | |
| Text Classification | Amazon Review 250 labels (test) | Top-1 Error Rate42.98 | 19 | |
| Text Classification | Amazon Review (test) | Macro F1 Score63.85 | 18 | |
| Semi-supervised learning | Amazon Review | Error Rate40.16 | 16 | |
| Sentence Classification | Amazon Review (test) | Accuracy92.94 | 15 | |
| Event sequence modeling | Amazon Review | Accuracy (%)70 | 13 | |
| Next-item prediction | Amazon Review Office (test) | HR@313.44 | 11 | |
| Next-item prediction | Amazon Review Industrial (test) | HR@30.1189 | 11 | |
| Multi-class Classification | Amazon Review (AR) | Accuracy44.06 | 10 | |
| Error Detection | Amazon Review | F1 Score0.453 | 10 | |
| Accuracy Estimation | Amazon Review | Absolute Estimation Error0.018 | 10 | |
| Text Classification | Amazon Review Full (test) | Test Error37 | 9 | |
| Perception classification | Amazon Review (test) | Joy0.96 | 8 | |
| Topic Modeling | Amazon Review | CNPMI0.058 | 8 | |
| Text Classification | Amazon-Review single-domain generalization (test) | Class D Score76.42 | 8 | |
| Sentiment Classification | Amazon Review Dataset K -> E | Accuracy80.7 | 8 | |
| Sentiment Classification | Amazon review dataset K -> D | Accuracy72.2 | 8 | |
| Sentiment Classification | Amazon review dataset K -> B | Accuracy68.2 | 8 | |
| Sentiment Classification | Amazon Review E -> K | Accuracy84.4 | 8 |