| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Fluency | BOOKS | Fluency Score: 4.46 | 28 |
| Next-item prediction | Books Industrial dataset (test) | Hits@10: 81.62 | 22 |
| Sequential Recommendation | Books Amazon (test) | HR@200: 0.2173 | 20 |
| LLM Unlearning | BOOKS | VerMem: 0 | 16 |
| Trustworthiness evaluation | Books | Avg F1: 96.7 | 16 |
| OOD detection | Books-History | FPR@95: 36.03 | 13 |
| Hierarchical Text Classification | Books | Macro F1: 61.2 | 10 |
| Recommendation | Books | H@1: 51.17 | 9 |
| Recommendation | Books N=20 (test) | Hit Rate @ 1: 42.36 | 9 |
| Recommendation | Books (test) | NDCG@20: 5.93 | 8 |
| Attribute estimation | Books | Hits@10: 90.77 | 8 |
| Personalized review generation | Books | ROUGE-1: 33.18 | 7 |
| Collaborative Ranking | Books | H@10: 26.73 | 6 |
| Recommendation | Books | Speedup Ratio (k=10): 35.5 | 4 |
| Language Modeling | BOOKS (test) | Perplexity: 10.89 | 2 |
| Language Modeling | BOOKS (dev) | Perplexity: 14.2 | 2 |
| Text Similarity | BK3 (Books3) | BLEU: 90 | 2 |