| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Language Modeling | C4 | Perplexity | 1 | 1,688 |
| Language Modeling | C4 | Perplexity | 4.77 | 1,565 |
| Language Modeling | C4 (val) | PPL | 1.7 | 737 |
| Language Modeling | C4 (test) | Perplexity | 4.97 | 464 |
| Language Generation | C4 | Perplexity | 5.52 | 190 |
| Perplexity | C4 | Perplexity | 6.24 | 137 |
| Language Modeling | C4 | C4 Loss | 2.55 | 121 |
| Watermark Detection | C4 | TPR @ FPR=1% | 1 | 95 |
| Language Modeling | C4 | Perplexity | 7.34 | 72 |
| Language Modeling | C4 | Perplexity | 9.36 | 58 |
| Pre-training | C4 (val) | Perplexity | 17.8 | 58 |
| Language Modeling | C4 (train) | PPL | 15.28 | 50 |
| LLM Pretraining | C4 | Perplexity | 13.3 | 47 |
| Language Model Pre-training | C4 Llama 2 pre-training (val) | Perplexity | 13.19 | 47 |
| Sentence-Level Watermarking | C4 | AUROC | 100 | 40 |
| Watermarking | C4 | TPR (FPR < 10^-4) | 100 | 40 |
| Language Modeling | C4 LLaMA-130M (val) | Perplexity | 18.504 | 40 |
| Language Modeling | C4 | Entropy | 1 | 39 |
| Watermark Detection | C4 | TPR @ 1% FPR | 100 | 36 |
| Language Modeling | C4 | Log-PPL | 2.834 | 35 |
| Masked Language Modeling | C4 (val) | PPLX | 3.828 | 35 |
| Feature Space Preservation | C4 | Cosine Similarity | 100 | 32 |
| Language Modeling | C4 | Word Perplexity | 18.08 | 32 |
| Next Token Prediction | C4 (held-out) | Perplexity (PPL) | 21.5 | 30 |
| Clustering | C4 | Clustering Score | 63.95 | 30 |