| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Document Retrieval | Reuters21578 (test) | Precision@100 94.07 | 45 |
| Machine-Generated Text Detection | Reuters (Claude) | TPR @ FPR=1% 85.2 | 36 |
| Machine-Generated Text Detection | Reuters GPT4All | TPR @ FPR=1% 93.24 | 36 |
| Unsupervised document hashing | Reuters | Precision 0.8624 | 32 |
| Entity Linking | N3-Reuters-128 | Macro F1 63.4 | 25 |
| Machine-Generated Text Detection | Reuters ChatGPT-turbo | TPR @ FPR=1% 97.51 | 24 |
| Machine-Generated Text Detection | Reuters ChatGLM | TPR @ FPR=1% 99.29 | 24 |
| Machine-Generated Text Detection | Reuters Dolly | TPR @ FPR=1% 38.62 | 24 |
| Machine-Generated Text Detection | Reuters ChatGPT split | TPR @ FPR=1% 98.09 | 24 |
| Clustering | REUTERS 10K | ACC 82.1 | 23 |
| Machine-Generated Text Detection | Reuters (test) | TPR @ FPR=1% 99.87 | 22 |
| Text Classification | Reuters | Micro-F1 96.48 | 22 |
| Text Classification Explanation | Reuters (test) | Delta Acc (Top-1) 10.4 | 21 |
| 5-way few-shot text classification | Reuters (test) | Accuracy 96.7 | 20 |
| Topic Classification | Reuters-21578 (test) | Accuracy 0.923 | 15 |
| Multi-view Clustering | Reuters 100% aligned | ACC 59.14 | 14 |
| Text Categorization | REUTERS (test) | Classification Error 2.83 | 14 |
| Multi-view Clustering | Reuters | ACC 58.4 | 13 |
| Cross-lingual Document Classification | Reuters de -> en | Accuracy 76.9 | 13 |
| Machine-Generated Text Detection | Reuters Claude 1.0 (test) | TPR @ FPR=1% 94.36 | 12 |
| Machine-Generated Text Detection | Reuters StableLM 1.0 (test) | TPR @ FPR=1% 34.84 | 12 |
| Machine-Generated Text Detection | Reuters ChatGLM 1.0 (test) | TPR @ FPR=1% 97.33 | 12 |
| Machine-Generated Text Detection | Reuters ChatGPT-turbo 1.0 (test) | TPR @ FPR=1% 0.5609 | 12 |
| Machine-Generated Text Detection | Reuters ChatGPT 1.0 (test) | TPR @ FPR=1% 89.82 | 12 |
| Machine-Generated Text Detection | Reuters GPT4All 1.0 (test) | TPR @ FPR=1% 71.33 | 12 |