| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| FineWeb-edu CC-MAIN-2024-10 | Ens(Good, Bad) | Recall@3081.9 | 7 | 4d ago | |
| OpenWebText Quality | Balanced Accuracy88.2 | 3 | 4d ago | ||
| OpenWebText Mainstream | Balanced Accuracy92.4 | 3 | 4d ago | ||
| OpenWebText Politics | Balanced Accuracy95.6 | 3 | 4d ago | ||
| OpenWebText AI subset | SIEVE | Balanced Acc95.7 | 3 | 4d ago | |
| OpenWebText Climate | SIEVE | Balanced Acc96.7 | 3 | 4d ago |