| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| RottenReview | Spearman Correlation0.49 | 10 | 3mo ago | ||
| HATS 70% human agreement split | SemDist | Correlation Ratio78.8 | 9 | 28d ago | |
| HATS 100% human agreement split | SemDist | Correlation Ratio88.8 | 9 | 28d ago | |
| SubstanReview | WarrantScore | Spearman Correlation0.82 | 9 | 3mo ago | |
| HAMLETJudge (val) | HamletJudge | CP Correlation0.792 | 4 | 3mo ago | |
| Refined human judgment dataset human vs model-generated | SO | SO-S0.995 | 3 | 3mo ago | |
| Original human judgment dataset | Generation Perplexity | Generation Perplexity0.643 | 3 | 3mo ago |