| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| RottenReview | Spearman Correlation0.49 | 10 | 1mo ago | ||
| SubstanReview | WarrantScore | Spearman Correlation0.82 | 9 | 1mo ago | |
| HAMLETJudge (val) | HamletJudge | CP Correlation0.792 | 4 | 1mo ago | |
| Refined human judgment dataset human vs model-generated | SO | SO-S0.995 | 3 | 1mo ago | |
| Original human judgment dataset | Generation Perplexity | Generation Perplexity0.643 | 3 | 1mo ago |