| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| WritingBench | | Score 85.87 | 74 | 5d ago | |
| WritingBench v1 (test) | | Average Score 85.3 | 61 | 1mo ago | |
| ROCStories | | Accuracy 76.3 | 48 | 2mo ago | |
| WildChat 5,000 conversations | UserLM-8b | KLfwd 0.392 | 24 | 23d ago | |
| Arena-Write v1 (test) | LongWriter-Zero | Elo 1,447 | 16 | 3mo ago | |
| LongBench-Write (LB) | | LB Score 97.8 | 11 | 1mo ago | |
| Challenge-writing Task set | Vanilla GPT-4 | ROUGE-1 31.62 | 2 | 3mo ago | |
| Easy-writing | Mixtral-8x7b | ROUGE-1 51.65 | 2 | 3mo ago | |