| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Creative Writing | Creative Writing EQ-Bench v3 | ELO829.05 | 13 | |
| Creative Writing | Creative Writing | Discovery Score45.2 | 12 | |
| AI Text Detection | Creative Writing | AUC99.9 | 7 | |
| Creative Writing | Creative Writing | Win Rate vs Confidence70.1 | 6 | |
| AI-generated text detection | Creative Writing Out-of-Domain | F1 Score95.7 | 5 | |
| AI-generated text detection | Creative Writing In-Domain | F1 Score98.4 | 5 |