| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| user-churn | StackExchange 4DBInfer (test) | AUC0.8796 | 9 | |
| post-upvote | StackExchange 4DBInfer (test) | AUC0.8896 | 9 | |
| Question Answering | StackExchange Q&A (test) | Accuracy (Bio.)82.2 | 8 | |
| Present keyphrase generation | StackExchange | F1@327.2 | 8 | |
| Topic Classification | StackExchange (test) | Acc67.56 | 6 | |
| Language Modeling | StackExchange (val) | Perplexity4.43 | 3 | |
| Absent keyphrase generation | StackExchange | Recall@54.6 | 3 | |
| Reward Modeling | StackExchange (val) | REWARD0 | 1 |