| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Instruction Following | Self-Instruct | ROUGE-L | 16.5 | 48 |
| Instruction Following | Self-Instruct (test) | ROUGE-L | 21.81 | 42 |
| Instruction Following | Self-instruct Eval | Win Rate (A) | 56.7 | 19 |
| Language Generation | Self-Instruct (test) | ROUGE-L | 23.4 | 14 |
| Prompt Recovery | Self-Instruct | BLEU-1 | 34.71 | 14 |
| Open-ended instruction following | Self-instruct Eval | Win Rate (A) | 53.6 | 7 |
| Instruction Following Evaluation | SELF-INSTRUCT Ours | Score | 74.29 | 5 |
| Instruction Following Evaluation | SELF-INSTRUCT | Score | 69.48 | 5 |
| Instruction Following Evaluation | SELF-INSTRUCT seed data | Score | 72.01 | 5 |
| Instruction Following Evaluation | Self-instruct Eval | Win Rate (A) | 56.7 | 5 |
| Instruction Following | Self-Instruct | Language Democratization | 73.92 | 4 |