| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Instruction Tuning | FLAN subset Average (test) | ROUGE-156.7 | 7 | |
| Instruction Tuning | FLAN All (test) | ROUGE-157.5 | 7 | |
| Instruction Tuning | FLAN Dual (test) | ROUGE-156.3 | 7 | |
| Instruction Tuning | FLAN Single (test) | ROUGE-156.2 | 7 | |
| Natural Language Processing | FLAN 8-task subset: arc_challenge, cosmos_qa, definite_pronoun_resolution, glue_qqp, hellaswag, mnli, squad_v1, sst2 | Closed-book QA71 | 7 | |
| Instruction Following | Flan | Paraphrase0.7549 | 2 |