| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Instruction Induction | Instruction Induction | Avg Execution Score38.7 | 17 | |
| Instruction Induction | Instruction Induction (test) | Active to Passive100 | 10 | |
| Instruction Induction | Instruction Induction 1.0 (test) | Active to Passive100 | 9 | |
| Instruction Induction | Instruction Induction (test) | Antonyms0.852 | 6 | |
| 14-task average | Instruction Induction (test) | Mean Score69.95 | 4 | |
| word_in_context | Instruction Induction word_in_context (test) | Mean Accuracy61 | 4 | |
| translation_en-fr | Instruction Induction translation_en-fr (test) | Mean Score81.8 | 4 | |
| translation_en-es | Instruction Induction translation_en-es (test) | Mean Score85.4 | 4 | |
| translation_en-de | Instruction Induction translation_en-de (test) | Average Score85 | 4 | |
| taxonomy_animal | Instruction Induction taxonomy_animal (test) | Mean Accuracy89 | 4 | |
| synonyms | Instruction Induction synonyms (test) | Mean Score27.8 | 4 | |
| sentiment | Instruction Induction sentiment (test) | Mean Accuracy88.8 | 4 | |
| sentence_similarity | Instruction Induction sentence_similarity (test) | Mean Score22.2 | 4 | |
| second_word_letter | Instruction Induction second_word_letter (test) | Mean Accuracy94.2 | 4 | |
| rhymes | Instruction Induction rhymes (test) | Mean Score65 | 4 | |
| orthography_starts_with | Instruction Induction orthography_starts_with (test) | Mean Accuracy0.686 | 4 | |
| negation | Instruction Induction negation (test) | Mean Score78.2 | 4 | |
| informal_to_formal | Instruction Induction informal_to_formal (test) | Mean Score61.26 | 4 | |
| antonyms | Instruction Induction antonyms (test) | Mean Score78.8 | 4 |