| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Natural Language Processing | Federated Dataset Personalization 2 | Paraphrasing Accuracy90.5 | 6 | |
| Natural Language Processing | Federated Dataset 1 (Personalization) | Paraphrasing Score0.805 | 6 | |
| Natural Language Processing | Federated Dataset Test-Time Personalization 2 | Paraphrasing71.64 | 4 | |
| Open Domain QA | Federated Dataset 1 unseen tasks (test) | AVG Score78.76 | 4 | |
| Reading Comprehension | Federated Dataset 1 unseen tasks (test) | Average Score71.88 | 4 | |
| Summarization | Federated Dataset 1 unseen tasks (test) | Average Score22.46 | 4 | |
| Natural Language Processing | Federated Dataset 1 Test-Time Personalization | Paraphrase Accuracy78.1 | 4 |