| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multi-hop Question Answering | Mix | F1 Score79.69 | 14 | |
| Retrieval-Augmented Generation | Mix | Comprehensiveness95.9 | 12 | |
| Explanatory QA | Mix (test) | EM76.5 | 10 | |
| Robustness Prediction | MIX (Dynamic) | Mean Error0.0006 | 8 | |
| Robustness Prediction | MIX (Static) | Mean Error0.0047 | 8 | |
| Federated Graph Classification | Mix across-domain setting | Communication Rounds3 | 8 | |
| Retrieval | Mix | Recall@30.66 | 7 | |
| Visual Question-Answering | Mix dataset | Accuracy (Mix)64.93 | 3 |