| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Contextual Robustness Question Answering | ConflictQA (Unknown queries) | Accuracy (Short Context): 99.28 | 22 |
| Contextual Robustness Question Answering | ConflictQA (Known queries) | Accuracy (Contradictory Short): 82.49 | 22 |
| Generative Multiple-choice Question Answering | ConflictQA | TA Rate: 98.8 | 6 |