| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Fact-checking | AggreFact CNN | Balanced Acc | 69.5 | 15 |
| Fact-checking | AggreFact Xsum | Balanced Accuracy | 76.4 | 15 |
| Factuality Evaluation | AggreFact-XSum FTS | Balanced Accuracy | 80.2 | 15 |
| Factuality Evaluation | AggreFact-CNN (OLD) | Balanced Accuracy | 82.1 | 15 |
| Factuality Evaluation | AggreFact CNN (EXF) | Balanced Accuracy | 76.5 | 15 |
| Factuality Evaluation | AggreFact-CNN (FTS) | Balanced Accuracy | 70.3 | 15 |
| Factuality Evaluation | AggreFact-XSum (OLD) | Balanced Accuracy | 73.9 | 14 |
| Factuality Evaluation | AggreFact-XSum (EXF) | Balanced Accuracy | 0.799 | 14 |
| Factuality Evaluation | AggreFact (FTSOTA) | Balanced Accuracy (CNN-FTS) | 70.5 | 14 |
| Fine-grained consistency detection | AggreFact-Unified 1.0 (All) | F1 | 0.4643 | 6 |
| Fine-grained consistency detection | AggreFact-Unified XFORMER 1.0 | F1 Score | 46.02 | 6 |