| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Fact-checking | HealthVer | F1-macro: 68 | 21 |
| Natural Language Explanation Generation | HealthVer (test) | Faithfulness: 0.033 | 9 |
| Human Evaluation of Explanations | HealthVer | Helpfulness (MAR): 2.15 | 5 |
| Knowledge Poisoning Attack | HealthVer k=10 (test) | ASR: 21 | 4 |
| Claim Verification | HealthVer (test) | Precision (NE): 47 | 2 |