| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| AMTCele | GPT4o | Accuracy88.88 | 64 | 4d ago | |
| COCO | IT_zs | Accuracy81.9 | 30 | 4d ago | |
| PHEME | MetaAdapt | Accuracy69.2 | 26 | 4d ago | |
| SLN (test) | KALM | Micro F194.22 | 26 | 4d ago | |
| PolitiFact | PAMAS | Accuracy96.43 | 21 | 4d ago | |
| Chinese Dataset | OmiGraph | macF185.85 | 18 | 4d ago | |
| English Dataset | OmiGraph | Macro F176.08 | 18 | 4d ago | |
| LUN | KALM | Macro F169.82 | 17 | 4d ago | |
| VERITE (out-of-distribution) | LADLE-MM | Accuracy79.6 | 13 | 4d ago | |
| Horne 2017 | TEGRA | Accuracy99.99 | 12 | 4d ago | |
| CoAID | TEGRA | Accuracy99.42 | 12 | 4d ago | |
| MISBENCH (Multi-hop based Misinformation) 1.0 (test) | GPT-4o | Factual Memory Success Rate96.88 | 12 | 4d ago | |
| MISBENCH One-hop based Misinformation 1.0 (test) | GPT-4o | Factual Memory Success Rate91.44 | 12 | 4d ago | |
| DeRev 2018 | PAMAS | Accuracy96.28 | 11 | 4d ago | |
| Amazon | PAMAS | Accuracy0.9715 | 11 | 4d ago | |
| MMCoVaR | AMPEND | Accuracy91.27 | 10 | 4d ago | |
| LUN (test) | KALM | Micro F171.28 | 9 | 4d ago | |
| DGM4 (test) | Fact-checking (ours) | Accuracy55 | 7 | 4d ago | |
| COVID-19 Misinformation Shared Task English (test) | TOKOFOU | Precision90.7 | 4 | 4d ago |