| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Reasoning Quality Correlation Analysis | LIAR | Somers' D: 0.2769 | 45 |
| Fact Checking | LIAR | Accuracy@1: 69 | 33 |
| Veracity Prediction | LIAR RAW | Macro F1: 50.59 | 32 |
| Fact Verification | LIAR | F1 Score: 68.6 | 24 |
| Multi-Classification | LIAR Open | Accuracy: 46.81 | 23 |
| Binary Classification | LIAR Closed | Accuracy: 79.15 | 23 |
| Multi-Classification | LIAR Closed | Accuracy: 26.99 | 22 |
| Binary Classification | LIAR Open | Accuracy: 84.21 | 22 |
| Fake News Detection | LIAR (test) | Accuracy: 65.2 | 21 |
| Fact-checking | LIAR-RAW | Precision: 77.38 | 20 |
| Fake News Detection | LIAR (val) | Accuracy: 27.7 | 13 |
| Fact-checking | LIAR | Accuracy: 79 | 12 |
| Claim Verification | LIAR (test) | Precision: 46.8 | 12 |
| Veracity Explanation Ranking | LIAR RAW | Informativeness (MAR): 2.09 | 12 |
| Veracity Prediction | LIAR-RAW (test) | Precision: 43.83 | 12 |
| Reasoning | LIAR Ambiguity-Augmented subset of 200 samples | Accuracy@1: 69 | 11 |
| Fake News Detection | LIAR ambiguity-augmented | Accuracy: 68.9 | 11 |
| Fact-Checking | LIAR (test) | Accuracy: 68.2 | 11 |
| Explanation Generation | LIAR-RAW (test) | ROU-1: 25.5 | 11 |
| Node classification | LIAR (test) | Fidelity: 100 | 8 |
| Adversarial Attack | LIAR-NEW Perplexity | Attack Success Rate (ASR): 19.95 | 7 |
| Adversarial Attack | LIAR ClaimBuster NEW | Attack Success Rate (ASR): 97.02 | 7 |
| Adversarial Attack | LIAR-NEW Verifact | Attack Success Rate: 40.34 | 7 |
| Adversarial Attack | LIAR ICL NEW | Attack Success Rate: 30.35 | 7 |
| Explanation Quality Evaluation | LIAR RAW | Meaningfulness Score: 2.29 | 7 |