| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| WikiBio | HETA | AUC π-Soft-NS | 2.3 | 67 | 2d ago |
| TellMeWhy | HETA | AUC π-Soft-NS | 2.25 | 67 | 2d ago |
| FaithEval | ProbeRAG | F1 Score | 74.9 | 42 | 4d ago |
| MDACE (test) | AttInGrad | Comp Score | 87 | 40 | 1mo ago |
| ImageNet | | Deletion Score | 61.41 | 30 | 1mo ago |
| BoolQ | Grad-ELLM | AUC π-Soft-NS | 37 | 27 | 1mo ago |
| SST2 | Grad-ELLM | AUC π-Soft (NS) | 0.563 | 27 | 1mo ago |
| IMDb | Grad-ELLM | AUC π-Soft-NS | 57.2 | 27 | 1mo ago |
| Faithfulness Baseline Narratives | Coherent Design | Rank Accuracy (RA) | 92.2 | 25 | 26d ago |
| AG-news (test) | LIME | Rate of Label Changes | 2 | 24 | 17d ago |
| IMDb (test) | ICL | Rate of Label Changes | 4.5 | 24 | 17d ago |
| SST-2 (test) | ICL | Rate of Label Changes | 5.5 | 24 | 17d ago |
| ImageNet (val) | Score-CAM | ADCC | 81.03 | 24 | 1mo ago |
| Halogen | Distributional Semantics Tracing | CODE | 73 | 20 | 1mo ago |
| LongBench | LightRAG | NAR Score | 98.94 | 18 | 1mo ago |
| AG News | Beta-Shapley | Rate of Label Changes | 20 | 12 | 17d ago |
| IMDb | Kernel SHAP | Rate of Label Changes | 18 | 12 | 17d ago |
| SST-2 | Kernel SHAP | Rate of Label Changes | 28 | 12 | 17d ago |
| SciGen (test) | Chain-of-Thought (COT) | SummaC Score | 0.2822 | 12 | 1mo ago |
| BookSum (test) | GPT-5 | SummaC | 40.71 | 12 | 1mo ago |
| Multi-News (test) | GPT-5 | SummaC | 38.5 | 12 | 1mo ago |
| ArXiv (test) | o3 | SummaC | 53.58 | 12 | 1mo ago |
| WikiHow (test) | GPT-5 | SummaC | 39.06 | 12 | 1mo ago |
| Reddit (test) | GPT-5 | SummaC | 34.39 | 12 | 1mo ago |
| SAMSum (test) | GPT-5 | SummaC | 29.58 | 12 | 1mo ago |