| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Citation Hallucination Detection | Citations PDF input mode rendered benchmark synthetic and real-world | Total Citations Count: 1,132 | 16 |
| Citation Hallucination Detection | Citations BibTeX input mode synthetic and real-world (source .bib entries) | Total Count: 1,156 | 14 |
| Text Classification | Citations (full) | Accuracy: 64.6 | 8 |
| Calibration | Citations (test) | ECE: 0.179 | 2 |