| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Diversity | podcast | Win Count 56 | 6 |
| Comprehensiveness | podcast | Win Count 50 | 6 |
| GraphRAG Content Quality Evaluation | podcast Leiden level C3 post-cutoff | Comprehensiveness Win 50 | 6 |
| GraphRAG Content Quality Evaluation | podcast Leiden level C2 post-cutoff | Comprehensiveness Win Count 50 | 6 |
| Directness | podcast C3 community level post-cutoff (test) | Win Count 64 | 6 |
| Directness | podcast C2 community level post-cutoff (test) | Wins 62 | 6 |
| Diversity | podcast Leiden level C1 | Win Rate 72 | 6 |
| Diversity | podcast Leiden level C0 | Win Rate 68 | 6 |
| Comprehensiveness | podcast Leiden level C1 | Win Rate (C1 Podcast) 68 | 6 |
| Comprehensiveness | podcast Leiden level C0 | Win Rate 64 | 6 |
| Community Summary Evaluation | podcast C3 (post-cutoff) | Comprehensiveness Win 53 | 6 |
| Community Summary Evaluation | podcast C2 (post-cutoff) | Comprehensiveness Win 58 | 6 |
| Speaker-Attributed Automatic Speech Recognition | Podcast (test) | CER 4.46 | 4 |