| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| Reddit CQG (test) | GTM | Fluency | 54.8 | 10 | 4d ago |
| CANARD Non-Biased | SOD | CQG Non-Biased Performance | 22.4 | 6 | 4d ago |
| CANARD Biased | MarCQAp | Performance | 26 | 6 | 4d ago |
| CoQAR Non-Biased | SOD | Performance (%) | 18.8 | 6 | 4d ago |
| CoQAR (Biased) | | Performance | 26.7 | 6 | 4d ago |
| CoQA (test) | SG-CQG | Factuality Score | 2.61 | 3 | 4d ago |