| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| SQuAD 1.1 (test) | ProphetNet | BLEU-425.8 | 29 | 1mo ago | |
| EduMath-CQ | EQPR | Win Rate47.11 | 25 | 1mo ago | |
| SQuAD (test) | partial copy + QA-based reranking | BLEU-144.61 | 22 | 1mo ago | |
| SQuAD | BLEU-40.496 | 21 | 1mo ago | ||
| SQuAD 1.1 | ERNIE-GEN_LARGE | METEOR0.2631 | 21 | 1mo ago | |
| Fairytale QA | ROUGE-L54.8 | 17 | 1mo ago | ||
| SQuAD 1.1 (dev) | ERNIE-GENLARGE | BLEU-425.4 | 16 | 1mo ago | |
| QT | NeoDiff | BLEU20.44 | 14 | 1mo ago | |
| MSCOCO-VQA (test) | Humans2016 | METEOR60.8 | 12 | 1mo ago | |
| QSGen-ChildQ (test) | HIBRIDS-ENC | ROUGE-127.33 | 11 | 1mo ago | |
| SQuAD Du | UniLM-v2 | BLEU-424.43 | 10 | 1mo ago | |
| CoQA (val) | SG-CQG | Distinct-168.35 | 9 | 1mo ago | |
| SQuAD 1.1 (reversed dev-test) | ERNIE-GENLARGE | BLEU-426.95 | 9 | 1mo ago | |
| QG | Meta-DiffuB | BLEU22.71 | 8 | 1mo ago | |
| Molweni (test) | Ins | BLEU Score20.26 | 8 | 1mo ago | |
| OR-ShARC (test) | EFT | F1 (BLEU-1)59.3 | 7 | 1mo ago | |
| OR-ShARC (dev) | EFT | F1 (BLEU-1)65.5 | 7 | 1mo ago | |
| SQuAD (Zhao split) | Our model | BLEU-426.3 | 7 | 1mo ago | |
| MusiQue 2hop | DPKG | B429.03 | 6 | 1mo ago | |
| FairytaleQA (test) | SkillQG | Q-B40.656 | 6 | 1mo ago | |
| SQuAD QG | UNIMO | BLEU-424.59 | 6 | 1mo ago | |
| HotpotQA (test) | Ours2-hop | BLEU-30.2107 | 6 | 1mo ago | |
| TextRS-300 (test) | KRSVQG | BLEU-144.26 | 5 | 1mo ago | |
| NWPU-300 (test) | KRSVQG | BLEU-141.87 | 5 | 1mo ago | |
| FairytaleQA | SkillQG | Grammaticality Wins53 | 5 | 1mo ago |