| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Single-hop Question Answering | NQ (Natural Questions) (test) | Accuracy45.3 | 21 | |
| Question Answering | NQ (Natural Questions) December 2018 Wikipedia dump (test) | EM36.9 | 14 | |
| Question Retrieval | NQ (Natural Questions) (full) | Retrieval Accuracy40.9 | 12 | |
| Question Answering | NQ (Natural Questions) in-domain (test) | LasJ Score58.03 | 11 |