| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| TREC-QA (test) | ASR | MAP94.88 | 63 | 3d ago | |
| ASNQ (test) | DeBERTaV3Base + SSP (ALL) | P@171.3 | 45 | 3d ago | |
| WikiQA | P@185.6 | 36 | 4d ago | ||
| TREC-QA | P@192.6 | 24 | 4d ago | ||
| ASNQ | ELECTRA-Base + SSP (DPC) | P@170.5 | 24 | 4d ago | |
| WQA (test) | MASR-FP | P@14.96 | 19 | 3d ago | |
| SelQA (test) | DRCN | MAP0.925 | 15 | 4d ago | |
| TREC-QA clean-version | DRCN | MAP83 | 14 | 4d ago | |
| PrivacyQA 1.0 (test) | GPT-4o-mini Multi-agent | SAE0.611 | 12 | 4d ago | |
| NewsAS2 | ROBERTa-Base + SSP (All) | MAP83 | 12 | 4d ago | |
| Alexa Virtual Assistant traffic accurate Sample 3 (test) | TANDA | Prec@10.5814 | 12 | 4d ago | |
| Alexa Virtual Assistant traffic accurate Sample 2 (test) | TANDA | Prec@174.85 | 12 | 4d ago | |
| Alexa Virtual Assistant traffic Sample 1 (test) | TANDA | Prec@171.26 | 12 | 4d ago | |
| WikiQA clean (test) | TANDA (RoBERTa-Large) | MAP92 | 12 | 4d ago | |
| IQAD Bench 2 | ROBERTa-Base + SSP (All) | MAP0.014 | 11 | 4d ago | |
| IQAD Bench 1 | ROBERTa-Base + SSP (SDC) | MAP1.7 | 11 | 4d ago | |
| QASent (test) | MAP80.1 | 8 | 4d ago | ||
| TrecQA raw (test) | DRCN | MAP0.804 | 6 | 4d ago | |
| QASent | Lexical Decomposition and Composition Model | MAP77.14 | 5 | 4d ago |