| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| COMMONSENSEQA 1,000 examples | Veracity Search (VS) | Mean Hamming Similarity0.935 | 20 | 4d ago | |
| GSM8K 1,000 examples | Veracity Search (VS) | Mean Hamming Similarity75.1 | 20 | 4d ago | |
| PRONTOQA (1,000 examples) | Veracity Search (VS) | Mean Hamming Similarity96.4 | 20 | 4d ago | |
| PRONTOQA 5-hop (test) | AVI | Hamming Similarity0.955 | 4 | 4d ago | |
| PRONTOQA 4-hop (test) | AVI | Hamming Similarity96.7 | 4 | 4d ago | |
| PRONTOQA 3-hop (test) | AVI | Hamming Similarity95.6 | 4 | 4d ago |