End-to-End Neural Ad-hoc Ranking with Kernel Pooling
About
This paper proposes K-NRM, a kernel based neural model for document ranking. Given a query and a set of documents, K-NRM uses a translation matrix that models word-level similarities via word embeddings, a new kernel-pooling technique that uses kernels to extract multi-level soft match features, and a learning-to-rank layer that combines those features into the final ranking score. The whole model is trained end-to-end. The ranking layer learns desired feature patterns from the pairwise ranking loss. The kernels transfer the feature patterns into soft-match targets at each similarity level and enforce them on the translation matrix. The word embeddings are tuned accordingly so that they can produce the desired soft matches. Experiments on a commercial search engine's query log demonstrate the improvements of K-NRM over prior feature-based and neural-based states-of-the-art, and explain the source of K-NRM's advantage: Its kernel-guided embedding encodes a similarity metric tailored for matching query words to document words, and provides effective multi-level soft matches.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Passage Ranking | MS MARCO (dev) | MRR@1021.8 | 73 | |
| Information Retrieval | Robust04 | P@2042.21 | 72 | |
| Information Retrieval | TREC Title queries 1-3 | MAP0.2633 | 19 | |
| Ad-hoc Document Ranking | WebTrack 2012-14 | nDCG@2032.87 | 18 | |
| Document Reranking | Robust04 Description | MAP0.4066 | 13 | |
| Document Reranking | GOV2 Title | MAP34.69 | 12 | |
| Document Reranking | GOV2 Description | MAP32.69 | 12 | |
| Document Reranking | Robust04 Title | MAP36.73 | 12 | |
| Passage Ranking | MS MARCO (eval) | MRR@100.198 | 12 | |
| Ad hoc document retrieval | TREC Microblog 2012 (test) | AP22.77 | 9 |