End-to-End Neural Ad-hoc Ranking with Kernel Pooling

About

This paper proposes K-NRM, a kernel based neural model for document ranking. Given a query and a set of documents, K-NRM uses a translation matrix that models word-level similarities via word embeddings, a new kernel-pooling technique that uses kernels to extract multi-level soft match features, and a learning-to-rank layer that combines those features into the final ranking score. The whole model is trained end-to-end. The ranking layer learns desired feature patterns from the pairwise ranking loss. The kernels transfer the feature patterns into soft-match targets at each similarity level and enforce them on the translation matrix. The word embeddings are tuned accordingly so that they can produce the desired soft matches. Experiments on a commercial search engine's query log demonstrate the improvements of K-NRM over prior feature-based and neural-based states-of-the-art, and explain the source of K-NRM's advantage: Its kernel-guided embedding encodes a similarity metric tailored for matching query words to document words, and provides effective multi-level soft matches.

Chenyan Xiong, Zhuyun Dai, Jamie Callan, Zhiyuan Liu, Russell Power• 2017

Related benchmarks

Task	Dataset	Result
Passage Ranking	MS MARCO (dev)	MRR@1021.8	73
Information Retrieval	Robust04	P@2042.21	72
Information Retrieval	TREC Title queries 1-3	MAP0.2633	19
Ad-hoc Document Ranking	WebTrack 2012-14	nDCG@2032.87	18
Document Reranking	Robust04 Description	MAP0.4066	13
Document Reranking	GOV2 Title	MAP34.69	12
Document Reranking	GOV2 Description	MAP32.69	12
Document Reranking	Robust04 Title	MAP36.73	12
Passage Ranking	MS MARCO (eval)	MRR@100.198	12
Ad hoc document retrieval	TREC Microblog 2012 (test)	AP22.77	9

Showing 10 of 13 rows

Other info

Follow for update

@wizwand_team Discord