Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Local Citation Recommendation with Hierarchical-Attention Text Encoder and SciBERT-based Reranking

About

The goal of local citation recommendation is to recommend a missing reference from the local citation context and optionally also from the global context. To balance the tradeoff between speed and accuracy of citation recommendation in the context of a large-scale paper database, a viable approach is to first prefetch a limited number of relevant documents using efficient ranking methods and then to perform a fine-grained reranking using more sophisticated models. In that vein, BM25 has been found to be a tough-to-beat approach to prefetching, which is why recent work has focused mainly on the reranking step. Even so, we explore prefetching with nearest neighbor search among text embeddings constructed by a hierarchical attention network. When coupled with a SciBERT reranker fine-tuned on local citation recommendation tasks, our hierarchical Attention encoder (HAtten) achieves high prefetch recall for a given number of candidates to be reranked. Consequently, our reranker requires fewer prefetch candidates to rerank, yet still achieves state-of-the-art performance on various local citation recommendation datasets such as ACL-200, FullTextPeerRead, RefSeer, and arXiv.

Nianlong Gu, Yingqiang Gao, Richard H.R. Hahnloser• 2021

Related benchmarks

TaskDatasetResultRank
Paper-Baseline RecommendationAI Resource Recommendation (test)
R@1029.18
8
Paper-Dataset RecommendationAI Resource Recommendation (test)
Recall@1018.03
8
Resource RecommendationPaper Baseline
R@2041.68
6
Citation RecommendationFullTextPeerRead (test)
Recall@550.27
5
Citation RecommendationACL-200 (test)
Recall@50.4186
5
Citation RecommendationRefSeer (test)
Recall@50.2672
5
Citation RecommendationarXiv HAtten (test)
Recall@524.26
5
Citation RecommendationACL-200
MRR45.53
5
Citation RecommendationFullTextPeerRead
MRR55.03
5
Citation RecommendationRefseer
MRR30.64
5
Showing 10 of 13 rows

Other info

Follow for update