Local Citation Recommendation with Hierarchical-Attention Text Encoder and SciBERT-based Reranking
About
The goal of local citation recommendation is to recommend a missing reference from the local citation context and optionally also from the global context. To balance the tradeoff between speed and accuracy of citation recommendation in the context of a large-scale paper database, a viable approach is to first prefetch a limited number of relevant documents using efficient ranking methods and then to perform a fine-grained reranking using more sophisticated models. In that vein, BM25 has been found to be a tough-to-beat approach to prefetching, which is why recent work has focused mainly on the reranking step. Even so, we explore prefetching with nearest neighbor search among text embeddings constructed by a hierarchical attention network. When coupled with a SciBERT reranker fine-tuned on local citation recommendation tasks, our hierarchical Attention encoder (HAtten) achieves high prefetch recall for a given number of candidates to be reranked. Consequently, our reranker requires fewer prefetch candidates to rerank, yet still achieves state-of-the-art performance on various local citation recommendation datasets such as ACL-200, FullTextPeerRead, RefSeer, and arXiv.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Paper-Baseline Recommendation | AI Resource Recommendation (test) | R@1029.18 | 8 | |
| Paper-Dataset Recommendation | AI Resource Recommendation (test) | Recall@1018.03 | 8 | |
| Resource Recommendation | Paper Baseline | R@2041.68 | 6 | |
| Citation Recommendation | FullTextPeerRead (test) | Recall@550.27 | 5 | |
| Citation Recommendation | ACL-200 (test) | Recall@50.4186 | 5 | |
| Citation Recommendation | RefSeer (test) | Recall@50.2672 | 5 | |
| Citation Recommendation | arXiv HAtten (test) | Recall@524.26 | 5 | |
| Citation Recommendation | ACL-200 | MRR45.53 | 5 | |
| Citation Recommendation | FullTextPeerRead | MRR55.03 | 5 | |
| Citation Recommendation | Refseer | MRR30.64 | 5 |