SeedER: Seed-and-Expand Retrieval from Knowledge Graphs
About
Knowledge graphs (KGs) offer a rich representation for relational knowledge, but their irregular structure makes retrieval challenging: ego-graph expansion grows rapidly, and dense embedding methods struggle with multi-hop compositional queries. Existing agent-based graph exploration approaches, while expressive, are often too expensive for large-scale retrieval. We introduce SeedER (Seed-and-Expand Retrieval), a retrieval framework that explicitly leverages KG structure through iterative, low-cost expansion. SeedER first seeds a compact set of core nodes using lightweight dense and entity-based retrieval, then selectively expands this set via a learned graph-aware policy trained with reinforcement learning. This design decomposes global reasoning into reusable local decisions, enabling efficient discovery of query-relevant nodes while tightly controlling expansion cost. We show theoretical limitations of dense retrieval on compositional graph queries, and establish advantages of SeedER from both compositional generalization and graph-constrained submodular optimization perspectives. Empirically, SeedER substantially improves recall with compact candidate sets over strong dense and graph-augmented baselines, making it an effective first-stage retriever for knowledge-intensive reasoning systems.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Knowledge Graph Retrieval | STARK-PRIME | H@131 | 25 | |
| Knowledge Graph Retrieval | STARK AMAZON | H@140.6 | 25 | |
| Knowledge Graph Retrieval | STARK MAG | Hits@139.7 | 11 | |
| Retrieval | STARK-PRIME | Hit@119.9 | 9 | |
| Retrieval | STARK MAG | Hit Rate @ 124.4 | 9 | |
| Retrieval | STARK AMAZON | Hit@10.319 | 9 |