Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LARGER: Lexically Anchored Repository Graph Exploration and Retrieval

About

Repository-level coding agents must first localize the files and symbols relevant to a task; failures at this stage can cascade across downstream objectives ranging from patch generation to test writing and codebase question answering. Existing agents navigate repositories primarily through lexical search, often missing structural relations such as imports, call chains, type hierarchies, and code-test links. Graph-based retrieval can recover such dependencies, but existing approaches often require separate graph tools or traversal stages that fragment the agent's interaction loop. We formalize repository context localization as Lexically Anchored Structural Localization, where success depends on turning lexical matches into high-precision structural entry points and exposing the most useful confidence-filtered local neighborhoods within the agent's existing search loop. We introduce LARGER (Lexically Anchored Repository Graph Exploration and Retrieval), a lexically anchored active-set retrieval framework that starts from lexical matches, aligns them to graph anchors, and performs confidence-filtered local expansion within the agent's existing search loop. LARGER integrates directly into existing CLI coding agents without requiring external graph databases or specialized graph interfaces. Across four benchmarks spanning localization, test generation, and codebase understanding, LARGER improves file-level Acc@5 on LocBench by +13.9 points with tuned hyperparameters and still gains +11.8 points with fixed hyperparameters over the strongest baseline, while delivering consistent gains on MuLocBench, SWE-Atlas Test Writing, and SWE-Atlas Codebase QA.

Yuntong Hu, Tongli Su, Liang Zhao, Bowen Zhu, Hasibul Haque• 2026

Related benchmarks

TaskDatasetResultRank
File-level LocalizationLocBench
Top-1 Accuracy77.1
11
File-level LocalizationMuLocBench
Top-1 Accuracy28
11
Code LocalizationLocBench V1
Mean Time (s)60
10
Code LocalizationMuLocBench
Mean Latency (s)99.9
10
Code LocalizationMuLocBench
Cost/Instance0.44
7
Codebase QASWE-Atlas
Score32.25
4
Test WritingSWE-Atlas
Score37.78
4
Showing 7 of 7 rows

Other info

Follow for update