What Papers Don't Tell You: Recovering Tacit Knowledge for Automated Paper Reproduction
About
Automated paper reproduction -- generating executable code from academic papers -- is bottlenecked not by information retrieval but by the tacit knowledge that papers inevitably leave implicit. We formalize this challenge as the progressive recovery of three types of tacit knowledge -- relational, somatic, and collective -- and propose \method, a graph-based agent framework with a dedicated mechanism for each: node-level relation-aware aggregation recovers relational knowledge by analyzing implementation-unit-level reuse and adaptation relationships between the target paper and its citation neighbors; execution-feedback refinement recovers somatic knowledge through iterative debugging driven by runtime signals; and graph-level knowledge induction distills collective knowledge from clusters of papers sharing similar implementations. On an extended ReproduceBench spanning 3 domains, 10 tasks, and 40 recent papers, \method{} achieves an average performance gap of 10.04\% against official implementations, improving over the strongest baseline by 24.68\%. The code will be publicly released upon acceptance; the repository link will be provided in the final version.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| General Graph Learning | GeneralGL | Performance Gap4.66 | 6 | |
| General Recommendation | GeneralRec | Performance Gap8.77 | 6 | |
| Graph Structure Learning | GSL | Performance Gap17.86 | 6 | |
| Long-term time-series forecasting | LongTerm | Performance Gap3.27 | 6 | |
| Multimodal Recommendation | MMRec | Performance Gap20.17 | 6 | |
| Noisy Graph Learning | NoisyGL | Performance Gap5.79 | 6 | |
| Sequential Recommendation | SeqRec | Performance Gap3.38 | 6 | |
| Short-term Time Series Forecasting | ShortTerm | Performance Gap6.01 | 6 | |
| Time Series Anomaly Detection | AnomalyDetection | Performance Gap25.44 | 6 | |
| Time-series classification | Classification | Performance Gap5.03 | 6 |