Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

What Papers Don't Tell You: Recovering Tacit Knowledge for Automated Paper Reproduction

About

Automated paper reproduction -- generating executable code from academic papers -- is bottlenecked not by information retrieval but by the tacit knowledge that papers inevitably leave implicit. We formalize this challenge as the progressive recovery of three types of tacit knowledge -- relational, somatic, and collective -- and propose \method, a graph-based agent framework with a dedicated mechanism for each: node-level relation-aware aggregation recovers relational knowledge by analyzing implementation-unit-level reuse and adaptation relationships between the target paper and its citation neighbors; execution-feedback refinement recovers somatic knowledge through iterative debugging driven by runtime signals; and graph-level knowledge induction distills collective knowledge from clusters of papers sharing similar implementations. On an extended ReproduceBench spanning 3 domains, 10 tasks, and 40 recent papers, \method{} achieves an average performance gap of 10.04\% against official implementations, improving over the strongest baseline by 24.68\%. The code will be publicly released upon acceptance; the repository link will be provided in the final version.

Lehui Li, Ruining Wang, Haochen Song, Yaoxin Mao, Tong Zhang, Yuyao Wang, Jiayi Fan, Yitong Zhang, Jieping Ye, Chengqi Zhang, Yongshun Gong• 2026

Related benchmarks

TaskDatasetResultRank
General Graph LearningGeneralGL
Performance Gap4.66
6
General RecommendationGeneralRec
Performance Gap8.77
6
Graph Structure LearningGSL
Performance Gap17.86
6
Long-term time-series forecastingLongTerm
Performance Gap3.27
6
Multimodal RecommendationMMRec
Performance Gap20.17
6
Noisy Graph LearningNoisyGL
Performance Gap5.79
6
Sequential RecommendationSeqRec
Performance Gap3.38
6
Short-term Time Series ForecastingShortTerm
Performance Gap6.01
6
Time Series Anomaly DetectionAnomalyDetection
Performance Gap25.44
6
Time-series classificationClassification
Performance Gap5.03
6
Showing 10 of 11 rows

Other info

Follow for update