
The Scientific Contribution Graph: Automated Literature-based Technological Roadmapping at Scale

About

Scientific contributions rarely develop in isolation, but instead build upon prior discoveries. We formulate the task of automated technological roadmapping as extracting scientific contributions from scholarly articles and linking them to their prerequisites. We present the Scientific Contribution Graph, a large-scale AI/NLP-domain resource containing 2 million detailed scientific contributions extracted from 230k open-access papers and connected by 12.5 million prerequisite edges. We further introduce scientific prerequisite prediction, a scientific discovery task in which models predict which existing technologies can enable future discoveries, and show that contemporary models are rapidly improving on this task, reaching 0.48 MAP when evaluated using temporally filtered backtesting. We anticipate technological roadmapping resources such as this will support scientific impact assessment and automated scientific discovery.
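The MAP figure quoted above is a ranking metric, so a short illustration may help. The following is a minimal sketch of mean average precision over ranked prerequisite predictions, not the paper's evaluation code. The function names, the contribution identifiers, and the toy data are all hypothetical, and the temporal filtering used in the backtesting setup is not modeled here.

```python
from typing import Dict, List, Set


def average_precision(ranked: List[str], relevant: Set[str]) -> float:
    """Average precision of one ranked candidate list against a gold set."""
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, item in enumerate(ranked, start=1):
        if item in relevant:
            hits += 1
            # Precision at this rank, counted only where a gold item appears.
            precision_sum += hits / rank
    return precision_sum / len(relevant)


def mean_average_precision(
    predictions: Dict[str, List[str]], gold: Dict[str, Set[str]]
) -> float:
    """Mean of per-query average precision over every query in `gold`."""
    scores = [
        average_precision(predictions.get(query, []), relevant)
        for query, relevant in gold.items()
    ]
    return sum(scores) / len(scores) if scores else 0.0


# Hypothetical toy data: each key is a target contribution, each list is a
# model's ranked guess at its prerequisites.
predictions = {"c1": ["a", "b", "c"], "c2": ["x", "y"]}
gold = {"c1": {"a", "c"}, "c2": {"y"}}

map_score = mean_average_precision(predictions, gold)
print(round(map_score, 4))
```

In this toy case the first query scores (1/1 + 2/3) / 2 and the second scores 1/2, so the mean is 2/3.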

Peter A. Jansen • 2026

Related benchmarks

Task | Dataset | Result | Rank
Technological requirement identification | Scientific Contribution Graph 1.0 (entire set) | -- | 10
Technological requirement identification | Scientific Contribution Graph 1.0 (pre-cutoff) | -- | 9
Technological requirement identification | Scientific Contribution Graph 1.0 (post-cutoff) | -- | 9
Seq2Seq contribution generation on full text | SCI. CONT. GRAPH | Number of Nodes: 2 | 1
Span/relation labeling | SciERC | -- | 1
Span/relation labeling | SciREX | -- | 1
Span/relation labeling | NLPCONT | -- | 1
Span/relation labeling | SCINLP-KG | -- | 1
Span/relation labeling | SCICLAIM | -- | 1
Span/relation labeling | CS-KG V2 | -- | 1