ArtifactLinker: Linking Scientific Artifacts for Automatic State-of-the-Art Discovery

About

Scientific artifacts such as models and datasets are foundations for research. With the rapid growth of platforms like HuggingFace, researchers now have access to a large number of artifacts. Yet, a key challenge remains: how can we automatically discover the state-of-the-art (SOTA) model for a given dataset by fully leveraging existing artifacts? We formalize this task as automatic SOTA discovery by modeling HuggingFace as an artifact graph, where nodes are models/datasets and edges represent evaluations. We propose ArtifactLinker, a two-stage framework: (1) ranking promising unobserved model--dataset links using Graph Neural Networks (GNNs) or graph-augmented Large Language Models (LLMs), and (2) verifying top-ranked links via coding experiments with LLM-based agents. We further introduce a benchmark named ArtifactBench with 14,053 artifacts and 51,337 relations to evaluate the performance of both stages. Results show that (1) graph structures between existing artifacts are effective for missing link prediction; (2) end-to-end ranking and verification with ArtifactLinker help discover potential SOTA results and research insights.

Haofei Yu, Jiaxuan You, Peter Clark, Bodhisattwa Prasad Majumder, Kyle Richardson• 2026

Related benchmarks

Task	Dataset	Result
Attribute Prediction	ARTIFACTBENCH (transductive)	--	12
Attribute Prediction	ARTIFACTBENCH (inductive)	--	12
Attribute Ranking	ARTIFACTBENCH (transductive)	--	12
Attribute Ranking	ARTIFACTBENCH (inductive)	--	12
Link Ranking	ARTIFACTBENCH transductive 1.0 (val test)	--	10
Link Ranking	ARTIFACTBENCH inductive 1.0 (val test)	--	10
Link Prediction	ARTIFACTBENCH transductive 1.0 (val test)	--	8
Link Prediction	ARTIFACTBENCH inductive 1.0 (val test)	--	8

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord