Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ArtifactLinker: Linking Scientific Artifacts for Automatic State-of-the-Art Discovery

About

Scientific artifacts such as models and datasets are foundations for research. With the rapid growth of platforms like HuggingFace, researchers now have access to a large number of artifacts. Yet, a key challenge remains: how can we automatically discover the state-of-the-art (SOTA) model for a given dataset by fully leveraging existing artifacts? We formalize this task as automatic SOTA discovery by modeling HuggingFace as an artifact graph, where nodes are models/datasets and edges represent evaluations. We propose ArtifactLinker, a two-stage framework: (1) ranking promising unobserved model--dataset links using Graph Neural Networks (GNNs) or graph-augmented Large Language Models (LLMs), and (2) verifying top-ranked links via coding experiments with LLM-based agents. We further introduce a benchmark named ArtifactBench with 14,053 artifacts and 51,337 relations to evaluate the performance of both stages. Results show that (1) graph structures between existing artifacts are effective for missing link prediction; (2) end-to-end ranking and verification with ArtifactLinker help discover potential SOTA results and research insights.

Haofei Yu, Jiaxuan You, Peter Clark, Bodhisattwa Prasad Majumder, Kyle Richardson• 2026

Related benchmarks

TaskDatasetResultRank
Attribute PredictionARTIFACTBENCH (transductive)--
12
Attribute PredictionARTIFACTBENCH (inductive)--
12
Attribute RankingARTIFACTBENCH (transductive)--
12
Attribute RankingARTIFACTBENCH (inductive)--
12
Link RankingARTIFACTBENCH transductive 1.0 (val test)--
10
Link RankingARTIFACTBENCH inductive 1.0 (val test)--
10
Link PredictionARTIFACTBENCH transductive 1.0 (val test)--
8
Link PredictionARTIFACTBENCH inductive 1.0 (val test)--
8
Showing 8 of 8 rows

Other info

Follow for update