Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Task Scarcity and Label Leakage in Relational Transfer Learning

About

Training relational foundation models requires learning representations that transfer across tasks, yet available supervision is typically limited to a small number of prediction targets per database. This task scarcity causes learned representations to encode task-specific shortcuts that degrade transfer even within the same schema, a problem we call label leakage. We study this using K-Space, a modular architecture combining frozen pretrained tabular encoders with a lightweight message-passing core. To suppress leakage, we introduce a gradient projection method that removes label-predictive directions from representation updates. On RelBench, this improves within-dataset transfer by +0.145 AUROC on average, often recovering near single-task performance. Our results suggest that limited task diversity, not just limited data, constrains relational foundation models.

Francisco Galuppo Azevedo, Clarissa Lima Loures, Denis Oliveira Correa• 2026

Related benchmarks

TaskDatasetResultRank
User Clicks Predictionrel-avito
ROC-AUC64.6
84
User Engagement Predictionrel-stack
ROC-AUC84.4
69
Driver DNF Predictionrel-f1
ROC-AUC0.735
54
Driver Top 3 Predictionrel-f1
ROC-AUC82.9
54
Item Churn Predictionrel-amazon
ROC-AUC77.8
54
User Churn PredictionAmazon Rel
ROC-AUC0.644
54
Study Outcome Predictionrel (trial)
ROC-AUC0.651
52
User Churn Predictionrel-hm
ROC-AUC67.4
52
User Ignore PredictionRel Event
ROC-AUC0.858
50
User Repeat PredictionRel Event
ROC-AUC73.6
50
Showing 10 of 12 rows

Other info

Follow for update