Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RDBLearn: Simple In-Context Prediction Over Relational Databases

About

Recent advances in tabular in-context learning (ICL) show that a single pretrained model can adapt to new prediction tasks from a small set of labeled examples, avoiding per-task training and heavy tuning. However, many real-world tasks live in relational databases, where predictive signal is spread across multiple linked tables rather than a single flat table. We show that tabular ICL can be extended to relational prediction with a simple recipe: automatically featurize each target row using relational aggregations over its linked records, materialize the resulting augmented table, and run an off-the-shelf tabular foundation model on it. We package this approach in \textit{RDBLearn} (https://github.com/HKUSHXLab/rdblearn), an easy-to-use toolkit with a scikit-learn-style estimator interface that makes it straightforward to swap different tabular ICL backends; a complementary agent-specific interface is provided as well. Across a broad collection of RelBench and 4DBInfer datasets, RDBLearn is the best-performing foundation model approach we evaluate, at times even outperforming strong supervised baselines trained or fine-tuned on each dataset.

Yanlin Zhang, Linjie Xu, Quan Gan, David Wipf, Minjie Wang• 2026

Related benchmarks

TaskDatasetResultRank
Entity RegressionRelBench V1
F1 Positive Error (MAE)3.834
11
Entity ClassificationRelBench V1
DNF Score71.72
11
Showing 2 of 2 rows

Other info

Follow for update