RDBLearn: Simple In-Context Prediction Over Relational Databases

About

Recent advances in tabular in-context learning (ICL) show that a single pretrained model can adapt to new prediction tasks from a small set of labeled examples, avoiding per-task training and heavy tuning. However, many real-world tasks live in relational databases, where predictive signal is spread across multiple linked tables rather than a single flat table. We show that tabular ICL can be extended to relational prediction with a simple recipe: automatically featurize each target row using relational aggregations over its linked records, materialize the resulting augmented table, and run an off-the-shelf tabular foundation model on it. We package this approach in \textit{RDBLearn} (https://github.com/HKUSHXLab/rdblearn), an easy-to-use toolkit with a scikit-learn-style estimator interface that makes it straightforward to swap different tabular ICL backends; a complementary agent-specific interface is provided as well. Across a broad collection of RelBench and 4DBInfer datasets, RDBLearn is the best-performing foundation model approach we evaluate, at times even outperforming strong supervised baselines trained or fine-tuned on each dataset.

Yanlin Zhang, Linjie Xu, Quan Gan, David Wipf, Minjie Wang• 2026

Related benchmarks

Task	Dataset	Result	Rank
Entity Regression	RelBench V1	Avito CTR Error (MAE)0.034		26
Entity Classification	RelBench V1	DNF Score71.72		11

Showing 2 of 2 rows

Other info

Follow for update

@wizwand_team Discord