KumoRFM-2: Scaling Foundation Models for Relational Learning
About
We introduce KumoRFM-2, the next iteration of a pre-trained foundation model for relational data. KumoRFM-2 supports in-context learning as well as fine-tuning and is applicable to a wide range of predictive tasks. In contrast to tabular foundation models, KumoRFM-2 natively operates on relational data, processing one or more connected tables simultaneously without manual table flattening or target variable generation, all while preserving temporal consistency. KumoRFM-2 leverages a large corpus of synthetic and real-world data to pre-train across four axes: the row and column dimensions at the individual table level, and the foreign key and cross-sample dimensions at the database level. In contrast to its predecessor, KumoRFM-2 injects task information as early as possible, enabling sharper selection of task-relevant columns and improved robustness to noisy data. Through extensive experiments on 41 challenging benchmarks and analysis around expressivity and sensitivity, we demonstrate that KumoRFM-2 outperforms supervised and foundational approaches by up to 8%, while maintaining strong performance under extreme settings of cold start and noisy data. To our knowledge, this is the first time a few-shot foundation model has been shown to surpass supervised approaches on common benchmark tasks, with performance further improving upon fine-tuning. Finally, while KumoRFM-1 was limited to small-scale in-memory datasets, KumoRFM-2 scales to billion-scale relational datasets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Entity Regression | RelBench v1.0 (test) | LTV (Amazon Item)46.992 | 27 | |
| Binary Classification | RelBench 1.0 (test) | Relational Amazon User Churn72.03 | 26 | |
| Multi-class classification | SALT (test) | Accuracy (Plant)99 | 12 | |
| Binary Classification | 4DBInfer (test) | AUROC (AB Churn)0.7706 | 11 | |
| Regression | RelBench v2 (test) | MAE (RateBeer User-Count)7.298 | 6 | |
| Binary Classification | RelBench v2 (test) | AUROC (mimic stay)55.28 | 4 |