Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TabPFN-2.5: Advancing the State of the Art in Tabular Foundation Models

About

The first tabular foundation model, TabPFN, and its successor TabPFNv2 have impacted tabular AI substantially, with dozens of methods building on it and hundreds of applications across different use cases. This report introduces TabPFN-2.5, the next generation of our tabular foundation model, built for datasets with up to 50,000 data points and 2,000 features, a 20x increase in data cells compared to TabPFNv2. TabPFN-2.5 is now the leading method for the industry standard benchmark TabArena (which contains datasets with up to 100,000 training data points), substantially outperforming tuned tree-based models and matching the accuracy of AutoGluon 1.4, a complex four-hour tuned ensemble that even includes the previous TabPFNv2. Remarkably, default TabPFN-2.5 has a 100% win rate against default XGBoost on small to medium-sized classification datasets (<=10,000 data points, 500 features) and a 87% win rate on larger datasets up to 100K samples and 2K features (85% for regression). For production use cases, we introduce a new distillation engine that converts TabPFN-2.5 into a compact MLP or tree ensemble, preserving most of its accuracy while delivering orders-of-magnitude lower latency and plug-and-play deployment. This new release will immediately strengthen the performance of the many applications and methods already built on the TabPFN ecosystem.

L\'eo Grinsztajn, Klemens Fl\"oge, Oscar Key, Felix Birkel, Philipp Jund, Brendan Roof, Benjamin J\"ager, Dominik Safaric, Simone Alessi, Adrian Hayler, Mihir Manium, Rosen Yu, Felix Jablonski, Shi Bin Hoo, Anurag Garg, Jake Robertson, Magnus B\"uhler, Vladyslav Moroshan, Lennart Purucker, Clara Cornu, Lilly Charlotte Wehrhahn, Alessandro Bonetto, Bernhard Sch\"olkopf, Sauraj Gambhir, Noah Hollmann, Frank Hutter• 2025

Related benchmarks

TaskDatasetResultRank
User Clicks Predictionrel-avito
ROC-AUC61.31
84
User Engagement Predictionrel-stack
ROC-AUC82.54
69
Driver DNF Predictionrel-f1
ROC-AUC0.7173
54
Item Churn Predictionrel-amazon
ROC-AUC77.27
54
Driver Top 3 Predictionrel-f1
ROC-AUC80.56
54
User Churn PredictionAmazon Rel
ROC-AUC0.631
54
Study Outcome Predictionrel (trial)
ROC-AUC0.5913
52
User Churn Predictionrel-hm
ROC-AUC64.68
52
User Ignore PredictionRel Event
ROC-AUC0.8191
50
User Repeat PredictionRel Event
ROC-AUC67.32
50
Showing 10 of 40 rows

Other info

Follow for update