Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mambular: A Sequential Model for Tabular Deep Learning

About

The analysis of tabular data has traditionally been dominated by gradient-boosted decision trees (GBDTs), known for their proficiency with mixed categorical and numerical features. However, recent deep learning innovations are challenging this dominance. This paper investigates the use of autoregressive state-space models for tabular data and compares their performance against established benchmark models. Additionally, we explore various adaptations of these models, including different pooling strategies, feature interaction mechanisms, and bi-directional processing techniques to understand their effectiveness for tabular data. Our findings indicate that interpreting features as a sequence and processing them and their interactions through structured state-space layers can lead to significant performance improvement. This research underscores the versatility of autoregressive models in tabular data analysis, positioning them as a promising alternative that could substantially enhance deep learning capabilities in this traditionally challenging area. The source code is available at https://github.com/basf/mamba-tabular.

Anton Frederik Thielmann, Manish Kumar, Christoph Weisser, Arik Reuter, Benjamin S\"afken, Soheila Samiee• 2024

Related benchmarks

TaskDatasetResultRank
ClassificationLung
ACC89.9
96
ClassificationAdult
Accuracy81.8
86
ClassificationTOX_171
Accuracy84.95
78
ClassificationColon
Accuracy83.55
78
ClassificationGLI_85
Accuracy46.92
78
ClassificationALLAML
Accuracy60.8
72
ClassificationSMK_CAN_187
Accuracy32.9
72
ClassificationHDLSS Datasets Summary
Average Rank23.12
66
ClassificationProstate_GE
Accuracy81.12
64
ClassificationARCENE
Accuracy69.65
60
Showing 10 of 21 rows

Other info

Follow for update