Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Large Language Models Can Automatically Engineer Features for Few-Shot Tabular Learning

About

Large Language Models (LLMs), with their remarkable ability to tackle challenging and unseen reasoning problems, hold immense potential for tabular learning, that is vital for many real-world applications. In this paper, we propose a novel in-context learning framework, FeatLLM, which employs LLMs as feature engineers to produce an input data set that is optimally suited for tabular predictions. The generated features are used to infer class likelihood with a simple downstream machine learning model, such as linear regression and yields high performance few-shot learning. The proposed FeatLLM framework only uses this simple predictive model with the discovered features at inference time. Compared to existing LLM-based approaches, FeatLLM eliminates the need to send queries to the LLM for each sample at inference time. Moreover, it merely requires API-level access to LLMs, and overcomes prompt size limitations. As demonstrated across numerous tabular datasets from a wide range of domains, FeatLLM generates high-quality rules, significantly (10% on average) outperforming alternatives such as TabLLM and STUNT.

Sungwon Han, Jinsung Yoon, Sercan O Arik, Tomas Pfister• 2024

Related benchmarks

TaskDatasetResultRank
ClassificationGerman Credit UCIrvine
Macro F176.35
25
RegressionAirfoil UCIrvine
1-RAE0.5877
24
RegressionOpenml_586
1-RAE0.6477
24
Classificationbank-marketing
ROC AUC0.738
19
ClassificationAmazon Employee Kaggle (5-fold cross-validation)
Macro F193.62
17
ClassificationPimaIndian Kaggle (5-fold cross-validation)
Macro F1 Score89.66
17
ClassificationGerman Credit UCIrvine (5-fold cross-val)
Macro F10.7635
17
ClassificationIonosphere UCIrvine (5-fold cross-validation)
Macro F1 Score95.38
17
ClassificationMessidor Feature UCIrvine (5-fold cross-validation)
Macro F10.7262
17
ClassificationSVMGuide3 LibSVM (5-fold cross-val)
Macro F181.17
17
Showing 10 of 41 rows

Other info

Follow for update