Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FAMOSE: A ReAct Approach to Automated Feature Discovery

About

Feature engineering remains a critical yet challenging bottleneck in machine learning, particularly for tabular data, as identifying optimal features from an exponentially large feature space traditionally demands substantial domain expertise. To address this challenge, we introduce FAMOSE (Feature AugMentation and Optimal Selection agEnt), a novel framework that leverages the ReAct paradigm to autonomously explore, generate, and refine features while integrating feature selection and evaluation tools within an agent architecture. To our knowledge, FAMOSE represents the first application of an agentic ReAct framework to automated feature engineering, especially for both regression and classification tasks. Extensive experiments demonstrate that FAMOSE is at or near the state-of-the-art on classification tasks (especially tasks with more than 10K instances, where ROC-AUC increases 0.23% on average), and achieves the state-of-the-art for regression tasks by reducing RMSE by 2.0% on average, while remaining more robust to errors than other algorithms. We hypothesize that FAMOSE's strong performance is because ReAct allows the LLM context window to record (via iterative feature discovery and evaluation steps) what features did or did not work. This is similar to a few-shot prompt and guides the LLM to invent better, more innovative features. Our work offers evidence that AI agents are remarkably effective in solving problems that require highly inventive solutions, such as feature engineering.

Keith Burghardt, Jienan Liu, Sadman Sakib, Yuning Hao, Bo Li• 2026

Related benchmarks

TaskDatasetResultRank
Classificationbank-marketing
ROC AUC0.928
19
Classificationtic-tac-toe
ROC-AUC100
15
ClassificationBalance Scale
ROC AUC1
14
ClassificationCredit-g
ROC AUC0.757
14
Classificationvehicle
ROC AUC91.8
14
ClassificationAdult
ROC-AUC0.929
13
Classificationbreast-w
ROC-AUC0.989
13
ClassificationDiabetes
ROC AUC0.705
11
Binary ClassificationJungle Chess
AUC0.994
10
Classificationblood
ROC-AUC0.704
6
Showing 10 of 19 rows

Other info

Follow for update