Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Language models are weak learners

About

A central notion in practical and theoretical machine learning is that of a $\textit{weak learner}$, classifiers that achieve better-than-random performance (on any given distribution over data), even by a small margin. Such weak learners form the practical basis for canonical machine learning methods such as boosting. In this work, we illustrate that prompt-based large language models can operate effectively as said weak learners. Specifically, we illustrate the use of a large language model (LLM) as a weak learner in a boosting algorithm applied to tabular data. We show that by providing (properly sampled according to the distribution of interest) text descriptions of tabular data samples, LLMs can produce a summary of the samples that serves as a template for classification and achieves the aim of acting as a weak learner on this task. We incorporate these models into a boosting approach, which in some settings can leverage the knowledge within the LLM to outperform traditional tree-based boosting. The model outperforms both few-shot learning and occasionally even more involved fine-tuning procedures, particularly for tasks involving small numbers of data points. The results illustrate the potential for prompt-based LLMs to function not just as few-shot learners themselves, but as components of larger machine learning pipelines.

Hariharan Manikandan, Yiding Jiang, J Zico Kolter• 2023

Related benchmarks

TaskDatasetResultRank
Tabular Classificationdiabetes 37 (test)
Test Error34.4
15
Tabular Classificationwholesale-customers (test)
Test Error0.33
10
Tabular Classificationcaesarian 42901 (test)
Test Error30
9
Tabular Classificationhaberman-survival (test)
Error Rate0.25
9
Tabular Classificationvisualizing-hamster 708 (test)
Test Error0.207
9
Tabular Classificationtae 48 (test)
Error Rate45.4
9
Tabular Classificationsomerville-happiness-survey (test)
Test Error0.35
9
Tabular Classificationblood-transfusion-center (1464) (test)
Test Error24
9
Tabular Classificationvehicle 54 (test)
Test Error (%)41
9
Tabular Classificationvertebra-column 1524 (test)
Test Error0.262
9
Showing 10 of 23 rows

Other info

Follow for update