AutoAdapt: An Automated Domain Adaptation Framework for LLMs

About

Large language models (LLMs) excel in open domains but struggle in specialized settings with limited data and evolving knowledge. Existing domain adaptation practices rely heavily on manual trial-and-error processes, incur significant hyperparameter complexity, and are highly sensitive to data and user preferences, all under the high cost of LLM training. Moreover, the interactions and transferability of hyperparameter choices across models and domains remain poorly understood, making adaptation gains uncertain even with substantial effort. To address these challenges, we present AutoAdapt, a novel end-to-end automated framework for efficient and reliable LLM domain adaptation. AutoAdapt leverages curated knowledge bases from literature and open-source resources to reduce expert intervention. To narrow the search space, we design a novel multi-agent debating system in which proposal and critic agents iteratively interact to align user intent and incorporate data signals and best practices into the planning process. To optimize hyperparameters under tight budgets, we propose AutoRefine, a novel LLM-based surrogate that replaces costly black-box search. Across 10 tasks, AutoAdapt achieves a 25% average relative accuracy improvement over state-of-the-art automated machine learning (AutoML) baselines with minimal overhead.
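The abstract does not describe how the proposal and critic agents are implemented, so the following is only a minimal sketch of the general propose-and-critique pattern. Everything in it is an assumption made for illustration: the `Plan` fields, the rule-based `propose` and `critique` functions (which stand in for LLM agents), the thresholds, and the `"lora"` placeholder value are hypothetical and are not taken from AutoAdapt.

```python
from dataclasses import dataclass


@dataclass
class Plan:
    """A candidate adaptation plan. Fields are hypothetical, for illustration."""
    method: str
    learning_rate: float
    epochs: int


def critique(plan, num_examples):
    """Hypothetical critic: return a list of objections to the plan.

    A real critic would be an LLM agent; simple hand-written rules stand in here.
    """
    feedback = []
    if plan.learning_rate > 2e-3:
        feedback.append("learning rate too high")
    if num_examples < 1000 and plan.epochs > 5:
        feedback.append("too many epochs")
    return feedback


def propose(plan, feedback):
    """Hypothetical proposer: revise the plan in response to the objections."""
    if "learning rate too high" in feedback:
        plan = Plan(plan.method, plan.learning_rate / 10, plan.epochs)
    if "too many epochs" in feedback:
        plan = Plan(plan.method, plan.learning_rate, max(1, plan.epochs // 2))
    return plan


def debate(initial, num_examples, max_rounds=5):
    """Alternate critique and revision until the critic has no objections."""
    plan = initial
    for _ in range(max_rounds):
        feedback = critique(plan, num_examples)
        if not feedback:
            break
        plan = propose(plan, feedback)
    return plan


# Illustrative starting point: an aggressive plan for a small dataset.
final = debate(Plan("lora", 1e-2, 20), num_examples=500)
print(final)
```

In this toy run the critic objects twice before accepting a plan with a lower learning rate and fewer epochs. The `max_rounds` cap bounds the cost of the loop, which matters when each round is an LLM call rather than a rule check.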

Sidharth Sinha, Anson Bastos, Xuchao Zhang, Akshay Nambi, Chetan Bansal, Saravan Rajmohan • 2026

Related benchmarks

Task	Dataset	Result
Mathematical Reasoning	MATH	Accuracy 27.8	882
Code Generation	MBPP+	Accuracy 68.3	243
Medical Question Answering	MedQA	Accuracy 61.27	179
Science Question Answering	ARC	ARC Accuracy 83.79	82
Legal Reasoning	CaseHOLD (test)	Test Accuracy 89.22	22
Legal Reasoning	CaseHold	Cumulative Score (CS) 96	8
Mathematical Reasoning	UC Berkeley MATH	Cumulative Score (CS) 58	8
Temporal Reasoning	When2Call	Performance Score 100	8
Medical Question Answering	MedQA	Cumulative Score (CS) 82	8
Professional Writing	PW	Accuracy 67.69	5

Showing 10 of 29 rows
