Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AutoAdapt: An Automated Domain Adaptation Framework for LLMs

About

Large language models (LLMs) excel in open domains but struggle in specialized settings with limited data and evolving knowledge. Existing domain adaptation practices rely heavily on manual trial-and-error processes, incur significant hyperparameter complexity, and are highly sensitive to data and user preferences, all under the high cost of LLM training. Moreover, the interactions and transferability of hyperparameter choices across models/domains remain poorly understood, making adaptation gains uncertain even with substantial effort. To solve these challenges, we present AutoAdapt, a novel end-to-end automated framework for efficient and reliable LLM domain adaptation. AutoAdapt leverages curated knowledge bases from literature and open-source resources to reduce expert intervention. To narrow the search space, we design a novel multi-agent debating system in which proposal and critic agents iteratively interact to align user intent and incorporate data signals and best practices into the planning process. To optimize hyperparameters under tight budgets, we propose AutoRefine, a novel LLM-based surrogate that replaces costly black-box search. Across 10 tasks, AutoAdapt achieves a 25% average relative accuracy improvement over state-of-the-art Automated Machine Learning baselines with minimal overhead.

Sidharth Sinha, Anson Bastos, Xuchao Zhang, Akshay Nambi, Chetan Bansal, Saravan Rajmohan• 2026

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningMATH
Accuracy27.8
882
Code GenerationMBPP+
Accuracy68.3
104
Science Question AnsweringARC
ARC Accuracy83.79
46
Medical Question AnsweringMedQA
Accuracy61.27
40
Legal ReasoningCaseHOLD (test)
Test Accuracy89.22
22
Legal ReasoningCaseHold
Cumulative Score (CS)96
8
Mathematical ReasoningUC Berkeley MATH
Cumulative Score (CS)58
8
Temporal ReasoningWhen2Call
Performance Score100
8
Medical Question AnsweringMedQA
Cumulative Score (CS)82
8
Professional WritingPW
Accuracy67.69
5
Showing 10 of 29 rows

Other info

Follow for update