Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Skill-Based Mixture-of-Experts: Adaptive Routing for Heterogeneous Reasoning via Inferred Skills

About

Combining existing pre-trained LLMs is a promising approach for diverse reasoning tasks. However, task-level expert selection is often too coarse-grained, since different instances may require different expertise. To address this, we propose Skill-MoE, a symbolic, skill-based, and gradient-free Mixture-of-Experts framework for instance-level expert selection. Skill-MoE infers skills (e.g., algebra in mathematics) from each query, selects experts based on skill relevance, and lets each expert generate its own reasoning. The resulting k outputs are then synthesized by an aggregator chosen for its ability to integrate diverse responses. While instance-level selection substantially improves performance, naively implementing it incurs heavy overhead from repeated model loading and offloading. We address this with a batch inference strategy that groups instances by assigned experts, allowing each model to be loaded only once. As a result, Skill-MoE integrates 16 expert models on a single GPU with runtime comparable to prior multi-agent baselines using 4 GPUs. Across diverse benchmarks (MMLU-Pro, GPQA, AIME, and MedMCQA), Skill-MoE achieves an average absolute improvement of 8.15% over the best baseline. It also generalizes well to unseen tasks and outperforms discussion-based methods without requiring expensive multi-round interactions.

Justin Chih-Yao Chen, Sukwon Yun, Elias Stengel-Eskin, Tianlong Chen, Mohit Bansal• 2025

Related benchmarks

TaskDatasetResultRank
Medical Question AnsweringMedMCQA
Accuracy59.35
521
ReasoningMMLU-Pro
Accuracy80.6
241
ReasoningGPQA Diamond
Accuracy62.63
185
Mathematical ReasoningOmni-MATH
Accuracy52.03
123
Instruction FollowingIFEval
Accuracy (IFEval)89
89
Mathematical Problem SolvingMATH500
Accuracy90.4
83
Graduate-level Science Question AnsweringGPQA
Accuracy (GPQA)57.78
72
Medical ReasoningMedMCQA
Accuracy74.88
58
Mathematical Problem SolvingAIME 2024
Top-1 Accuracy50
54
General Multi-domain ReasoningOverall (AIME, MMLU-Pro, MedMCQA, GPQA)
Average Score62.43
12
Showing 10 of 12 rows

Other info

Follow for update