Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Parameter-Efficient Fine-Tuning of LLMs with Mixture of Space Experts

About

Large Language Models (LLMs) have achieved remarkable progress, with Parameter-Efficient Fine-Tuning (PEFT) emerging as a key technique for downstream task adaptation. However, existing PEFT methods mainly operate in Euclidean space, fundamentally limiting their capacity to capture complex geometric structures inherent in language data. While alternative geometric spaces, like hyperbolic geometries for hierarchical data and spherical manifolds for circular patterns, offer theoretical advantages, forcing representations into a single manifold type ultimately limits expressiveness, even when curvature parameters are learnable. To address this, we propose Mixture of Space (MoS), a unified framework that leverages multiple geometric spaces simultaneously to learn richer, curvature-aware representations. Building on this scheme, we develop MoSLoRA, which extends Low-Rank Adaptation (LoRA) with heterogeneous geometric experts, enabling models to dynamically select or combine appropriate geometric spaces based on input context. Furthermore, to address the computational overhead of frequent manifold switching, we develop a lightweight routing mechanism. Moreover, we provide empirical insights into how curvature optimization impacts training stability and model performance. Our experiments across diverse benchmarks demonstrate that MoSLoRA consistently outperforms strong baselines, achieving up to 5.6% improvement on MATH500 and 15.9% on MAWPS.

Buze Zhang, Jinkai Tao, Zilang Zeng, Neil He, Ali Maatouk, Menglin Yang, Rex Ying• 2026

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningMATH 500
Accuracy14.4
442
Mathematical ReasoningSVAMP
Accuracy73
403
Mathematical ReasoningGSM8K
Accuracy63.68
312
Mathematical ReasoningMAWPS
Accuracy81.73
234
Mathematical ReasoningAQUA
Accuracy31.5
146
Commonsense ReasoningCSQA
CSQA Accuracy82.47
126
Commonsense ReasoningOBQA
Accuracy89
117
Mathematical ReasoningMATH500
Accuracy18
82
Math ReasoningMATH500
Accuracy29.6
51
Arithmetic ReasoningAQUA
Accuracy39.37
31
Showing 10 of 14 rows

Other info

Follow for update