Pion: A Spectrum-Preserving Optimizer via Orthogonal Equivalence Transformation
About
We introduce Pion, a spectrum-preserving optimizer for large language model (LLM) training based on orthogonal equivalence transformation. Unlike additive optimizers such as Adam and Muon, Pion updates each weight matrix through left and right orthogonal transformations, preserving its singular values throughout training. This yields an optimization mechanism that modulates the geometry of weight matrices while keeping their spectral norm fixed. We derive the Pion update rule, systematically examine its design choices, and analyze its convergence behavior along with several key properties. Empirical results show that Pion offers a stable and competitive alternative to standard optimizers for both LLM pretraining and finetuning.
Kexuan Shi, Hanxuan Li, Zeju Qiu, Yandong Wen, Simon Buchholz, Weiyang Liu• 2026
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Language Modeling | C4 (val) | -- | 737 | |
| Question Answering | ARC Challenge | Accuracy (ARC)26.79 | 598 | |
| Commonsense Reasoning | WinoGrande | Accuracy53.59 | 453 | |
| Question Answering | ARC Easy | Accuracy49.41 | 210 | |
| Science Question Answering | SciQ | Accuracy (SciQ)73.4 | 101 | |
| Physical Commonsense Reasoning | PIQA | Accuracy (PIQA)71.27 | 99 | |
| Boolean Question Answering | BoolQ | Accuracy57.58 | 57 | |
| Mathematical Reasoning | Olympiad Bench | Pass@846.43 | 30 | |
| Code Generation | HumanEval In-Domain | Accuracy53.05 | 8 | |
| Mathematical Reasoning | AIME25 | Avg@3224.38 | 8 |
Showing 10 of 14 rows