Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Scaling Latent Reasoning via Looped Language Models

About

Modern LLMs are trained to "think" primarily via explicit text generation, such as chain-of-thought (CoT), which defers reasoning to post-training and under-leverages pre-training data. We present and open-source Ouro, named after the recursive Ouroboros, a family of pre-trained Looped Language Models (LoopLM) that instead build reasoning into the pre-training phase through (i) iterative computation in latent space, (ii) an entropy-regularized objective for learned depth allocation, and (iii) scaling to 7.7T tokens. Ouro 1.4B and 2.6B models enjoy superior performance that match the results of up to 12B SOTA LLMs across a wide range of benchmarks. Through controlled experiments, we show this advantage stems not from increased knowledge capacity, but from superior knowledge manipulation capabilities. We also show that LoopLM yields reasoning traces more aligned with final outputs than explicit CoT. We hope our results show the potential of LoopLM as a novel scaling direction in the reasoning era. Our model is available here: http://ouro-llm.github.io.

Rui-Jie Zhu, Zixuan Wang, Kai Hua, Tianyu Zhang, Ziniu Li, Haoran Que, Boyi Wei, Zixin Wen, Fan Yin, He Xing, Lu Li, Jiajun Shi, Kaijing Ma, Shanda Li, Taylor Kergan, Andrew Smith, Xingwei Qu, Mude Hui, Bohong Wu, Qiyang Min, Hongzhi Huang, Xun Zhou, Wei Ye, Jiaheng Liu, Jian Yang, Yunfeng Shi, Chenghua Lin, Enduo Zhao, Tianle Cai, Ge Zhang, Wenhao Huang, Yoshua Bengio, Jason Eshraghian• 2025

Related benchmarks

TaskDatasetResultRank
Commonsense ReasoningHellaSwag
Accuracy83.68
1891
Code GenerationHumanEval--
1036
Language UnderstandingMMLU
Accuracy76.63
825
ReasoningBBH
Accuracy78.41
672
Code GenerationHumanEval+
Pass@170.7
383
Commonsense ReasoningWinoGrande
Accuracy77.74
372
Code GenerationMBPP+
Pass@166.6
216
Mathematical ReasoningAIME24
Accuracy16.7
160
Code GenerationMBPP
Accuracy (%)61.3
146
Mathematical ReasoningMATH 500
MATH 500 Accuracy71.6
106
Showing 10 of 22 rows

Other info

Follow for update