Universal Reasoning Model

About

Universal transformers (UTs) have been widely used for complex reasoning tasks such as ARC-AGI and Sudoku, yet the specific sources of their performance gains remain underexplored. In this work, we systematically analyze UT variants and show that improvements on ARC-AGI primarily arise from the recurrent inductive bias and strong nonlinear components of the Transformer, rather than from elaborate architectural designs. Motivated by this finding, we propose the Universal Reasoning Model (URM), which enhances the UT with short convolution and truncated backpropagation. Our approach substantially improves reasoning performance, achieving state-of-the-art 53.8% pass@1 on ARC-AGI 1 and 16.0% pass@1 on ARC-AGI 2. Our code is available at https://github.com/UbiquantAI/URM.

Zitian Gao, Lynx Chen, Yihao Xiao, He Xing, Ran Tao, Haoming Luo, Joey Zhou, Bryan Dai • 2025

Related benchmarks

Task	Dataset	Result
Sudoku Solving	Sudoku-Extreme (test)	Accuracy 77.6	31
Puzzle Solving	Sudoku-Extreme (test)	Pass@1 Success Rate 77.6	9
Maze	Maze-Unique (test)	Exact Accuracy 51.4	7
Abstraction and Reasoning	ARC-1 (test)	--	4
Abstraction and Reasoning	ARC 2 (test)	--	4

