Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization

About

Human cognitive behavior arises from the interaction of specialized brain networks dedicated to distinct functions, such as language, logic, and social reasoning. Inspired by this organization, we propose Mixture of Cognitive Reasoners (MiCRo): a modular, transformer-based architecture post-trained with a curriculum that induces functional specialization across experts. Concretely, we partition the layers of a pretrained language model into four expert modules aligned with well-studied cognitive networks in the human brain. MiCRo offers three key advantages over standard language models. (1) The specialized experts are interpretable and causally meaningful -- ablating a module causes substantial drops on benchmarks requiring its specialized domain. (2) MiCRo's behavior can be dynamically steered at inference time by routing tokens to particular experts (e.g., favoring social over logical reasoning), enabling fine-grained control over outputs. (3) MiCRo outperforms or matches comparable baselines on both machine-learning reasoning benchmarks (e.g., GSM8K, BBH) and alignment to human behavior (CogBench), while maintaining interpretability. Taken together, cognitively grounded functional specialization yields models that are both more human-like and more human-interpretable.

Badr AlKhamissi, C. Nicol\`o De Sabbata, Greta Tuckute, Zeming Chen, Martin Schrimpf, Antoine Bosselut• 2025

Related benchmarks

TaskDatasetResultRank
Physical Commonsense ReasoningPIQA
Accuracy76.6
572
Commonsense ReasoningHellaSwag
HellaSwag Accuracy67.4
350
Mathematical ReasoningMinerva Math
Accuracy13
186
Science Question AnsweringARC Easy
Accuracy73
155
Multi-task Language UnderstandingMMLU
MMLU Accuracy45.4
59
ReasoningBig-Bench Hard (BBH)
Accuracy42
33
Multi-task Language UnderstandingMMLU-Pro
Accuracy19
28
Science Question AnsweringARC Challenge
Accuracy43.3
17
Showing 8 of 8 rows

Other info

Follow for update