Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CyberCorrect: A Cybernetic Framework for Closed-Loop Self-Correction in Large Language Models

About

Large language model (LLM) self-correction -- the ability to detect and fix errors in generated outputs -- remains largely ad hoc, relying on generic prompts such as "please reconsider your answer" without systematic error analysis or convergence guarantees. We propose CyberCorrect, a framework that formalizes LLM self-correction as a closed-loop control system grounded in cybernetic theory. The framework models the LLM generator as the plant and introduces a tri-modal Error Detector (combining self-consistency, verbalized confidence, and logic-chain verification) as the sensor. A type-directed Correction Controller generates targeted repair instructions based on diagnosed error categories, while a Convergence Judge determines iteration termination using stability criteria adapted from control theory. We further introduce three control-theoretic evaluation metrics -- convergence rate, overshoot rate, and oscillation rate -- that capture correction dynamics beyond final accuracy. Experiments on our constructed CyberCorrect-Bench (440 reasoning tasks with annotated error types and correction paths) show that CyberCorrect achieves 79.8% final accuracy, improving upon the best existing self-correction method by 6.2 percentage points, while reducing overshoot (erroneous over-correction) by 41% through its convergence control mechanism.

Yuning Wu, Yingmin Liu, Yang Shu• 2026

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningMATH Levels 3, 4, 5 (test)
Overall Accuracy59.6
15
Reasoning CorrectionCyberCorrect-Bench
Accuracy79.8
7
Commonsense multi-hop reasoningStrategyQA 500 questions (test)
Accuracy81.4
5
Showing 3 of 3 rows

Other info

Follow for update