IQuest-Coder-V1 Technical Report
About
In this report, we introduce the IQuest-Coder-V1 series-(7B/14B/40B/40B-Loop), a new family of code large language models (LLMs). Moving beyond static code representations, we propose the code-flow multi-stage training paradigm, which captures the dynamic evolution of software logic through different phases of the pipeline. Our models are developed through the evolutionary pipeline, starting with the initial pre-training consisting of code facts, repository, and completion data. Following that, we implement a specialized mid-training stage that integrates reasoning and agentic trajectories in 32k-context and repository-scale in 128k-context to forge deep logical foundations. The models are then finalized with post-training of specialized coding capabilities, which is bifurcated into two specialized paths: the thinking path (utilizing reasoning-driven RL) and the instruct path (optimized for general assistance). IQuest-Coder-V1 achieves state-of-the-art performance among competitive models across critical dimensions of code intelligence: agentic software engineering, competitive programming, and complex tool use. To address deployment constraints, the IQuest-Coder-V1-Loop variant introduces a recurrent mechanism designed to optimize the trade-off between model capacity and deployment footprint, offering an architecturally enhanced path for efficacy-efficiency trade-off. We believe the release of the IQuest-Coder-V1 series, including the complete white-box chain of checkpoints from pre-training bases to the final thinking and instruction models, will advance research in autonomous code intelligence and real-world agentic systems.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Code Generation | HumanEval+ | Pass@197.6 | 383 | |
| Code Generation | MBPP+ | Pass@177.8 | 216 | |
| Text-to-SQL | Spider | Exec Acc (All)92.2 | 91 | |
| Text-to-SQL | Bird | Total Execution Accuracy70.5 | 64 | |
| Code Reasoning | CRUXEval | Input-CoT Accuracy93.5 | 56 | |
| Code Generation | FullStackBench | Pass@171.4 | 45 | |
| Safety Evaluation | HarmBench | -- | 42 | |
| Code Generation | LiveCodeBench v6 | Score81.1 | 41 | |
| Code Generation | BigCodeBench Full | Pass@154.2 | 38 | |
| Code Generation | BigCodeBench Hard | Pass@133.1 | 38 |