A.X K1 Technical Report
About
We introduce A.X K1, a 519B-parameter Mixture-of-Experts (MoE) language model trained from scratch. Our design leverages scaling laws to optimize training configurations and vocabulary size under fixed computational budgets. A.X K1 is pre-trained on a corpus of approximately 10T tokens, curated by a multi-stage data processing pipeline. Designed to bridge the gap between reasoning capability and inference efficiency, A.X K1 supports explicitly controllable reasoning to facilitate scalable deployment across diverse real-world scenarios. We propose a simple yet effective Think-Fusion training recipe, enabling user-controlled switching between thinking and non-thinking modes within a single unified model. Extensive evaluations demonstrate that A.X K1 achieves performance competitive with leading open-source models, while establishing a distinctive advantage in Korean-language benchmarks.
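To make the "fixed computational budget" framing concrete, here is a minimal back-of-the-envelope sketch using the widely used rule of thumb that training compute is roughly C ≈ 6·N·D, where N is the number of parameters involved in each token's forward pass and D is the number of training tokens. This is not the report's own scaling-law fit. The token count (about 10T) and the total parameter count (519B) come from the abstract; the active-parameter figure is a hypothetical placeholder, since the abstract does not state how many parameters are active per token, and the function name `training_flops` is likewise illustrative.

```python
def training_flops(params_per_token: float, tokens: float) -> float:
    """Approximate training compute with the common C ~ 6 * N * D rule of thumb."""
    return 6.0 * params_per_token * tokens


TOKENS = 10e12          # ~10T tokens, as stated in the abstract
TOTAL_PARAMS = 519e9    # 519B total parameters, as stated in the abstract

# ASSUMPTION: placeholder only; the abstract does not give the active-parameter count.
ASSUMED_ACTIVE_PARAMS = 33e9

# Upper bound: treats the model as if every parameter were used for every token.
dense_upper_bound = training_flops(TOTAL_PARAMS, TOKENS)

# MoE estimate: only the (assumed) active parameters contribute per token.
moe_estimate = training_flops(ASSUMED_ACTIVE_PARAMS, TOKENS)

print(f"dense upper bound: {dense_upper_bound:.3e} FLOPs")
print(f"MoE estimate (assumed active params): {moe_estimate:.3e} FLOPs")
```

The gap between the two numbers illustrates why an MoE design can hold a large total parameter count while keeping the per-token training cost closer to that of a much smaller dense model.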
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Instruction Following | IFEval | -- | -- | 292 |
| Code Generation | HumanEval+ | -- | -- | 189 |
| Instruction Following | IFBench | Pass@1 (Strict) | 44.3 | 68 |
| Code Generation | MBPP+ | Score | 85.7 | 43 |
| Knowledge Evaluation | KMMLU, KMMLU Redux, KMMLU Pro, CLIcK, KoBALT, MMLU Pro, GPQA Diamond | Accuracy | 84.9 | 21 |
| Code Generation | LiveCodeBench v6, LiveCodeBench-ko, HumanEval+, HumanEval+ ko, MBPP+, SciCode | Pass@1 | 0.93 | 18 |
| Agentic Task | Tau2-Telecom | Accuracy | 58.1 | 13 |
| Mathematics | AIME25, AIME25-ko, HRM8K, KMO25 | Accuracy | 93 | 12 |
| Instruction Following | IFBench, IFEval, IFEval-ko | Accuracy | 81 | 9 |
| Knowledge | Click | Score | 77.2 | 7 |