Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Phoenix-VL 1.5 Medium Technical Report

About

We introduce Phoenix-VL 1.5 Medium, a 123B-parameter natively multimodal and multilingual foundation model, adapted to regional languages and the Singapore context. Developed as a sovereign AI asset, it demonstrates that deep domain adaptation can be achieved with minimal degradation to broad-spectrum intelligence and alignment. Continued pretraining was performed on Mistral Medium 3.1 using a localized 1-trillion tokens multimodal corpus, followed by a 250-billion tokens long-context extension phase. Subsequent post-training incorporated a novel human-annotated Singapore multimodal dataset and curated textual corpus on Singapore culture, knowledge, and legislation, totaling 22-billion tokens. An additional 5 billion tokens of model alignment was performed through Online Direct Preference Optimization. Phoenix-VL 1.5 Medium achieves state-of-the-art performance for its size on Singapore multimodal, legal, and government policy benchmarks while remaining globally competitive on general multimodal intelligence, multilingual, and STEM benchmarks. We also introduce a novel evaluation suite encompassing localized knowledge benchmarks and an institutionally aligned model behavior and safety framework. We report the data curation principles, training methodology, and highlight benchmark and inference performance.

Team Phoenix: Arka Ray, Askar Ali Mohamed Jawad, Biondi Lee, Elijah Seah, Eva Lim, Fiona Teo, Grace Toh, Guang Xiang Teo, Jun En Tan, Jia Hui Bong, Jiale Wang, Jonathan Ng, Justin Tan, Kai Zhe Yew, Matthew Ong, Shun Yi Yeo, Wen Jett Lam, Wen Xiu Tan, Ze Yu Zhang, Gee Wah Ng, Chee Wee Ang, Mistral AI: Adrien Sad\'e, Guillaume Kunsch, Jia Sin Loh, Nicolas Schuhl, Rupert Menneer, Umar Jamil, Vincent Maladi\`ere, Yimu Pan• 2026

Related benchmarks

TaskDatasetResultRank
Instruction FollowingIFEval--
836
Massive Multi-discipline Multimodal UnderstandingMMMU
Accuracy60.78
216
Real-world Visual Question AnsweringRealworldQA
Accuracy66.27
173
Multitask Language UnderstandingMMLU-Pro
pass@176.81
38
Multilingual Reading ComprehensionBelebele--
18
Code GenerationLiveCode
Pass@132.4
15
Singapore knowledge evaluationSG-Gov
Pass@192.65
6
Singapore knowledge evaluationSG-Legal
Pass@186.4
6
Singapore knowledge evaluationHT-Lexicon
Pass@190.81
6
Graduate-level Science Question AnsweringGPQA Diamond
Score59.09
6
Showing 10 of 17 rows

Other info

Follow for update