Phoenix-VL 1.5 Medium Technical Report

About

We introduce Phoenix-VL 1.5 Medium, a 123B-parameter natively multimodal and multilingual foundation model, adapted to regional languages and the Singapore context. Developed as a sovereign AI asset, it demonstrates that deep domain adaptation can be achieved with minimal degradation to broad-spectrum intelligence and alignment. Continued pretraining was performed on Mistral Medium 3.1 using a localized 1-trillion tokens multimodal corpus, followed by a 250-billion tokens long-context extension phase. Subsequent post-training incorporated a novel human-annotated Singapore multimodal dataset and curated textual corpus on Singapore culture, knowledge, and legislation, totaling 22-billion tokens. An additional 5 billion tokens of model alignment was performed through Online Direct Preference Optimization. Phoenix-VL 1.5 Medium achieves state-of-the-art performance for its size on Singapore multimodal, legal, and government policy benchmarks while remaining globally competitive on general multimodal intelligence, multilingual, and STEM benchmarks. We also introduce a novel evaluation suite encompassing localized knowledge benchmarks and an institutionally aligned model behavior and safety framework. We report the data curation principles, training methodology, and highlight benchmark and inference performance.

Team Phoenix: Arka Ray, Askar Ali Mohamed Jawad, Biondi Lee, Elijah Seah, Eva Lim, Fiona Teo, Grace Toh, Guang Xiang Teo, Jun En Tan, Jia Hui Bong, Jiale Wang, Jonathan Ng, Justin Tan, Kai Zhe Yew, Matthew Ong, Shun Yi Yeo, Wen Jett Lam, Wen Xiu Tan, Ze Yu Zhang, Gee Wah Ng, Chee Wee Ang, Mistral AI: Adrien Sad\'e, Guillaume Kunsch, Jia Sin Loh, Nicolas Schuhl, Rupert Menneer, Umar Jamil, Vincent Maladi\`ere, Yimu Pan• 2026

Related benchmarks

Task	Dataset	Result
Instruction Following	IFEval	--	854
Massive Multi-discipline Multimodal Understanding	MMMU	Accuracy60.78	249
Real-world Visual Question Answering	RealworldQA	Accuracy66.27	183
Multitask Language Understanding	MMLU-Pro	pass@176.81	38
Multilingual Reading Comprehension	Belebele	--	18
Code Generation	LiveCode	Pass@132.4	15
Singapore knowledge evaluation	SG-Gov	Pass@192.65	6
Singapore knowledge evaluation	SG-Legal	Pass@186.4	6
Singapore knowledge evaluation	HT-Lexicon	Pass@190.81	6
Graduate-level Science Question Answering	GPQA Diamond	Score59.09	6

Showing 10 of 17 rows

Other info

Follow for update

@wizwand_team Discord