WisdomInterrogatory (LuWen): An Open-Source Legal Large Language Model Technical Report

About

Large language models have demonstrated remarkable capabilities across a wide range of natural language processing tasks, yet their application in the legal domain remains challenging due to the specialized terminology, complex reasoning requirements, and rapidly evolving legal knowledge involved. In this paper, we present WisdomInterrogatory (LuWen), an open-source Chinese legal language model built upon the Baichuan foundation model through three key techniques: continual pre-training on a large-scale legal corpus, supervised fine-tuning with carefully curated legal instruction data, and retrieval-augmented generation integrated with a comprehensive legal knowledge base. We evaluate LuWen on five representative legal tasks spanning both prediction and generation settings, including legal judgment prediction, judicial examination, legal text summarization, law article question answering, and judicial decision reasoning. Experimental results show that LuWen outperforms several strong baselines, demonstrating the effectiveness of our approach in adapting general-purpose language models to the legal domain.

Yiquan Wu, Yuhang Liu, Yifei Liu, Ang Li, Siying Zhou, Kun Kuang, Fei Wu• 2026

Related benchmarks

Task	Dataset	Result
Legal Reasoning	LexEval	Memoization3.2	35
Legal Knowledge and Reasoning Benchmark	LawBench	Memorization Score29.23	30
Political and Legal Affairs Assessment	PoliLegal	Average Score55.85	14
Legal Judgment Prediction	Legal Judgment Prediction (test)	Statute Accuracy35	6
Judicial Examination	Judicial Examination (test)	Judicial Examination Score18	6
Judicial Decision Reasoning	Luwen Legal Generation Dataset	Human Evaluation Score53.7	3
Law Article Question Answering	Luwen Legal Generation Dataset	Human Evaluation Score84	3
Legal Text Summarization	Luwen Legal Generation	Human Evaluation Score71.6	3

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord