Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LawThinker: A Deep Research Legal Agent in Dynamic Environments

About

Legal reasoning requires not only correct outcomes but also procedurally compliant reasoning processes. However, existing methods lack mechanisms to verify intermediate reasoning steps, allowing errors such as inapplicable statute citations to propagate undetected through the reasoning chain. To address this, we propose LawThinker, an autonomous legal research agent that adopts an Explore-Verify-Memorize strategy for dynamic judicial environments. The core idea is to enforce verification as an atomic operation after every knowledge exploration step. A DeepVerifier module examines each retrieval result along three dimensions of knowledge accuracy, fact-law relevance, and procedural compliance, with a memory module for cross-round knowledge reuse in long-horizon tasks. Experiments on the dynamic benchmark J1-EVAL show that LawThinker achieves a 24% improvement over direct reasoning and an 11% gain over workflow-based methods, with particularly strong improvements on process-oriented metrics. Evaluations on three static benchmarks further confirm its generalization capability. The code is available at https://github.com/yxy-919/LawThinker-agent .

Xinyu Yang, Chenlong Deng, Tongyu Wen, Binyu Xie, Zhicheng Dou• 2026

Related benchmarks

TaskDatasetResultRank
Civil CourtJ1 EVAL
PFS Score50.3
14
Complaint DraftingJ1 EVAL
FOR Score87.7
14
Criminal CourtJ1 EVAL
PFS (Procedural Fairness Score)41.5
14
Defence DraftingJ1 EVAL
FOR81.4
14
Knowledge QuestioningJ1 EVAL
Average Score64.1
14
Legal ConsultationJ1 EVAL
Average Score52.4
14
Legal EvaluationLawBench
Accuracy62
6
Legal EvaluationLexEval
Accuracy71.1
6
Legal EvaluationUnilaw Eval R1
Accuracy53.7
6
Showing 9 of 9 rows

Other info

GitHub

Follow for update