Judicial Decision Reasoning on Luwen Legal Generation
Chart: Human Evaluation Score over time. Top entry: Luwen, 53.7 (Apr 8, 2026).
Evaluation Results
Method     Settings   Date      Human Evaluation Score
Luwen      -          2026.04   53.7
GPT-3.5    -          2026.04   42.7
Baichuan   SFT=true   2026.04   29.2