Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Legal Text Generation on Legal Task Petition (test)
Loading...
14.46
Rouge-L
LEGALMIDM-11B
9.1248
10.5099
11.895
13.2801
Apr 28, 2026
Rouge-L
GPT-4o Judge Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Rouge-L
GPT-4o Judge Score
LEGALMIDM-11B
Setting=Zero-shot
2026.04
14.46
8.27
Qwen2.5-32B
Setting=Zero-shot
2026.04
14.08
7.75
EXAONE-3.5-32B
Setting=Zero-shot
2026.04
11.28
7.88
Gemma-2-27b
Setting=Zero-shot
2026.04
11.17
7.91
Llama3.3-70B
Setting=Zero-shot
2026.04
9.33
7.66
Feedback
Search any
task
Search any
task