Legal Summarization on Legal Task Summary (test)
[Chart: ROUGE-L by date. Top result: LEGALMIDM-11B, 47.94 ROUGE-L, Apr 28, 2026. Available metrics: ROUGE-L, GPT-4o Judge Score.]
Updated 1mo ago
Evaluation Results
Method           Setting     Date      ROUGE-L   GPT-4o Judge Score
LEGALMIDM-11B    Zero-shot   2026.04   47.94     8.62
Gemma-2-27b      Zero-shot   2026.04   32.37     8.55
Qwen2.5-32B      Zero-shot   2026.04   30.76     8.64
Llama3.3-70B     Zero-shot   2026.04   30.3      8.39
EXAONE-3.5-32B   Zero-shot   2026.04   25.47     8.06