Legal Question Answering on Legal Task QA (test)
[Chart: top ROUGE-L 17.74 (LEGALMIDM-11B), Apr 28, 2026. Selectable metrics: ROUGE-L, GPT-4o Score.]
Updated 1mo ago
Evaluation Results
Method           Setting     Date      ROUGE-L   GPT-4o Score
LEGALMIDM-11B    Zero-shot   2026.04   17.74     6.1
Qwen2.5-32B      Zero-shot   2026.04   15.7      6.18
EXAONE-3.5-32B   Zero-shot   2026.04   14.98     6.49
Gemma-2-27b      Zero-shot   2026.04   13.51     5.78
Llama3.3-70B     Zero-shot   2026.04   12.23     5.5