Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Legal Machine Reading Comprehension on Legal Task - MRC (test)
Loading...
57.5
Rouge-L
LEGALMIDM-11B
21.3808
30.7579
40.135
49.5121
Apr 28, 2026
Rouge-L
GPT-4o Judge Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Rouge-L
GPT-4o Judge Score
LEGALMIDM-11B
Setting=Zero-shot
2026.04
57.5
8.94
Qwen2.5-32B
Setting=Zero-shot
2026.04
33.86
9.07
EXAONE-3.5-32B
Setting=Zero-shot
2026.04
30.6
8.48
Gemma-2-27b
Setting=Zero-shot
2026.04
30.09
8.71
Llama3.3-70B
Setting=Zero-shot
2026.04
22.77
8.58
Feedback
Search any
task
Search any
task