Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Legal Reasoning on LexEval (Pass@1)
Loading...
57.1
Pass@1
Qwen2.5-7B + PPO w/ VERL.
37.964
42.932
47.9
52.868
Sep 28, 2025
Pass@1
Updated 5d ago
Evaluation Results
Method
Method
Links
Pass@1
Qwen2.5-7B + PPO w/ VERL.
Base Model=Qwen2.5-7B,...
2025.09
57.1
Qwen2.5-7B + PPO
Base Model=Qwen2.5-7B,...
2025.09
55.2
Qwen2.5-7B + GRPO w/ VERL.
Base Model=Qwen2.5-7B,...
2025.09
54.6
Qwen2.5-7B + GRPO
Base Model=Qwen2.5-7B,...
2025.09
53.6
Qwen2.5-7B
Base Model=Qwen2.5-7B
2025.09
50.4
Mathstral-7B-v0.1 + GRPO w/ VERL.
Base Model=Mathstral-7...
2025.09
47.6
Mathstral-7B-v0.1 + PPO
Base Model=Mathstral-7...
2025.09
44.5
Mathstral-7B-v0.1 + GRPO
Base Model=Mathstral-7...
2025.09
42.8
Mathstral-7B-v0.1 + PPO w/ VERL.
Base Model=Mathstral-7...
2025.09
42.2
Mathstral-7B-v0.1
Base Model=Mathstral-7...
2025.09
38.7
Feedback
Search any
task
Search any
task