Legal Reasoning on Law (LLM-as-judge score)
[Chart: LLM-as-judge Score by date. Highlighted point: ProbMoE, 34.4, Jun 1, 2026]
Updated 19h ago
Evaluation Results
| Method | Configuration | Date | LLM-as-judge Score |
| --- | --- | --- | --- |
| ProbMoE | Backbone=Qwen1.5-MoE-A... | 2026.06 | 34.4 |
| Frozen Router | Backbone=Qwen1.5-MoE-A... | 2026.06 | 33.01 |
| DefaultMoE | Backbone=Qwen1.5-MoE-A... | 2026.06 | 31.2 |
| DenseMixer | Backbone=Qwen1.5-MoE-A... | 2026.06 | 30.75 |
| Conventional | Backbone=Qwen1.5-MoE-A... | 2026.06 | 29.5 |
| ProbMoE | Backbone=OLMoE-1B-7B,... | 2026.06 | 29 |
| DenseMixer | Backbone=OLMoE-1B-7B,... | 2026.06 | 27.9 |
| ReMoE | Backbone=Qwen1.5-MoE-A... | 2026.06 | 25.5 |
| Conventional | Backbone=OLMoE-1B-7B,... | 2026.06 | 25 |
| Frozen Router | Backbone=OLMoE-1B-7B,... | 2026.06 | 22.5 |
| Base Model | Backbone=Qwen1.5-MoE-A... | 2026.06 | 18.2 |
| Base Model | Backbone=OLMoE-1B-7B,... | 2026.06 | 5.7 |
| SparseMixer | Backbone=Qwen1.5-MoE-A... | 2026.06 | 3.4 |