Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Legal Reasoning on LegalBench (Exact Match)
Loading...
69
Exact Match
ReConcile
30.936
40.818
50.7
60.582
May 24, 2026
Exact Match
Updated 8d ago
Evaluation Results
Method
Method
Links
Exact Match
ReConcile
Models=Qwen2.5-7B-Inst...
2026.05
69
DarkForest
Models=Qwen2.5-7B-Inst...
2026.05
68
Self-Consistency
Models=Qwen2.5-7B-Inst...
2026.05
65
Refine
Models=Qwen2.5-7B-Inst...
2026.05
60.8
Debate
Models=Qwen2.5-7B-Inst...
2026.05
57.6
Mixture-of-Agent
Models=Qwen2.5-7B-Inst...
2026.05
53.8
Graph-of-Agent (Mean)
Models=Qwen2.5-7B-Inst...
2026.05
46.4
Graph-of-Agent (Max)
Models=Qwen2.5-7B-Inst...
2026.05
32.4
Feedback
Search any
task
Search any
task