Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Legal interpretation drafting on Expert-verified regulatory chunks (blind evaluation set)
Loading...
30
Wins
HUMBR
10.24
15.37
20.5
25.63
Apr 13, 2026
Wins
Losses
Win Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Wins
Losses
Win Rate
HUMBR
Comparison=vs. Human E...
2026.04
30
7
81
HUMBR
Comparison=vs. Univers...
2026.04
17
2
89.5
Universal Self-Consistency
Comparison=vs. Human
2026.04
11
6
64.7
Feedback
Search any
task
Search any
task