Legal Reasoning on CaseHold
[Chart: Cumulative Score (CS) over time. Top method: AutoAdapt, CS 96, Mar 9, 2026]
Evaluation Results
Method        Setting                     Date      Cumulative Score (CS)
AutoAdapt     -                           2026.03   96
AutoMLAgent   -                           2026.03   49
DS-Agent      -                           2026.03   14
MLCopilot     -                           2026.03   12
AutoAdapt     setting=Template Free...    2026.03   0.9253
AutoMLAgent   setting=Template Free...    2026.03   0.859
DS-Agent      setting=Template Free...    2026.03   0.0316
MLCopilot     setting=Template Free...    2026.03   0.031