CaseHOLD

Benchmarks

Task Name                    Dataset Name     SOTA Result                Trend
Legal Reasoning              CaseHOLD (test)  Test Accuracy: 89.22       22
Case holding classification  CaseHOLD (test)  Mean macro F1: 78.5        12
Legal Reasoning              CaseHold         Cumulative Score (CS): 96  8