Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LSAT

Benchmarks

Task NameDataset NameSOTA ResultTrend
Logical ReasoningLSAT HELM
Balanced Accuracy24.38
17
Logical Reasoning Question AnsweringLSAT
Pass@10.29
11
Item Response Theory AssessmentLSAT
AUC70.7
9
Logical Reasoning and Reading ComprehensionLSAT PT 150–159
LR Accuracy99.1
8
Logical Reasoning and Reading ComprehensionLSAT Official (test N=77)
LR Accuracy100
8
Human Proficiency ExamLSAT
Accuracy81.1
7
Question AnsweringLSAT (OOD)
Accuracy26.58
5
Logical ReasoningLSAT
Accuracy37.4
5
Query RoutingLSAT In-Distribution (test)
CPT (90%)80.14
4
Query RoutingLSAT OOD
CPT 85%70.54
4
Query RoutingLSAT
CPT (95%)90.01
4
Query RoutingLSAT
CPT (90%)80.07
4
Model RoutingLSAT (ID)
CPT (80%)60.9
4
Model RoutingLSAT ID queries
CPT (85%)70.35
4
Model RoutingLSAT
CPT (95%)90
4
Query RoutingLSAT
Hypervolume91.75
4
Query RoutingLSAT OOD
CPT Score (80%)61
4
Legal ReasoningLSAT (test)
Hypervolume0.9188
4
Law School Admission TestingLSAT
Score163
3
Preference LearningLSAT (test)
AUC0.707
2
Showing 20 of 20 rows