Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Logical Deduction

Benchmarks

Task NameDataset NameSOTA ResultTrend
Logical ReasoningLogical Deduction
Pass@181.03
20
Logical DeductionLogical Deduction 5 objects (test)
Accuracy61.1
16
Showing 2 of 2 rows