CLUTRR

Benchmarks

Task Name              Dataset Name    SOTA Result      Trend
Inductive Reasoning    Clutrr          Pass@1 95.5      18
Binary Classification  CLUTRR          Accuracy 78      18
Logical Reasoning      CLUTRR          Accuracy 95.9    14
Logical Reasoning      CLUTRR (test)   Accuracy 80.1    7