Logical Reasoning on CLUTRR (test)
Top result: SATLM, 80.1 Accuracy (May 16, 2023)
Evaluation Results
Method      Setting                      Date      Accuracy
SATLM       Language Model=code-da...    2023.05   80.1
PROGLM      Language Model=code-da...    2023.05   71.9
SATLM       Language Model=code-da...    2023.05   68.3
PROGLM      Language Model=code-da...    2023.05   58.9
COT         Language Model=code-da...    2023.05   45.7
STANDARD    Language Model=code-da...    2023.05   41.2
COT         Language Model=code-da...    2023.05   40.8