Symbolic Reasoning on COLOR (Colored Object)
[Chart: Accuracy over time. Top result: SATLM, 99.4 accuracy (May 16, 2023).]
Evaluation Results
Method     Configuration               Date      Accuracy
SATLM      Language Model=code-da...   2023.05   99.4
PROGLM     Language Model=code-da...   2023.05   98
SATLM      Language Model=code-da...   2023.05   97.7
PROGLM     Language Model=code-da...   2023.05   95.1
COT        Language Model=code-da...   2023.05   90.6
COT        Language Model=code-da...   2023.05   86.3
STANDARD   Language Model=code-da...   2023.05   75.7