Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Logical Reasoning on Penguins
Loading...
72.734
Accuracy
Output Refinement
58.74808
62.37904
66.01
69.64096
Oct 3, 2023
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Output Refinement
Prompting Technique=Ou...
2023.10
72.734
PROMPTED
Prompting Technique=PR...
2023.10
69.434
Zero-Shot CoT
Prompting Technique=Ze...
2023.10
62.143
Zero-Shot
Prompting Technique=Ze...
2023.10
59.286
Feedback
Search any
task
Search any
task