Symbolic Reasoning on LastLetter (test)
Best result: SGE, 86.96 accuracy
[Chart: best accuracy over time, Apr 23, 2024 to May 28, 2024]
Updated 4d ago
Evaluation Results
| Method | Configuration | Date | Accuracy |
| --- | --- | --- | --- |
| SGE | Model=GPT-4, Tool=Code... | 2024.05 | 86.96 |
| Decomp Prompting | Model=GPT-4, Tool=Code... | 2024.05 | 85.64 |
| CoT Prompting | Model=GPT-4, Tool=Code... | 2024.05 | 85.18 |
| Refine Prompting | Model=GPT-4, Tool=Code... | 2024.05 | 84.82 |
| Least-to-Most | LLM=GPT-3.5-turbo, Pro... | 2024.04 | 83.2 |
| IO Prompting | Model=GPT-4, Tool=Code... | 2024.05 | 81.98 |
| DUP | LLM=GPT-3.5-turbo, Pro... | 2024.04 | 81.2 |
| Few-shot Auto-CoT | LLM=GPT-3.5-turbo, Pro... | 2024.04 | 81.2 |
| Few-shot Manual-CoT | LLM=GPT-3.5-turbo, Pro... | 2024.04 | 74.4 |
| Zero-shot CoT | LLM=GPT-3.5-turbo, Pro... | 2024.04 | 60.8 |
| Zero-shot PS+ | LLM=GPT-3.5-turbo, Pro... | 2024.04 | 60.6 |