Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Symbolic Reasoning on Coin Flip (test)
Loading...
98.6
Accuracy
Few-shot Auto-CoT
82.168
86.434
90.7
94.966
Apr 23, 2024
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Few-shot Auto-CoT
LLM=GPT-3.5-turbo, Pro...
2024.04
98.6
Few-shot Manual-CoT
LLM=GPT-3.5-turbo, Pro...
2024.04
98.2
DUP
LLM=GPT-3.5-turbo, Pro...
2024.04
97.6
Zero-shot PS+
LLM=GPT-3.5-turbo, Pro...
2024.04
95.4
Zero-shot CoT
LLM=GPT-3.5-turbo, Pro...
2024.04
94.4
Least-to-Most
LLM=GPT-3.5-turbo, Pro...
2024.04
82.8
Feedback
Search any
task
Search any
task