Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Symbolic Reasoning on Coin Flip
Loading...
99.9
Accuracy
Auto-CoT
9.316
32.833
56.35
79.867
Oct 7, 2022
Nov 17, 2022
Dec 28, 2022
Feb 7, 2023
Mar 20, 2023
Apr 30, 2023
Jun 10, 2023
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
Auto-CoT
Prompting=Automatic Ch...
2022.10
99.9
Auto-CoT
Base Model=text-davinc...
2023.06
99.9
CoK + SC
Base Model=gpt-3.5-turbo
2023.06
99.2
Manual CoT + SC
Base Model=gpt-3.5-turbo
2023.06
99
CoK
Base Model=gpt-3.5-turbo
2023.06
98
CoK
Base Model=text-davinc...
2023.06
97.4
Manual CoT
Base Model=gpt-3.5-turbo
2023.06
97.4
Manual-CoT
Prompting=Manual Chain...
2022.10
97.2
Zero-Shot-CoT
Prompting=Zero-shot Ch...
2022.10
91.4
Zero-Shot CoT
Base Model=text-davinc...
2023.06
91.4
Manual CoT
Base Model=text-davinc...
2023.06
74.5
Few-Shot
Prompting=Few-shot
2022.10
57.2
Zero-Shot
Prompting=Zero-shot
2022.10
53.8
Few-Shot SP
Base Model=text-davinc...
2023.06
49.1
Zero-Shot SP
Base Model=text-davinc...
2023.06
12.8
Feedback
Search any
task
Search any
task