Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Symbolic Reasoning on Coinflip OOD: 3
Loading...
98.6
Accuracy
PaLM 540B
9.68
32.765
55.85
78.935
Dec 16, 2022
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
PaLM 540B
Method=CoT 8-shot
2022.12
98.6
T5 XXL
Method=CoT Finetuned
2022.12
86.7
T5 XXL
Method=Baseline
2022.12
13.1
Feedback
Search any
task
Search any
task