Symbolic Reasoning on Coinflip OOD: 4
Chart: Accuracy over time. Top result: PaLM 540B, 90.2 accuracy (Dec 16, 2022).
Evaluation Results
Model      Method         Date     Accuracy
PaLM 540B  CoT 8-shot     2022.12  90.2
T5 XXL     Baseline       2022.12  73.8
T5 XXL     CoT Finetuned  2022.12  70.5