Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Chain-of-Thought Generation on ANLI (test)
Loading...
4.01
GPT-4 Score
Gold
3.3132
3.4941
3.675
3.8559
Mar 5, 2024
GPT-4 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
GPT-4 Score
Gold
Self-consistency=true
2024.03
4.01
Gold
Self-consistency=false
2024.03
3.82
DSS
Self-consistency=false
2024.03
3.48
DSS
Self-consistency=true
2024.03
3.44
MI-based distillation
Self-consistency=false
2024.03
3.42
MI-based distillation
Self-consistency=true
2024.03
3.34
Feedback
Search any
task
Search any
task