Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Chain-of-Thought Generation on CQA (test)
Loading...
4.11
GPT-4 Score
Gold
3.5796
3.7173
3.855
3.9927
Mar 5, 2024
GPT-4 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
GPT-4 Score
Gold
Self-consistency=true
2024.03
4.11
Gold
Self-consistency=false
2024.03
3.95
MI-based distillation
Self-consistency=false
2024.03
3.7
DSS
Self-consistency=true
2024.03
3.64
MI-based distillation
Self-consistency=true
2024.03
3.63
DSS
Self-consistency=false
2024.03
3.6
Feedback
Search any
task
Search any
task