Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ECQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Natural Language Explanation GenerationECQA
Human Evaluation Score73.33
7
Commonsense Question AnsweringECQA (test)
Accuracy79.7
7
Explanation GenerationECQA (out-domain)
Grammar Score2.99
7
Natural Language Explanation GenerationECQA (test)
Accuracy59.4
6
Explanation GenerationECQA complete (test)
BERTScore87.67
6
Open-Label QAECQA
COS-E0.398
4
Commonsense ReasoningECQA
Pass@10.7612
3
Natural Language Explanation GenerationECQA few-shot 60-shot
Accuracy24.53
3
Commonsense Question AnsweringECQA
Performance Score (Finetune Baseline vs Predict Baseline)57.2
2
Showing 9 of 9 rows