Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

E-SNLI

Benchmarks

Task NameDataset NameSOTA ResultTrend
Natural Language InferenceE-SNLI
Accuracy91.31
46
Multiple Choice Classificatione-SNLI
Accuracy89.6
16
Natural Language Inferencee-SNLI (test)
Accuracy94
9
Logical Refinement of Natural Language Explanationse-SNLI
Initial Performance41
8
Natural Language Explanation Generatione-SNLI
Human Evaluation Score50
7
Explanation Generatione-SNLI (out-domain)
Grammar Score2.98
7
Natural Language Inferencee-SNLI abundant
Accuracy88.8
6
Natural Language Inferencee-SNLI (medium)
Accuracy87.5
6
Natural Language Inferencee-SNLI scarce
Accuracy86.3
6
Natural Language Explanation Generatione-SNLI (test)
Accuracy86.66
6
Chain-of-Thought Generatione-SNLI (test)
GPT-4 Score3.49
6
Natural Language Inferencee-SNLI
ECE4.35
4
Natural Language Explanation Generatione-SNLI 60-shot
Accuracy40.1
3
Showing 13 of 13 rows