Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SNLI

Benchmarks

Task NameDataset NameSOTA ResultTrend
Natural Language InferenceSNLI (test)
Accuracy94.7
681
Natural Language InferenceSNLI
Accuracy100
174
Natural Language InferenceSNLI (train)
Accuracy99.7
154
Natural Language InferenceSNLI (dev)
Accuracy93.6
71
Counterfactual GenerationSNLI Hypothesis
LFR83
37
Counterfactual GenerationSNLI Premise
LFR0.759
37
Natural Language InferenceSNLI hard 1.0 (test)
Accuracy84.48
27
Explanation FaithfulnessSNLI
Delta AF0.989
24
Masked Language ModelingSNLI (randomly sampled)
PPL (U)8.57
20
Natural Language InferenceSNLI 1.0 (test)
Accuracy90.67
19
Explanation EvaluationSNLI (test)
Sufficiency43.76
16
Membership Inference AttackSNLI
ROC AUC99.8
12
Natural Language InferenceSNLI source: MNLI (test)
Accuracy80.2
12
Ranking correlation with full dataset evaluationSNLI
Kendall Correlation0.93
10
Identifying plausible explanationsδ-SNLI
Accuracy81.6
9
Natural Language InferenceSNLI 1.0 (train)
Accuracy93.1
9
Natural Language InferenceSNLI Counterfactual
Accuracy59.9
8
Natural Language InferenceSNLI In-Domain (test)
Accuracy91.68
8
Natural Language Inferenceadv-SNLI TextFooler-RoBERTa
Accuracy52.6
8
Natural Language Inferenceadv-SNLI TextFooler-BERT
Accuracy62.3
8
Ordinal ClassificationSNLI standard (test)
F1 Score89.1
7
Text ClassificationSNLI
Accuracy88.2
6
Natural Language InferenceSNLI 3-Choice
ΔAcc11.7
6
Counterfactual FaithfulnessSNLI
Faithfulness Score0.243
6
Redaction FaithfulnessSNLI
Faithfulness Score0.355
6
Showing 25 of 44 rows