Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SNLI

Benchmarks

Task NameDataset NameSOTA ResultTrend
Natural Language InferenceSNLI (test)
Accuracy94.7
694
Natural Language InferenceSNLI
Accuracy100
196
Natural Language InferenceSNLI (train)
Accuracy99.7
154
Natural Language InferenceSNLI (dev)
Accuracy93.6
71
Counterfactual GenerationSNLI Hypothesis
LFR83
37
Counterfactual GenerationSNLI Premise
LFR0.759
37
Natural Language InferenceSNLI hard 1.0 (test)
Accuracy84.48
27
Explanation FaithfulnessSNLI
Delta AF0.989
24
Masked Language ModelingSNLI (randomly sampled)
PPL (U)8.57
20
Natural Language InferenceSNLI 1.0 (test)
Accuracy90.67
19
Explanation EvaluationSNLI (test)
Sufficiency43.76
16
Natural Language InferenceSNLI-Neg
Accuracy75.9
14
Membership Inference AttackSNLI
ROC AUC99.8
12
Natural Language InferenceSNLI source: MNLI (test)
Accuracy80.2
12
Natural Language InferenceSNLI
Correlation Coefficient83.05
10
Natural Language InferenceSNLI Combined variant (test)
Accuracy88.93
10
Natural Language InferenceSNLI Noise variant (test)
Accuracy89.77
10
Natural Language InferenceSNLI Emoji variant (test)
Accuracy88.96
10
Natural Language InferenceSNLI Slang variant (test)
Accuracy92.8
10
Natural Language InferenceSNLI Original (test)
Accuracy93.12
10
Ranking correlation with full dataset evaluationSNLI
Kendall Correlation0.93
10
Human AlignmentSNLI
R@118.5
9
Natural Language InferenceSNLI
Macro-F172.59
9
Semantic DifferentiationSNLI
Wasserstein Distance3.72
9
Comparative Reasoningdelta-SNLI
Accuracy88.9
9
Showing 25 of 59 rows