Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

QNLI

Benchmarks

Task NameDataset NameSOTA ResultTrend
Natural Language InferenceQNLI
Accuracy96.56
78
Natural Language InferenceQNLI (test)
Accuracy93.3
27
Natural Language InferenceQNLI 64 instances (test)
Accuracy91.1
20
Sentence-pair classificationQNLI
Accuracy92.45
20
Natural Language InferenceQNLI few-shot zero-shot
Accuracy71.6
16
Text ClassificationQNLI
Accuracy (%)92.79
15
Natural Language UnderstandingQNLI
Exact Match82.5
14
Text ClassificationQNLI (test)
Accuracy (Clean)92.8
14
Embedding InversionQNLI (test)
ROUGE-L0.2226
12
Natural Language InferenceQNLI (val)
Accuracy88.05
11
Ranking correlation with full dataset evaluationQNLI
Kendall Correlation0.91
10
Natural Language InferenceQNLI Dir(0.1) (test)
Accuracy88.83
8
Natural Language InferenceQNLI (test)
SEAT e-size (Names: Career/Family)0.01
8
Bias MitigationQNLI
Accuracy85.39
8
Backdoor DefenseQNLI
Accuracy85.46
8
Natural Language InferenceQNLI standard (test dev)
SAcc92.1
6
Natural Language InferenceQNLI (dev)
Accuracy0.945
6
Question-Answer EntailmentQNLI (val)
AUC77.09
6
Natural Language InferenceQNLI
Total Running Time (s)3,011
5
Generalization gap predictionQNLI Case 9
Gap Prediction Error0.18
5
Generalization gap predictionQNLI
Generalization Gap Error0.14
5
Question Answering NLIQNLI GLUE (test)
Accuracy0.903
5
Binary ClassificationQNLI
AUC77.11
5
Semantic CachingQNLI
End-to-End Latency (min)1,504.43
4
Natural Language InferenceQNLI Non-Biased
Accuracy90.9
4
Showing 25 of 31 rows