XNLI

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Natural Language Inference | XNLI (test) | Average Accuracy: 90 | 167 |
| Natural Language Inference | XNLI | Accuracy: 87.1 | 131 |
| Zero-Shot Cross-Lingual Transfer | XNLI | Pearson Correlation: 0.9639 | 48 |
| Natural Language Inference | XNLI 1.0 (test) | Accuracy (en): 89.7 | 40 |
| Natural Language Inference | XNLI Ur (test) | Accuracy: 0.9643 | 26 |
| Natural Language Inference | XNLI Ur (dev) | Accuracy: 70.6 | 26 |
| Natural Language Inference | XNLI Hi (dev) | Accuracy: 76.91 | 26 |
| Natural Language Inference | XNLI (dev) | Accuracy: 82.7 | 24 |
| Text Classification | XNLI (test) | Accuracy (Max): 81.3 | 20 |
| Natural Language Inference | XNLI French | Accuracy: 59.1 | 18 |
| Natural Language Inference | XNLI Sw (test) | Accuracy: 65.34 | 18 |
| Natural Language Inference | XNLI Sw (dev) | Accuracy: 65.68 | 18 |
| Natural Language Inference | XNLI Hi (test) | Accuracy: 71.65 | 18 |
| Zero-shot performance prediction | XNLI | MAE: 1.53 | 18 |
| Natural Language Inference | XNLI French (test) | Accuracy: 85.7 | 16 |
| Natural Language Inference | XNLI 2.0 | Accuracy: 45.21 | 15 |
| Sentence-pair classification | XNLI 1.1 (test) | Accuracy (EN): 67.97 | 14 |
| Cross-lingual Natural Language Inference | XNLI (test) | Accuracy (All): 74.3 | 10 |
| Natural Language Inference | XNLI Arabic | Accuracy (Normalized): 33.73 | 10 |
| Sentence Pair Classification | XNLI Chinese portion (test) | Accuracy: 81.3 | 9 |
| Sentence Pair Classification | XNLI Chinese portion (dev) | Accuracy: 82.4 | 9 |
| Sequence Classification | XNLI European Languages | Accuracy: 86.8 | 8 |
| Sequence Classification | XNLI All Languages | Accuracy: 84.1 | 8 |
| Natural Language Inference | XNLI Hindi (test) | Accuracy: 97.31 | 8 |
| Natural Language Inference | XNLI Arabic (test) | Accuracy: 100 | 8 |
Showing 25 of 39 rows