Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

NLI

Benchmarks

Task NameDataset NameSOTA ResultTrend
Natural Language InferenceNLI adversarial benchmark (test)
Average Score75.4
18
Natural Language InferenceNLI
Accuracy91.2
14
Natural Language InferenceNLI ANLI and HANS (unseen)
ANLI Score32.4
9
Natural Language InferenceNLI domain average
Best Accuracy87.5
8
Natural Language InferenceNLI (test)
Relative CPU Speed2.89
2
Showing 5 of 5 rows