BLiMP

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Linguistic Minimal Pair Scoring | BLiMP | Overall Accuracy: 88.9 | 49 |
| Linguistic Minimal Pair Evaluation | BLiMP (test) | NPI lic. (2): 100 | 28 |
| Linguistic Probing | BLiMP | Performance: 64.8 | 10 |
| Linguistic Acceptability | BLiMP English (all) | Accuracy: 81.6 | 9 |
| Syntax | BLiMP | Accuracy: 84.61 | 8 |
| Audio Language Modelings | BLIMP | Accuracy: 64.7 | 8 |
| Zero-shot Language Modeling | BLiMP (test) | Accuracy: 79.6 | 8 |
| Syntactic Generalization | BLiMP (test) | BLiMP Accuracy: 0.773 | 8 |
| Semantic Anomaly Detection | BLIMP Animacy | Accuracy: 78.7 | 6 |
| Morphosyntax Anomaly Detection | BLIMP Det-Noun | Accuracy: 99.9 | 6 |
| Morphosyntax Anomaly Detection | BLIMP Subject-Verb | Accuracy: 97.1 | 6 |
| Linguistic Analysis | BLiMP | Accuracy: 60.5 | 4 |
| Linguistic Competence Classification | BLiMP | Accuracy: 83 | 3 |