Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GNLI

Benchmarks

Task NameDataset NameSOTA ResultTrend
Natural Language InferenceGNLI Human (test)
Accuracy84.69
21
Intrinsic ReasoningGNLI
Spearman Correlation0.843
9
Showing 2 of 2 rows