BBH-NLP

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Natural Language Processing Reasoning | BBH-NLP (test) | Accuracy (ACC): 66.2 | 16 |