Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Portuguese Evaluation Suite

Benchmarks

Task NameDataset NameSOTA ResultTrend
Language ModelingPortuguese evaluation suite (test)
NPM5.92
27
Language ModelingPortuguese Evaluation Suite Hard Set
NPM0.99
15
Language ModelingPortuguese Evaluation Suite Easy Set
NPM18.7
15
Language ModelingPortuguese Evaluation Suite Total
NPM19.89
15
Showing 4 of 4 rows