Natural Language Benchmarks

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| General NLP Evaluation | Natural Language Benchmarks Aggregate | Average Score: 62.31 | 30 |
| Constrained Text Generation | Natural Language Benchmarks Word Length constraint | Saturation: 95 | 3 |
| Constrained Text Generation | Natural Language Benchmarks Between (ubd.) constraint | Saturation: 93.8 | 3 |
| Constrained Text Generation | Natural Language Benchmarks Between-n constraint | Saturation: 85.5 | 3 |
| Constrained Text Generation | Natural Language Benchmarks Appearance constraint | Saturation: 92.5 | 3 |
| Constrained Text Generation | Natural Language Benchmarks Suffix constraint | Saturation: 96.8 | 3 |
| Constrained Text Generation | Natural Language Benchmarks Prefix constraint | Saturation: 95.7 | 3 |