Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Task Category A

Benchmarks

Task NameDataset NameSOTA ResultTrend
Functionally Diverse Response GenerationTask Category A
Functionally Diverse Responses Count3.83
35
Showing 1 of 1 rows