Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

P-Soups

Benchmarks

Task NameDataset NameSOTA ResultTrend
Response SelectionP-Soups Expertise
Accuracy83.66
16
Response SelectionP-Soups Style
Accuracy0.88
16
Response SelectionP-Soups Informativeness
Accuracy78.07
16
Showing 3 of 3 rows