Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

POPQUORN

Benchmarks

Task NameDataset NameSOTA ResultTrend
Hate Speech DetectionPOPQUORN
Accuracy76.8
19
Politeness RatingPOPQUORN Avg Age
Coverage100
10
Offensiveness ratingPOPQUORN Age 50+ 1.0 (test)
Coverage95
5
Offensiveness ratingPOPQUORN Age 35–49 1.0 (test)
Coverage95
5
Offensiveness ratingPOPQUORN Age 18–34 1.0 (test)
Coverage (Cov.)100
5
Offensiveness ratingPOPQUORN Avg Age 1.0 (test)
Coverage95
5
Politeness RatingPOPQUORN (Age 50+)
Coverage95
5
Politeness RatingPOPQUORN (Age 18–34)
Coverage100
5
Showing 8 of 8 rows