Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CivilComments

Benchmarks

Task NameDataset NameSOTA ResultTrend
Toxicity ClassificationCivilComments sensitive attribute: MUSLIM (test)
Balanced Accuracy59.9
57
ClassificationCivilComments (test)
Worst-case Accuracy82.2
47
Robust ClassificationCivilComments
Worst-Group Accuracy72.6
23
Toxicity detectionCivilComments-WILDS (test)
Average Accuracy92.7
19
Sentiment ClassificationCivilComments HELM
Balanced Acc65.81
18
Text ClassificationCivilComments-WILDS (test)
Accuracy92.34
13
Toxicity ClassificationCivilComments (CC) (test)
Worst-Group Accuracy79.66
13
Toxicity DetectionCivilComments (test)
WGA71.6
9
Text ClassificationCivilComments (val)
Accuracy69.1
6
Domain GeneralizationCivilComments Wilds (test)
Average Accuracy92.2
6
Domain GeneralizationCivilComments Wilds (val)
Average Accuracy92.3
6
Toxicity ClassificationCivilComments
Average Accuracy92.6
3
Showing 12 of 12 rows