
BoLD

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Detoxification | BOLD | Toxicity (Max): 1.9 | 28 |
| Toxicity Evaluation | BOLD 23679 prompts (test) | Avg Toxicity (Max): 0.02 | 18 |
| Bias and Sentiment Evaluation | BOLD | BOLD Score: 50.3 | 17 |
| Toxicity Evaluation | BOLD | Avg Toxicity (Max): 0.022 | 14 |
| Emotion Recognition | BoLD | mAP: 26.66 | 8 |
| Language Generation Bias Evaluation | BOLD | Toxicity Score (All): 0.016 | 5 |
| Emotion Recognition | BoLD official (test) | mR2: 0.1597 | 3 |