Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Toxicity and Diversity Evaluation on SafeNLP

0.028Toxicity

Mistral-7b-Instruct + CP

0.015880.097690.17950.26131Jan 16, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.01
0.0280.090.410.68
2024.01
0.0430.30.60.72
2024.01
0.1320.10.190.21
2024.01
0.2570.440.70.71
2024.01
0.2690.180.540.76
2024.01
0.3310.320.590.65