Toxicity Generation on RealToxicityPrompts 100k prompts
[Chart: Toxicity Score (Basic) over time. Plotted point: LLaMA, 10.4, Feb 27, 2023. Available metrics: Toxicity Score (Basic), Respectful Score.]
Updated 1mo ago
Evaluation Results
Method  Details                    Date     Toxicity Score (Basic)  Respectful Score
LLaMA   Model Size=13B, Decode...  2023.02  10.4                    9.5
LLaMA   Model Size=7B, Decoder...  2023.02  10.6                    8.1
LLaMA   Model Size=33B, Decode...  2023.02  10.7                    8.7
LLaMA   Model Size=65B, Decode...  2023.02  12.8                    14.1