Toxicity Evaluation on DialogSum (DS)
[Chart: Toxic Fraction (y-axis) over time; highlighted point: GPT-4o, Jul 15, 2024]
Updated 1mo ago
Evaluation Results
| Method    | Date    | Toxic Fraction |
|-----------|---------|----------------|
| GPT-4o    | 2024.07 | 0              |
| GPT-4o-m  | 2024.07 | 0              |
| Gem-Fl    | 2024.07 | 0.1            |
| Gem-Pro   | 2024.07 | 0.1            |
| Gem-Fl-Lt | 2024.07 | 0.2            |