
Trolling-oriented generations

Benchmarks

Task Name                  Dataset Name                   SOTA Result                           Trend
Harmful content detection  Trolling-oriented generations  DeepSeek-Llama 70B (Accuracy: 16.24)  4
Harmful content detection  Trolling-oriented generations  Llama-3.1 70B (Accuracy: 26.04)       4
Harmful content detection  Trolling-oriented generations  GPT-4o (Accuracy: 19.88)              4