Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Toxicity Classification on JIGSAW Toxicity Dataset Translated FR
Loading...
94.9
Class 0 Precision
GPT-4o Mini
84.292
87.046
89.8
92.554
Aug 15, 2025
Class 0 Precision
Class 0 Recall
Class 0 F1
Class 1 Precision
Class 1 Recall
Class 1 F1
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Class 0 Precision
Class 0 Recall
Class 0 F1
Class 1 Precision
Class 1 Recall
Class 1 F1
Accuracy
GPT-4o Mini
2025.08
94.9
93.3
91.7
94.8
91.4
93.1
93.2
DeepSeek-Chat
2025.08
93.7
89.4
91.5
89.9
93.9
91.9
91.7
GPT-4o
2025.08
90.7
88.4
93.1
88.9
93.4
91.1
90.9
Mistral Large
2025.08
88.9
93.4
91.1
93.1
88.4
90.7
90.9
Our Model
2025.08
84.7
92.4
88.4
91.7
83.3
87.3
87.9
Feedback
Search any
task
Search any
task