Toxicity Classification on JIGSAW Toxicity Dataset Original EN
[Chart: Class 0 Precision. Highlighted point: DeepSeek-Chat, 96.7, Aug 15, 2025. Selectable metrics: Class 0 Precision, Class 0 Recall, Class 0 F1, Class 1 Precision, Class 1 Recall, Class 1 F1, Accuracy.]
Evaluation Results
| Method | Date | Class 0 Precision | Class 0 Recall | Class 0 F1 | Class 1 Precision | Class 1 Recall | Class 1 F1 | Accuracy |
|---|---|---|---|---|---|---|---|---|
| DeepSeek-Chat | 2025.08 | 96.7 | 89.4 | 92.9 | 90.1 | 97 | 93.4 | 93.2 |
| GPT-4o Mini | 2025.08 | 92.8 | 93.9 | 91.6 | 93.8 | 91.4 | 92.6 | 92.7 |
| GPT-4o | 2025.08 | 91.8 | 90.4 | 91.1 | 90.5 | 91.9 | 91.2 | 91.2 |
| Our Model | 2025.08 | 87.4 | 94.9 | 91 | 94.5 | 86.4 | 90.2 | 90.7 |
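
For readers comparing the per-class columns, the F1 score is conventionally the harmonic mean of precision and recall. The sketch below assumes the leaderboard uses that standard definition (the page itself does not say how its F1 values were computed) and applies it to the DeepSeek-Chat Class 0 figures from the table. The function name `f1_score` is a hypothetical helper, not part of any benchmark tooling.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (same scale as the inputs).

    Assumption: the standard F1 definition; the leaderboard does not
    document its own formula.
    """
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both inputs are zero
    return 2 * precision * recall / (precision + recall)


# DeepSeek-Chat, Class 0 precision and recall from the table (in percent).
deepseek_class0_f1 = f1_score(96.7, 89.4)
print(round(deepseek_class0_f1, 1))  # prints 92.9
```

Because the inputs are percentages, the result is also a percentage; passing fractions in [0, 1] works the same way.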