Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Content Moderation on Content Moderation Korean (test)
Loading...
64.7
Abusive Rate
CulturePark
35.268
42.909
50.55
58.191
May 24, 2024
Abusive Rate
Hate Rate
Average Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Abusive Rate
Hate Rate
Average Score
CulturePark
Backbone=GPT-3.5-turbo
2024.05
64.7
64
64.3
CultureBank
Training=fine-tuning G...
2024.05
63.5
52.2
57.9
Synatra-7B-v0.3-dpo
2024.05
39
46.5
42.8
EEVE-Korean-10.8B-v1.0
2024.05
36.4
43.7
40
Feedback
Search any
task
Search any
task