Content Moderation on Content Moderation Korean (test)
[Chart: Abusive Rate by date. Top result: CulturePark, 64.7 (May 24, 2024). Metrics shown: Abusive Rate, Hate Rate, Average Score.]
Updated 1mo ago
Evaluation Results
| Method | Details | Date | Abusive Rate | Hate Rate | Average Score |
|---|---|---|---|---|---|
| CulturePark | Backbone=GPT-3.5-turbo | 2024.05 | 64.7 | 64 | 64.3 |
| CultureBank | Training=fine-tuning G... | 2024.05 | 63.5 | 52.2 | 57.9 |
| Synatra-7B-v0.3-dpo | | 2024.05 | 39 | 46.5 | 42.8 |
| EEVE-Korean-10.8B-v1.0 | | 2024.05 | 36.4 | 43.7 | 40 |