Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

English dataset

Benchmarks

Task NameDataset NameSOTA ResultTrend
Word Embedding AccuracyEnglish dataset
Accuracy85.8
18
Misinformation DetectionEnglish Dataset
Macro F176.08
18
Text ClassificationEnglish Dataset
Accuracy0.9148
11
Jailbreak Safety EvaluationEnglish dataset Multi-Image
StrongREJECT (Perturbed)14
6
Jailbreak Safety EvaluationEnglish dataset Single-Image
StrongREJECT (Perturbed)10
6
Jailbreak Safety EvaluationEnglish dataset Text
StrongREJECT Rate0.01
6
Speech ReconstructionEnglish dataset 2 kHz sampling
LSD1.01
5
Speech ReconstructionEnglish dataset 1 kHz sampling
LSD1.19
5
Speech ReconstructionEnglish dataset 500 Hz sampling
LSD1.29
5
Showing 9 of 9 rows