
DeepSeek-R1

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Annotation Accuracy | DeepSeek-R1 Experiment 1 | F1 Score (Ga): 100 | 40
LLM Attack Effectiveness | DeepSeek-R1-Distill-Llama-8B serving environment | TTFT (s): 0.08 | 6
Text Naturalness Evaluation | DeepSeek-R1 Experiment 2 | BERT Score: 0.99 | 5