Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Qwen2.5

Benchmarks

Task NameDataset NameSOTA ResultTrend
Pairwise Preference ComparisonQwen2.5-3B responses (test)
Avg Preference Score82.7
30
Language Modeling InferenceQwen2.5-7B 8K context length
Decode Latency (ms/token)7.1
4
Showing 2 of 2 rows