Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Character Task on CharacterEval
Loading...
65.3
Win Rate
PDD
38.052
45.126
52.2
59.274
Mar 2, 2026
Win Rate
Lose Rate
Updated 1mo ago
Evaluation Results
Method
Method
Links
Win Rate
Lose Rate
PDD
Base Model=Qwen2.5-7B-...
2026.03
65.3
33.1
PDD
Base Model=LLaMA-3-8B-...
2026.03
63.1
35.7
PDD
Base Model=Qwen2.5-7B-...
2026.03
52.8
41.5
PDD
Base Model=LLaMA-3-8B-...
2026.03
52.5
43.1
PDD
Base Model=Qwen2.5-7B-...
2026.03
51.2
34.7
PDD
Base Model=Qwen2.5-7B-...
2026.03
48.7
38.5
PDD
Base Model=LLaMA-3-8B-...
2026.03
48.2
41.6
PDD
Base Model=LLaMA-3-8B-...
2026.03
39.1
31.3
Feedback
Search any
task
Search any
task