Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Character Task on BEYOND DIALOGUE
Loading...
64.2
Win Rate
PDD
42.152
47.876
53.6
59.324
Mar 2, 2026
Win Rate
Lose Rate
Updated 1mo ago
Evaluation Results
Method
Method
Links
Win Rate
Lose Rate
PDD
Base Model=LLaMA-3-8B-...
2026.03
64.2
35
PDD
Base Model=Qwen2.5-7B-...
2026.03
63.9
30.2
PDD
Base Model=Qwen2.5-7B-...
2026.03
60.9
35.4
PDD
Base Model=LLaMA-3-8B-...
2026.03
56.2
41.9
PDD
Base Model=Qwen2.5-7B-...
2026.03
49
43.5
PDD
Base Model=LLaMA-3-8B-...
2026.03
47.6
36.8
PDD
Base Model=LLaMA-3-8B-...
2026.03
46.8
36.5
PDD
Base Model=Qwen2.5-7B-...
2026.03
43
37.6
Feedback
Search any
task
Search any
task