Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Persuasion on args.me
Loading...
12.15
Agreement Shift
Qwen2.5-3B + ToMAP
2.6964
5.1507
7.605
10.0593
May 29, 2025
Agreement Shift
Updated 1d ago
Evaluation Results
Method
Method
Links
Agreement Shift
Qwen2.5-3B + ToMAP
Persuadee=Qwen3-Next-8...
2025.05
12.15
Qwen2.5-3B + ToMAP
Persuadee=GPT-4o-mini
2025.05
11.09
Qwen2.5-3B + RL
Persuadee=GPT-4o-mini
2025.05
10.53
Qwen2.5-3B + RL
Persuadee=Qwen3-Next-8...
2025.05
7.56
Qwen2.5-3B
Persuadee=GPT-4o-mini
2025.05
5.34
Qwen2.5-3B
Persuadee=Qwen3-Next-8...
2025.05
3.06
Feedback
Search any
task
Search any
task