Persuasion Evaluation on Anthropic
[Figure: Persuasion Gain chart. Highlighted result: DeepSeek-R1, 1.33. Date shown: Sep 26, 2025.]
Evaluation Results
| Method | Setting | Date | Persuasion Gain |
|---|---|---|---|
| DeepSeek-R1 | Setting=Dynamic, Recei... | 2025.09 | 1.33 |
| Claude 3.7 Sonnet | Setting=Dynamic, Recei... | 2025.09 | 1.13 |
| GPT-4o | Setting=Dynamic, Recei... | 2025.09 | 0.73 |
| Mistral-7B-Instruct-v0.3 | Setting=Dynamic, Recei... | 2025.09 | 0.6 |
| Qwen2.5-7B-Instruct | Setting=Dynamic, Recei... | 2025.09 | 0.51 |
| Llama-3.3-70B-Instruct | Setting=Dynamic, Recei... | 2025.09 | 0.49 |
| Llama-3.1-8B-Instruct | Setting=Dynamic, Recei... | 2025.09 | 0.44 |
| DeepSeek-R1 | Setting=Static, Receiv... | 2025.09 | 0.29 |
| Claude 3.7 Sonnet | Setting=Static, Receiv... | 2025.09 | 0.28 |
| GPT-4o | Setting=Static, Receiv... | 2025.09 | 0.15 |
| Llama-3.1-8B-Instruct | Setting=Static, Receiv... | 2025.09 | 0.12 |
| Mistral-7B-Instruct-v0.3 | Setting=Static, Receiv... | 2025.09 | 0.11 |
| Qwen2.5-7B-Instruct | Setting=Static, Receiv... | 2025.09 | 0.08 |
| Llama-3.3-70B-Instruct | Setting=Static, Receiv... | 2025.09 | 0.08 |