Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Persuasion on Persuasion Datasets Average
Loading...
1.27
Persuasion Gain
DeepSeek-R1
-0.0404
0.2998
0.64
0.9802
Sep 26, 2025
Persuasion Gain
Updated 1mo ago
Evaluation Results
Method
Method
Links
Persuasion Gain
DeepSeek-R1
Setting=Dynamic, Recei...
2025.09
1.27
Claude 3.7 Sonnet
Setting=Dynamic, Recei...
2025.09
1.04
GPT-4o
Setting=Dynamic, Recei...
2025.09
0.62
Llama-3.3-70B-Instruct
Setting=Dynamic, Recei...
2025.09
0.44
Llama-3.1-8B-Instruct
Setting=Dynamic, Recei...
2025.09
0.42
Mistral-7B-Instruct-v0.3
Setting=Dynamic, Recei...
2025.09
0.31
Qwen2.5-7B-Instruct
Setting=Dynamic, Recei...
2025.09
0.23
DeepSeek-R1
Setting=Static, Receiv...
2025.09
0.23
Claude 3.7 Sonnet
Setting=Static, Receiv...
2025.09
0.14
Llama-3.3-70B-Instruct
Setting=Static, Receiv...
2025.09
0.06
GPT-4o
Setting=Static, Receiv...
2025.09
0.06
Llama-3.1-8B-Instruct
Setting=Static, Receiv...
2025.09
0.04
Qwen2.5-7B-Instruct
Setting=Static, Receiv...
2025.09
0.02
Mistral-7B-Instruct-v0.3
Setting=Static, Receiv...
2025.09
0.01
Feedback
Search any
task
Search any
task