Persuasion Evaluation on Anthropic

Persuasion Gain: 1.33 (DeepSeek-R1)

Updated 4mo ago

Evaluation Results

Method	Links	Score
DeepSeek-R1 2025.09		1.33
Claude 3.7 Sonnet 2025.09		1.13
GPT-4o 2025.09		0.73
Mistral-7B-Instruct-v0.3 2025.09		0.6
Qwen2.5-7B-Instruct 2025.09		0.51
Llama-3.3-70B-Instruct 2025.09		0.49
Llama-3.1-8B-Instruct 2025.09		0.44
DeepSeek-R1 2025.09		0.29
Claude 3.7 Sonnet 2025.09		0.28
GPT-4o 2025.09		0.15
Llama-3.1-8B-Instruct 2025.09		0.12
Mistral-7B-Instruct-v0.3 2025.09		0.11
Qwen2.5-7B-Instruct 2025.09		0.08
Llama-3.3-70B-Instruct 2025.09		0.08