Share your thoughts, 1 month free Claude Pro on usSee more

Discrimination on PsyCLIENT-CP +behavior

26.1Accuracy (A)

DeepSeek-R1

Updated 4mo ago

Evaluation Results

Method	Links
DeepSeek-R1 2026.01		26.1	9.2
Claude-Sonet-3.5 2026.01		14.4	7.5
Qwen3-235B-A22B 2026.01		14.2	17.2
DeepSeek-V3-0324 2026.01		1.4	0.3
GPT-4o 2026.01		0.6	0.6
Qwen2.5-72B-Instruct 2026.01		0	0