Interactive Social Privacy and Theory of Mind on Social Intelligence in Adversarial Dialogue (test)

Fooling % (Hard): 26.7

GPT-5.4

Updated 3mo ago

Evaluation Results

Method	Links	Fooling % (Hard)
GPT-5.4 2026.04		26.7	51.1	49.8	49.9	3.53
GPT-5.4 2026.04		16.7	45.3	44.7	49.9	4.2
GPT-5.4 2026.04		15.6	24.7	22	45.6	4.99
GPT-5.4 2026.04		6.2	38.7	40.7	46.2	4.15