Share your thoughts, 1 month free Claude Pro on usSee more

Privacy Expectation Estimation on CONFAIDE Tier 2 (InfoFlow-Expectation)

-83.3Sensitivity

GPT-4 (Drunk language inducement)

Updated 5mo ago

Evaluation Results

Method	Links
GPT-4 (Drunk language inducement) 2026.01		-83.3
GPT-4 2026.01		-74.9
GPT-4 (Drunk language inducement) 2026.01		-61.9
GPT-3.5 2026.01		-56.1
LLaMA3-8B (Drunk language inducement) 2026.01		-55.5
LLaMA3-8B 2026.01		-51.8
GPT-3.5 (Drunk language inducement) 2026.01		-50.4
LLaMA3-8B (Drunk language inducement) 2026.01		-37.3
LLaMA3-8B (Drunk language inducement) 2026.01		-35.6
LLaMA2-7B (Drunk language inducement) 2026.01		-3.6
LLaMA2-7B 2026.01		-2.6
Mistral-7B (Drunk language inducement) 2026.01		-1
LLaMA2-7B (Drunk language inducement) 2026.01		-0.4
LLaMA2-7B (Drunk language inducement) 2026.01		6.8
Mistral-7B 2026.01		26
Mistral-7B (Drunk language inducement) 2026.01		66.3
Mistral-7B (Drunk language inducement) 2026.01		89.8