Prompt Reconstruction Defense (TokenInfer attack) on WikiText2

97.54 TRA

No Protection

Feb 27, 2026
Updated 1mo ago

Evaluation Results

Date       TRA
2026.02    97.54    0.9401
2026.02    97.21    0.9444
2026.02    96.87    0.9375
2026.02    84.65    0.8906
2026.02    84.55    0.8984
2026.02     0.47    0.4299
2026.02     0.38    0.4242