Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Text Safety Filter Bypass on NSFW-200 Target Filter: GPT-4.1 1.0 (cross-filter transfer)

97.5Bypass Rate

OptJail

94.995.57596.2596.925May 25, 2025
Updated 8d ago

Evaluation Results

MethodLinks
2025.05
97.5
2025.05
97
2025.05
95