Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Finetuning with implicit harmful data on Identity-shift

53Utility

Safe Lora

33.2438.3743.548.63May 13, 2026
Updated 19d ago

Evaluation Results

MethodLinks
2026.05
53623.27
2026.05
52291.92
2026.05
52532.98
2026.05
51753.75
2026.05
5181.24
2026.05
51111.37
2026.05
5111.01
2026.05
3401