Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-modal Red-teaming on RTP and PTP (test)

2.3Refusal Rate (LLaVA)

STARE

2.1443.1974.255.303May 1, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.05
2.32.95.8
2026.05
2.53.14.4
2026.05
6.27.810.2