Multi-modal Red-teaming on RTP and PTP (test)
[Chart: refusal rate by method, May 1, 2026. Series: Refusal Rate (LLaVA), Refusal Rate (Qwen), Refusal Rate (Gemini). Highlighted point: STARE, Refusal Rate (LLaVA) = 2.3.]
Updated 1mo ago
Evaluation Results
Method             Date     Refusal Rate (LLaVA)  Refusal Rate (Qwen)  Refusal Rate (Gemini)
STARE (align=0.2)  2026.05  2.3                   2.9                  5.8
Text-Only          2026.05  2.5                   3.1                  4.4
Text+SD            2026.05  6.2                   7.8                  10.2