Safety Alignment on XSTest
[Chart: Compliance. Top result: Yi-VL-6B, 95.2 (Apr 14, 2025). Metrics shown: Compliance, Rejection Rate.]
Evaluation Results
Method                  Finetuning Dataset  Date     Compliance  Rejection Rate
Yi-VL-6B                Origin              2025.04  95.2        41.5
LLaVA-NeXT-Mistral-7B   Origin              2025.04  94.8        58.5
LLaVA-v1.5-7B           Origin              2025.04  92          75.5
LLaVA-NeXT-Mistral-7B   Ours                2025.04  91.2        65.5
LLaVA-v1.5-7B           Ours                2025.04  90.4        82
LLaVA-v1.5-13B          Ours                2025.04  90.4        90.5
LLaVA-v1.5-13B          Origin              2025.04  90          84.5
Yi-VL-6B                Ours                2025.04  89.2        67
LLaVA-NeXT-Mistral-7B   VLG...              2025.04  87.6        89.5
Yi-VL-6B                VLG...              2025.04  84.4        93
LLaVA-v1.5-13B          VLG...              2025.04  77.6        97
LLaVA-v1.5-7B           VLG...              2025.04  77.2        96.5
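
The results are listed by Compliance, but the two metrics pull in different directions: the entries with the highest Compliance tend to have the lowest Rejection Rate. The following Python sketch is an illustration only, not part of the benchmark. It transcribes the rows from the table and shows one possible way to re-rank them by either metric and to list the entries that no other entry beats on both metrics at once. The names `ROWS`, `rank_by`, and `pareto_front` are hypothetical, and the sketch assumes that higher is better for both metrics, which the page does not state explicitly. The truncated dataset name `VLG...` is kept as it appears.

```python
# Rows transcribed from the results table:
# (method, finetuning dataset, compliance, rejection rate).
ROWS = [
    ("Yi-VL-6B", "Origin", 95.2, 41.5),
    ("LLaVA-NeXT-Mistral-7B", "Origin", 94.8, 58.5),
    ("LLaVA-v1.5-7B", "Origin", 92.0, 75.5),
    ("LLaVA-NeXT-Mistral-7B", "Ours", 91.2, 65.5),
    ("LLaVA-v1.5-7B", "Ours", 90.4, 82.0),
    ("LLaVA-v1.5-13B", "Ours", 90.4, 90.5),
    ("LLaVA-v1.5-13B", "Origin", 90.0, 84.5),
    ("Yi-VL-6B", "Ours", 89.2, 67.0),
    ("LLaVA-NeXT-Mistral-7B", "VLG...", 87.6, 89.5),
    ("Yi-VL-6B", "VLG...", 84.4, 93.0),
    ("LLaVA-v1.5-13B", "VLG...", 77.6, 97.0),
    ("LLaVA-v1.5-7B", "VLG...", 77.2, 96.5),
]

COMPLIANCE, REJECTION = 2, 3  # tuple indices of the two metric columns


def rank_by(rows, column):
    """Return rows sorted by the given metric column, highest first."""
    return sorted(rows, key=lambda row: row[column], reverse=True)


def pareto_front(rows):
    """Return rows that no other row matches or beats on both metrics
    while strictly beating on at least one (assumption: higher is better)."""
    front = []
    for row in rows:
        dominated = any(
            other[COMPLIANCE] >= row[COMPLIANCE]
            and other[REJECTION] >= row[REJECTION]
            and (other[COMPLIANCE] > row[COMPLIANCE]
                 or other[REJECTION] > row[REJECTION])
            for other in rows
        )
        if not dominated:
            front.append(row)
    return front


if __name__ == "__main__":
    print("Top by Rejection Rate:", rank_by(ROWS, REJECTION)[0])
    for method, dataset, compliance, rejection in pareto_front(ROWS):
        print(f"{method:<24}{dataset:<8}{compliance:>6}{rejection:>6}")
```

Under that assumption, re-ranking by Rejection Rate puts LLaVA-v1.5-13B (VLG...) first, while ranking by Compliance keeps Yi-VL-6B (Origin) first.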