Share your thoughts, 1 month free Claude Pro on usSee more

Helpfulness Evaluation on MM-Vet2 (test)

54.4GPT-Eval Score

No Defense

Updated 3mo ago

Evaluation Results

Method	Links
No Defense 2025.08		54.4
PRISM 2025.08		48.9
SPA-VL 2025.08		46.8
PRISM 2025.08		20.4
SPA-VL 2025.08		20.2
SafeRLHF-V 2025.08		19.3
VLGuard 2025.08		17.7
No Defense 2025.08		13.1
SafeRLHF-V 2025.08		12.9
VLGuard 2025.08		12.3