Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Harmfulness Assessment on Bi-directional Anchoring evaluation set

28.5Harmful Score

Vaccine-SFT

28.25629.90331.5533.197May 28, 2024
Updated 3mo ago

Evaluation Results

MethodLinks
2024.05
28.5
2024.05
30.3
2024.05
33.9
2024.05
33.9
2024.05
33.9
2024.05
34.6