Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Harmlessness evaluation on Beavertails

58.4Helpful Score

Aligner

0.88815.81930.7545.681Feb 4, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.02
58.430.3
2024.02
5155.9
2024.02
5030
2024.02
38.315.1
2024.02
37.616.6
2024.02
34.947
2024.02
33.925.1
2024.02
33.363.3
2024.02
31.826.7
2024.02
26.415.9
2024.02
25.47.2
2024.02
24.452.4
2024.02
21.912
2024.02
20.110.3
2024.02
19.97.4
2024.02
19.414.9
2024.02
19.124
2024.02
18.625.8
2024.02
18.412.3
2024.02
17.85.5
2024.02
16.915.8
2024.02
16.710.6
2024.02
15.110.9
2024.02
14.219.1
2024.02
13.54.6
2024.02
10.61.9
2024.02
9.912.1
2024.02
9.39.3
2024.02
8.553.4
2024.02
828.6
2024.02
5.845
2024.02
5.22.4
2024.02
3.17.6