Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Harmfulness Evaluation on AdvBench

1.06Harmfulness Score

SAFEPATH-FT

0.95281.67642.43.1236Aug 6, 2025
Updated 27d ago

Evaluation Results

MethodLinks
2025.08
1.06
2025.08
1.08
2025.08
1.14
2025.08
1.16
2025.08
1.34
2025.08
1.4
2025.08
1.42
2025.08
1.42
2025.08
1.48
2025.08
1.52
2025.08
1.78
2025.08
2.1
2025.08
2.32
2025.08
2.32
2025.08
2.92
2025.08
3.1
2025.08
3.34
2025.08
3.54
2025.08
3.58
2025.08
3.64
2025.08
3.74
2025.08
3.74