Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Fine-tuning Robustness against Harmful Data Attacks on GSM8K

0.95Harmful Score (Clean)

Lisa

0.9061.2031.51.797Feb 28, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.02
0.953.053.824.625.868.164.419.979.579.68.99.239.29.41
2026.02
0.951.140.951.141.241.291.1215.8314.5715.415.415.0714.3715.12
2026.02
1.297.1513.0719.5523.629.3315.6712.2712.2311.7711.7711.7711.7711.93
2026.02
1.381.762.915.19.0613.495.6216.5315.9716.516.316.4315.716.24
2026.02
2.0510.6815.5919.0623.9427.8516.5312.2311.8311.4310.910.910.5311.3