Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Adversarial Robustness on MUSE-Book Harry Potter
Loading...
1.5
ASR
SFT
-1.26
17.37
36
54.63
Apr 16, 2026
ASR
Updated 1mo ago
Evaluation Results
Method
Method
Links
ASR
SFT
2026.04
1.5
Upper Bound
2026.04
6.7
MULE
2026.04
12.5
Ensemble Teacher (ET)
2026.04
13.5
NGDiff
2026.04
17.9
FLAT
Training objective=Obj...
2026.04
39
DUET
Teacher model integrat...
2026.04
39.5
MOLLM
2026.04
69.2
GA
Training objective=Obj...
2026.04
70
Base Model
2026.04
70.5
SimNPO
Training objective=Obj...
2026.04
70.5
Feedback
Search any
task
Search any
task