Emergent Misalignment Measurement on Medical General Evaluation
[Chart: Misalignment and Incoherence scores; highlighted entry: Persona Vectors, Misalignment 0.38, Aug 8, 2025]
Evaluation Results
| Method | Model | Date | Misalignment | Incoherence |
|---|---|---|---|---|
| Persona Vectors | Qwen2.5-32B | 2025.08 | 0.38 | 0 |
| KL | Qwen2.5-32B | 2025.08 | 2.5 | 0 |
| Interleaving++ | Qwen2.5-32B | 2025.08 | 7.89 | 7.94 |
| Interleaving | Qwen2.5-32B | 2025.08 | 11.33 | 7.42 |
| Interleaving+ | Qwen2.5-32B | 2025.08 | 16.74 | 9.21 |
| Misaligned | Qwen2.5-32B | 2025.08 | 35.42 | 5.42 |
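
As a rough illustration of how the results above can be compared, the following Python sketch ranks the listed methods by Misalignment score, breaking ties on Incoherence. It assumes that lower is better for both metrics, which the page does not state explicitly; the names `RESULTS` and `rank_methods` are hypothetical and chosen only for this example.

```python
# Scores copied from the Evaluation Results listing
# (every row: Model=Qwen2.5-32B, date 2025.08).
# Each value is (misalignment, incoherence).
RESULTS = {
    "Persona Vectors": (0.38, 0.0),
    "KL": (2.5, 0.0),
    "Interleaving++": (7.89, 7.94),
    "Interleaving": (11.33, 7.42),
    "Interleaving+": (16.74, 9.21),
    "Misaligned": (35.42, 5.42),
}


def rank_methods(results):
    """Return method names sorted by misalignment, then incoherence.

    Assumption (not stated by the source): lower is better on both metrics.
    """
    return sorted(results, key=lambda name: results[name])


if __name__ == "__main__":
    for position, name in enumerate(rank_methods(RESULTS), start=1):
        misalignment, incoherence = RESULTS[name]
        print(f"{position}. {name}: misalignment={misalignment}, incoherence={incoherence}")
```

Sorting on the `(misalignment, incoherence)` tuple keeps the comparison explicit: a method with a slightly lower misalignment score but much higher incoherence still ranks first under this ordering, so a different key may be preferable if the two metrics should be weighed together.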