Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Output-based feature description evaluation on Llama Residual SAE features 3.1

75.4Score

Ensemble Concat

63.75266.77669.872.824Jan 14, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.01
75.4
72
2025.01
71.8
2025.01
71.2
2025.01
68.9
2025.01
68
2025.01
67.4
2025.01
64.2