SAE latent interpretation on Goodfire SAEs (test)
[Chart: Hit Rate and Coverage over time. Best Hit Rate: 62.8, SelfIE (SA), Feb 10, 2026.]
Updated 1mo ago
Evaluation Results
| Method | Configuration | Date | Hit Rate | Coverage |
|---|---|---|---|---|
| SelfIE (SA) | Training Source=Goodfi... | 2026.02 | 62.8 | 87.7 |
| Original + 5 Paraphrases | Label candidates=6 | 2026.02 | 62.2 | 89.5 |
| SelfIE (SA+LR) | Training Source=Goodfi... | 2026.02 | 59.2 | 87.4 |
| Auto-Interp Labels x6 | Label candidates=6 | 2026.02 | 59 | 87.2 |
| SelfIE (SA) | Training Source=Llama... | 2026.02 | 56.4 | 83.9 |
| SelfIE (SA) | Training Source=Wikipe... | 2026.02 | 49.7 | 74 |
| SelfIE (SA+LR) | Training Source=Llama... | 2026.02 | 46.5 | 76.1 |
| Untrained SelfIE | Training Source=None | 2026.02 | 40.4 | 69.1 |
| SelfIE (Full-Rank) | Training Source=Wikipe... | 2026.02 | 34.7 | 60.7 |