Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Input-based feature description faithfulness on GPT2 MLP SAE
Loading...
51.2
Faithfulness Score
EnsembleR (MA+VP)
4.296
16.473
28.65
40.827
Jan 14, 2025
Faithfulness Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Faithfulness Score
EnsembleR (MA+VP)
SAE Width=32k, Layer A...
2025.01
51.2
EnsembleR (MA+TC)
SAE Width=32k, Layer A...
2025.01
51.1
EnsembleR (All)
SAE Width=32k, Layer A...
2025.01
50.2
MaxAct
SAE Width=32k, Layer A...
2025.01
39.7
EnsembleC (All)
SAE Width=32k, Layer A...
2025.01
24.4
EnsembleR (VP+TC)
SAE Width=32k, Layer A...
2025.01
7.1
VocabProj
SAE Width=32k, Layer A...
2025.01
6.3
TokenChange
SAE Width=32k, Layer A...
2025.01
6.1
Feedback
Search any
task
Search any
task