Output-based feature description faithfulness on GPT2 Res. SAE
[Chart: Faithfulness Score by date. Top entry: EnsembleR (MA+VP), 47.2 (Jan 14, 2025)]
Evaluation Results
| Method | Setting | Date | Faithfulness Score |
|---|---|---|---|
| EnsembleR (MA+VP) | SAE Width=32k, Layer A... | 2025.01 | 47.2 |
| EnsembleR (MA+TC) | SAE Width=32k, Layer A... | 2025.01 | 47.2 |
| EnsembleR (All) | SAE Width=32k, Layer A... | 2025.01 | 47.2 |
| EnsembleC (All) | SAE Width=32k, Layer A... | 2025.01 | 46.9 |
| EnsembleR (VP+TC) | SAE Width=32k, Layer A... | 2025.01 | 44.2 |
| MaxAct | SAE Width=32k, Layer A... | 2025.01 | 44.1 |
| TokenChange | SAE Width=32k, Layer A... | 2025.01 | 43.4 |
| VocabProj | SAE Width=32k, Layer A... | 2025.01 | 42.8 |