Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SAEBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Sparse ProbingSAEBench
Average F1 Score81.9
16
ReconstructionSAEBench held-out data
MSE0.03
16
Showing 2 of 2 rows