Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Interpretability Evaluation on Top-activated Texts

0.74Embedding Similarity

CLIP+SAE

-0.10240.11630.3350.5537Feb 16, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.02
0.7461
2025.02
0.4559
2025.02
0.4560
2025.02
-0.0746