Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GPT2

Benchmarks

Task NameDataset NameSOTA ResultTrend
Training ThroughputGPT2-1.5B
Throughput4.1
25
Training Data AttributionGPT2-small
LDS Score0.3936
10
Output-based feature description faithfulnessGPT2 MLP SAE
Faithfulness Score40.9
8
Input-based feature description faithfulnessGPT2 MLP SAE
Faithfulness Score51.2
8
Output-based feature description faithfulnessGPT2 Res. SAE
Faithfulness Score47.2
8
Input-based feature description faithfulnessGPT2 Res. SAE
Faithfulness Score60.4
8
Private text generationGPT2-base (124M)
Usage Fraction100
7
Private InferenceGPT2-base (124M)
Embed Inference Time (s)5.17
7
Feature MatchingGPT2 Layer 0 match with Layer 11
LLM Eval Score1.39
6
Feature MatchingGPT2 Layer 5 match with Layer 11
LLM Eval1.56
6
Adversarial AttackGPT2 F.t.
ASR (%)74.25
6
Circuit CompressionGPT2-small Digit Addition
Accuracy68.12
5
Feature MatchingGPT2 Layer 5 match with Layer 6
LLM Eval2.53
4
Sparse ProbingGPT2 Small
Average F174.3
4
Activation ReconstructionGPT2 Small
MSE0.32
4
Showing 15 of 15 rows