| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Language Modeling | GPT-2 Pre-training (val) | Validation Loss | 2.493 | 21 |
| Language Modeling | GPT-2 Evaluation Set | Hyper-Prior BPT | 29.91 | 20 |
| Language Modeling | GPT-2 124M held-out (test) | Perplexity | 17.33 | 10 |
| Machine-Generated Text Detection | GPT-2 (full) | Accuracy | 91.1 | 9 |
| Language Modeling | GPT-2 Pretraining Data (train) | Training Loss | 2.9167 | 8 |
| Data Extraction | GPT-2 (train) | Pearson r | 0.48 | 8 |
| Concentration of Target Information | GPT-2 Small Suite Aggregate (test) | Gini Coefficient | 0.71 | 6 |
| Concentration of Target Information | GPT-2 Small 5 (test) | Gini Coefficient | 0.72 | 6 |
| Concentration of Target Information | GPT-2 Small 4 (test) | Gini Coefficient | 0.74 | 6 |
| Concentration of Target Information | GPT-2 Small 3 (test) | Gini Coefficient | 0.73 | 6 |
| Concentration of Target Information | GPT-2 Small (test) | Gini Coefficient | 0.71 | 6 |
| Concentration of Target Information | GPT-2 Small 1 (test) | Gini Coefficient | 0.33 | 6 |
| Machine-Generated Text Detection | GPT-2 (test) | Accuracy | 85.75 | 5 |
| Language Modeling | GPT-2 1,000 samples | Perplexity (PPL) | 27.99 | 4 |
| Language Modeling | GPT-2 (val) | Base CE | 3.48 | 1 |
| Efficiency Evaluation | GPT-2 124M | Inference Speed (x) | 1 | 1 |
| Explanation Attribution | GPT-2 output-preserving n=30, k=2, overlap=0.7 (test) | - | - | 0 |
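The language-modeling rows mix three related metrics: cross-entropy loss, bits per token (BPT), and perplexity. Assuming the losses are mean per-token cross-entropies in nats (the usual convention for GPT-2 training logs), the conversions are fixed; a minimal sketch is below. Note the loss, BPT, and perplexity rows above come from different evaluation sets, so their values are not expected to convert into one another.

```python
import math

def ce_to_perplexity(ce_nats: float) -> float:
    """Perplexity is the exponential of mean per-token cross-entropy (in nats)."""
    return math.exp(ce_nats)

def ce_to_bits_per_token(ce_nats: float) -> float:
    """Convert nats/token to bits/token by dividing by ln(2)."""
    return ce_nats / math.log(2)

# Example: a validation loss of 2.493 nats corresponds to
# perplexity exp(2.493) ~ 12.10 and 2.493 / ln(2) ~ 3.60 bits/token.
print(ce_to_perplexity(2.493))      # ~12.10
print(ce_to_bits_per_token(2.493))  # ~3.60
```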
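The concentration rows report a Gini coefficient, which measures how unevenly a quantity is spread over a population (0 = perfectly even, 1 = fully concentrated in one item). The table does not specify which per-example scores the coefficient is computed over, so the input below is hypothetical; this is a sketch of the standard Gini computation on a vector of non-negative scores.

```python
import numpy as np

def gini(scores: np.ndarray) -> float:
    """Gini coefficient of a vector of non-negative scores.

    0 means the scores are spread evenly across examples;
    1 means all mass is concentrated in a single example.
    """
    x = np.sort(np.asarray(scores, dtype=float))
    n = x.size
    if n == 0 or x.sum() == 0:
        return 0.0
    # Sorted-index formulation: G = sum_i (2i - n - 1) * x_i / (n * sum(x)),
    # with i running 1..n over the ascending-sorted scores.
    index = np.arange(1, n + 1)
    return float(((2 * index - n - 1) * x).sum() / (n * x.sum()))

# Hypothetical usage: per-example scores where most mass sits in one example.
print(gini(np.array([0.1, 0.1, 0.1, 0.9])))  # 0.5: moderately concentrated
```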