SOTA Controlled Text Generation benchmarks and papers with code

Benchmarks

Dataset Name	SOTA Method	Metric	Value	Count	Updated
Syntactic Control (Q ∝ pq) (test)	GPT2-large + SIS	Log Probability Q(y)	0.0001	12	4mo ago
Syntactic Control (Q = p) (test)	Llama3-8B (5-shot) + SIS	Log Probability p(y)	-22.71	12	4mo ago
RealToxicityPrompts 10K nontoxic prompts	DEXPERTS	Avg Max Toxicity	30.2	9	4mo ago
Base Language Model Efficiency Comparison	PPLM	Speed Ratio	270.11	8	4mo ago
SST-5 No-Pos	GENhance	Positiveness Score	70	8	4mo ago
SST-5 200-Pos	GENhance	Positiveness Score	91	8	4mo ago
Yelp Formality (test)	LATENTOPS	Accuracy	97	4	4mo ago
Amazon Tense (test)		Accuracy	97	4	4mo ago
Single-Attribute Control prompts PPLM (test)	PriorControl	Average Score	4.13	3	4mo ago
Single-Attribute Control		Sentiment Avg	99.9	3	4mo ago
