SOTA Text Generation benchmarks and papers with code | Wizwand
Text Generation
Benchmarks
| Dataset Name | SOTA Method | Metric | Value | Results | Last Updated |
|---|---|---|---|---|---|
| DomainBench | SYTTA-16 | BLEU (Agriculture) | 71.37 | 144 | 23d ago |
| OpenWebText | SDTT | Perplexity | 3.18 | 86 | 1mo ago |
| LM1B (test) | UDLM | Entropy | 2.46 | 72 | 1mo ago |
| CNN/Daily Mail (test) | DEL | COH Score | 83 | 64 | 1mo ago |
| Medical Chatbot | Baseline | ASR | 100 | 42 | 1mo ago |
| OWT | DFM (ESD) | GPT2 Perplexity | 5.33 | 41 | 4d ago |
| 5 Generation tasks | POP | Accuracy | 57.96 | 36 | 1mo ago |
| Text Generation | Dense | PPL | 11.9 | 33 | 1mo ago |
| Text model inference M4 Max | vllm-mlx | Throughput (tok/s) | 525.5 | 31 | 1mo ago |
| MSCOCO | SARE | BLEU-1 | 57.2 | 26 | 1mo ago |
| LM1B | DFM (ESD) | Perplexity (PPL) | 68.11 | 24 | 4d ago |
| Wikitext-103 | Refined by Gemma3 27B | Perplexity | 32.88 | 23 | 4d ago |
| Spec-Bench Overall | SpecBound | SD Score | 2.33 | 21 | 3d ago |
| NoveltyBench | STATe of Thoughts | Diversity | 5.39 | 21 | 1mo ago |
| AbGen | ROMA | Importance | 4.91 | 20 | 1mo ago |
| Open Web Text (OWT) (val) | Masked D-MMD | GPT-2 GM Score | 0.456 | 19 | 26d ago |
| Aggregate NLP Tasks (GEC, Smart Reply, Summarization, Tone Adjustment, QA) (test) | Separate single-task LoRAs | Average Score | 32.9 | 18 | 1mo ago |
| WebNLG seen categories (test) | CGE-LW | BLEU | 63.69 | 18 | 1mo ago |
| Hazard Detection (val) | Qwen2-VL-7B ft | BLEU-4 | 0.658 | 17 | 19d ago |
| FuseEval English Scenario | SpecEM | ROUGE-1 | 31.19 | 16 | 1mo ago |
| Decoding Throughput | AWQ | Decoding Throughput (tokens/s) | 320 | 16 | 1mo ago |
| Wikipedia Biographies (test) | Pretrained | Delta Precision (δ-P) | 25.37 | 16 | 1mo ago |
| VPI Generation Tasks Llama3-8B Mistral-7B (test) | Backdoor | ASR | 100 | 16 | 1mo ago |
| AutoPoison Generation Llama3-8B Mistral-7B (test) | Backdoor | ASR | 82.7 | 16 | 1mo ago |
| DTBA Llama3-8B Mistral-7B (test) | Backdoor | ASR | 77 | 16 | 1mo ago |
Showing 25 of 121 rows