Language Modeling on AG News (test)
Top entry: AR, perplexity 52.09
[Chart: perplexity (y-axis) by date (x-axis), Oct 28, 2024 to Oct 22, 2025]
Updated 4d ago
Evaluation Results
Method       Settings                    Date     Perplexity
AR           zero-shot=true              2024.10  52.09
GPT-2        Zero-shot=true, Traine...   2025.05  52.09
EDLM-AR      zero-shot=true              2024.10  57.27
EDLM-coAR    zero-shot=true              2024.10  57.94
EDLM-NCE     zero-shot=true              2024.10  60.02
MLDM         zero-shot=true              2024.10  61.15
SEDD         zero-shot=true              2024.10  62.09
LDDM-M       Diffusion Framework=Ma...   2025.10  62.55
VADD         Zero-shot=true, Traine...   2025.05  68
MDLM         Zero-shot=true, Traine...   2025.05  68.57
MDLM         Diffusion Framework=Ma...   2025.10  68.62
SEDD Absorb  Diffusion Framework=Ma...   2025.10  76.54
UDLM         Diffusion Framework=Un...   2025.10  76.81
LDDM-U       Diffusion Framework=Un...   2025.10  76.81