Text Generation Evaluation Correlation on WebText (test)
[Chart: Perplexity (PPL) for Bradley-Terry Score (Interesting); plotted value 0.643, dated Feb 2, 2021]
Evaluation Results
| Method | Details | Date | Perplexity (PPL) | Zipf Coefficient | Repetition Score (REP) | Distinct-4 | Self-BLEU | MAUVE |
|---|---|---|---|---|---|---|---|---|
| Bradley-Terry Score (Interesting) | Human Evaluation Crite... | 2021.02 | 0.643 | 0.524 | -0.143 | 52.4 | 40.5 | 0.81 |
| Bradley-Terry Score (Sensible) | Human Evaluation Crite... | 2021.02 | 0.738 | 0.69 | -0.071 | 59.5 | 52.4 | 0.857 |
| Bradley-Terry Score (Human-like) | Human Evaluation Crite... | 2021.02 | 0.81 | 0.833 | -0.167 | 73.8 | 59.5 | 0.952 |