Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Text Generation on Human Evaluation 6 tasks (10 pairs sampled per task)
Loading...
0.311
Fluency Win Rate
ADS
0.2018
0.23015
0.2585
0.28685
May 8, 2023
Fluency Win Rate
Fluency Loss Rate
Fluency Tie Rate
Fluency Zeta Score (ζ)
Coherence Win Rate
Coherence Loss Rate
Coherence Tie Rate
Coherence Zeta Score (ζ)
Relevance Win Rate
Relevance Loss Rate
Relevance Tie Rate
Relevance Zeta Score (ζ)
Updated 4d ago
Evaluation Results
Method
Method
Links
Fluency Win Rate
Fluency Loss Rate
Fluency Tie Rate
Fluency Zeta Score (ζ)
Coherence Win Rate
Coherence Loss Rate
Coherence Tie Rate
Coherence Zeta Score (ζ)
Relevance Win Rate
Relevance Loss Rate
Relevance Tie Rate
Relevance Zeta Score (ζ)
ADS
Steps=20
2023.05
0.311
0.15
0.539
0.666
0.328
0.172
0.5
0.74
0.267
0.156
0.577
0.817
ADS
Steps=2000
2023.05
0.206
0.138
0.656
0.859
0.217
0.128
0.655
0.71
0.278
0.161
0.561
0.878
Feedback
Search any
task
Search any
task