
Text Generation Evaluation Correlation on WebText (test)

[Chart: Bradley-Terry Score (Interesting) by date (Feb 2, 2021). Labeled point: Perplexity (PPL), 0.643.]
Updated 1mo ago

Evaluation Results

Method    Links
2021.02   0.643   0.524   -0.143   52.4   40.5   0.81
2021.02   0.738   0.69    -0.071   59.5   52.4   0.857
2021.02   0.81    0.833   -0.167   73.8   59.5   0.952