
Data-to-text generation on RotoWire (test)

Factual Support Score: 7.57

Templ

[Chart: Factual Support Score over time, Sep 3, 2018 to Feb 28, 2022]
Updated 1mo ago

Evaluation Results

Method    Links
2021.02    7.57    0.08    -61.67    -52.92    -36.67
2022.02    7.57    0.08    -57.33    -55.33    -34.67
2018.09    6.98    0.21     -0.89     -4.89      1.33
2019.06    6.98    0.21     -3.7      -3.33     17.78
2021.02    5.08    0.67     13.33      4.58      3.75
2022.02    5.08    0.67    61.33-0.67
2018.09    4.9     0.9      -2.44     -2.44     -3.55
2019.06    4.9     0.9      -3.33     -3.7      -3.7
2022.02    4.84    0.17    420.6710.67
2019.06    4.77    0.8       2.96      3.7      -3.33
2021.02    4       0.27    510.426.67
2022.02    4       0.27      0.67      7.33     10
2021.02    3.92    0.91      5        -8.33     -4.58
2022.02    3.92    0.91      4       -14.67    -13.33
2021.02    3.63    0.07     38.33     46.25     30.83
2022.02    3.63    0.07     42.67     40.67     28
2018.09    3.19    1.09     -4.22     -4.89     -6.44
2018.09    2.98    0.28     11.78     16        13.78
2019.06    2.98    0.28      4.07      3.33    -10.74