Share your thoughts, 1 month free Claude Pro on usSee more

Long Text Generation on Stylized Feedback Generation Benchmark

24.2R-1

PAT

Updated 2mo ago

Evaluation Results

Method	Links
PAT 2026.04		24.2	17.9	21	3.357
PAT 2026.04		23.1	17.5	17.9	3.171
PGraph 2026.04		21.3	15.3	19.1	3.685
PGraph 2026.04		21.1	15.2	17.5	3.29
GraSPeR 2026.04		19.6	14.9	16	3.04
GraSPeR 2026.04		19.1	15.4	13.4	2.93
LaMP 2026.04		18.1	12.2	16.4	2.873
LaMP 2026.04		17.9	12.4	16.1	3.107