Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Rebuttal Generation on Human Evaluation Set (100 comments) 1.0 (test)

9.92Attitude Score

w GPT4.1-reward

7.27847.96428.659.3358Jan 22, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
9.929.629.289.549.59
2026.01
9.869.389.349.689.57
2026.01
9.328.88.79.148.99
2026.01
9.39.289.049.429.26
9.249.088.869.169.08
2026.01
9.168.98.849.078.96
2026.01
8.888.68.128.48.5
2026.01
7.386.86.36.56.75