Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Personalized Review Writing on LongLaMP
Loading...
27.84
ROUGE-L
PerCE
18.6256
21.0178
23.41
25.8022
Feb 4, 2026
ROUGE-L
METEOR
Updated 1mo ago
Evaluation Results
Method
Method
Links
ROUGE-L
METEOR
PerCE
Model=Qwen3-14B
2026.02
27.84
27.17
PerCE
Model=Llama3-8B
2026.02
27.71
27.13
PerCE
Model=Qwen3-4B
2026.02
26.68
26.6
LossCE
Model=Qwen3-14B
2026.02
26.28
23.48
EntCE
Model=Qwen3-14B
2026.02
25.68
22.79
LossCE
Model=Llama3-8B
2026.02
25.31
21.56
EntCE
Model=Llama3-8B
2026.02
24.69
23.47
CE
Model=Qwen3-14B
2026.02
23.21
25.56
CE
Model=Llama3-8B
2026.02
23.11
21.43
EntCE
Model=Qwen3-4B
2026.02
21.93
19.38
LossCE
Model=Qwen3-4B
2026.02
21.83
19.58
CE
Model=Qwen3-4B
2026.02
18.98
15.83
Feedback
Search any
task
Search any
task