Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Short Text Generation on Hotel Experience
Loading...
15
ROUGE-1
GraSPeR
10.216
11.458
12.7
13.942
Apr 27, 2026
ROUGE-1
ROUGE-L
METEOR
LLM-as-a-Judge Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
ROUGE-1
ROUGE-L
METEOR
LLM-as-a-Judge Score
GraSPeR
LLM=Qwen3
2026.04
15
14.5
11.8
3.282
PAT
LLM=Qwen3
2026.04
13.3
12.3
11.8
3.739
PAT
LLM=LlaMA3
2026.04
12.6
12.1
8.5
3.462
PGraph
LLM=LlaMA3
2026.04
12.5
11.6
11.1
3.196
PGraph
LLM=Qwen3
2026.04
12.2
11.4
10.6
3.651
GraSPeR
LLM=LlaMA3
2026.04
12
11.7
10.8
3.087
LaMP
LLM=Qwen3
2026.04
11.2
10.4
9.6
3.512
LaMP
LLM=LlaMA3
2026.04
10.4
9.6
10
3.051
Feedback
Search any
task
Search any
task