Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Long-Form Question Answering on MM-Telco Telecom Blog
Loading...
27
ROUGE-1
Qwen2.5VL 7B
14.52
17.76
21
24.24
Nov 17, 2025
ROUGE-1
ROUGE-2
ROUGE-L
Semantic Score
BLEU
Updated 1mo ago
Evaluation Results
Method
Method
Links
ROUGE-1
ROUGE-2
ROUGE-L
Semantic Score
BLEU
Qwen2.5VL 7B
Parameters=7B
2025.11
27
11
20
83
5.65
LLama3.1 8B
Parameters=8B
2025.11
24
11
19
84
5.72
LLama3.2 3B
Parameters=3B
2025.11
20
9
16
74
4.41
GPT-4o
2025.11
19
9
15
85
4.36
phi4 14B
Parameters=14B
2025.11
16
7
12
84
2.83
Nemotron 70B
Parameters=70B
2025.11
15
7
11
84
2.56
Feedback
Search any
task
Search any
task