Dialogue Generation on Vicuna
[Chart: Rouge-L over time. Top entry: XPERT-OLMoE, Rouge-L 15.05, May 9, 2026]
Updated 21d ago
Evaluation Results
| Method | Details | Date | Rouge-L |
|---|---|---|---|
| XPERT-OLMoE | #Params=391M, #Size=di... | 2026.05 | 15.05 |
| XPERT-OLMoE | #Params=480M, #Size=di... | 2026.05 | 14.83 |
| XPERT-OLMoE | #Params=570M, #Size=di... | 2026.05 | 14.57 |
| XPERT-OLMoE | #Params=270M, #Size=di... | 2026.05 | 14.46 |
| Distillation | #Params=480M, #Size=di... | 2026.05 | 14.39 |
| XPERT-DeepSeek | #Params=570M, #Size=di... | 2026.05 | 14.37 |
| Scratch | #Params=391M, #Size=di... | 2026.05 | 14.25 |
| Scratch | #Params=480M, #Size=di... | 2026.05 | 14.2 |
| Distillation | #Params=570M, #Size=di... | 2026.05 | 14.09 |
| XPERT-DeepSeek | #Params=391M, #Size=di... | 2026.05 | 13.99 |
| XPERT-DeepSeek | #Params=480M, #Size=di... | 2026.05 | 13.92 |
| Scratch | #Params=270M, #Size=di... | 2026.05 | 13.77 |
| Distillation | #Params=270M, #Size=di... | 2026.05 | 13.52 |
| Scratch | #Params=570M, #Size=di... | 2026.05 | 13.42 |
| XPERT-DeepSeek | #Params=270M, #Size=di... | 2026.05 | 13.25 |
| Distillation | #Params=391M, #Size=di... | 2026.05 | 13.01 |