Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Dialogue Generation on DollyEval
Loading...
24.19
ROUGE-L
XPERT-OLMoE
21.1948
21.9724
22.75
23.5276
May 9, 2026
ROUGE-L
Updated 21d ago
Evaluation Results
Method
Method
Links
ROUGE-L
XPERT-OLMoE
#Params=570M, #Size=di...
2026.05
24.19
XPERT-OLMoE
#Params=270M, #Size=di...
2026.05
23.67
XPERT-DeepSeek
#Params=570M, #Size=di...
2026.05
23.42
XPERT-DeepSeek
#Params=480M, #Size=di...
2026.05
23.38
XPERT-OLMoE
#Params=480M, #Size=di...
2026.05
23.36
XPERT-DeepSeek
#Params=391M, #Size=di...
2026.05
23.35
XPERT-DeepSeek
#Params=270M, #Size=di...
2026.05
23.22
XPERT-OLMoE
#Params=391M, #Size=di...
2026.05
23.17
Scratch
#Params=391M, #Size=di...
2026.05
22.94
Distillation
#Params=480M, #Size=di...
2026.05
22.9
Scratch
#Params=570M, #Size=di...
2026.05
22.86
Distillation
#Params=391M, #Size=di...
2026.05
22.78
Scratch
#Params=480M, #Size=di...
2026.05
22.48
Distillation
#Params=270M, #Size=di...
2026.05
22.43
Distillation
#Params=570M, #Size=di...
2026.05
22.09
Scratch
#Params=270M, #Size=di...
2026.05
21.31
Feedback
Search any
task
Search any
task