Text Generation Quality on OpenAI Summaries
[Chart: MAUVE by date. Highlighted point: Vanilla, MAUVE 0.81, May 28, 2026.]
Updated 2d ago
Evaluation Results
| Method   | Backbone         | Date    | MAUVE |
|----------|------------------|---------|-------|
| Vanilla  | Mistral-7B-In... | 2026.05 | 0.81  |
| DT-CD⋆   | Mistral-7B-In... | 2026.05 | 0.81  |
| COFT     | Mistral-7B-In... | 2026.05 | 0.81  |
| SDD      | Mistral-7B-In... | 2026.05 | 0.8   |
| Vanilla  | LLaMA-2-13B      | 2026.05 | 0.79  |
| COFT     | LLaMA-2-13B      | 2026.05 | 0.79  |
| DExperts | Mistral-7B-In... | 2026.05 | 0.79  |
| SDD      | LLaMA-2-13B      | 2026.05 | 0.78  |
| DT-CD⋆   | LLaMA-2-13B      | 2026.05 | 0.78  |
| DExperts | LLaMA-2-13B      | 2026.05 | 0.77  |