Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Open-ended Text Generation on MT-Bench

3.7LLM Judge Score

CTC-trained MDLM

2.142.5452.953.355Jan 30, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.01
3.7
2026.01
3.3
2026.01
2.8
2026.01
2.2