Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multi-turn Dialogue Evaluation on MT-Eval (Expansion, Follow-up, Recollection, Refinement)

7.34Expansion Score

SFT

3.51284.50645.56.4936Aug 1, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.08
7.348.16.245.686.84
2024.08
7.038.376.635.967
2024.08
6.87.155.555.176.17
2024.08
6.637.715.635.296.32
2024.08
6.577.725.955.366.4
2024.08
6.437.555.324.315.9
2024.08
6.217.15.135.035.87
2024.08
4.76.033.842.924.37
2024.08
3.664.231.392.092.84