Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Open-Ended Dialogue

Benchmarks

Task NameDataset NameSOTA ResultTrend
Open-Ended DialogueOpen-Ended Dialogue (out-of-distribution)
MT-Bench68.3
4
Open-Ended DialogueOpen-Ended Dialogue (in-distribution)
Helpful Score69.4
4
Showing 2 of 2 rows