Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

OR-QUAC

Benchmarks

Task NameDataset NameSOTA ResultTrend
Conversational Response GenerationOR-QUAC (test)
F1 Score17.8
9
Open-Domain Conversational Question AnsweringOR-QuAC (test)
F1 Score36.84
4
Knowledge-Grounded DialogOR-QUAC
BLEU-47.76
3
Showing 3 of 3 rows