Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Persona

Benchmarks

Task NameDataset NameSOTA ResultTrend
Watch duration predictionPersona B
SMAPE0.86
4
Watch duration predictionPersona A
SMAPE0.617
4
Dialogue Quality EvaluationPersona High Info
BF1 (qt, at)0.62
1
Dialogue Quality EvaluationPersona Med. Info
BF1 (qt, at)61
1
Dialogue Quality EvaluationPersona Low Info
BF1 (qt, at)61
1
Dialogue Quality EvaluationPersona Sing. Inst.
BF1 (qt, at)0.58
1
Showing 6 of 6 rows