Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

P4G

Benchmarks

Task NameDataset NameSOTA ResultTrend
Charity PersuasionP4G User Simulation
Success Rate (SR)78
16
Proactive dialogueP4G
SR96.67
10
Proactive dialogueP4G+
Success Rate (SR)59.17
9
Strategy PredictionP4G
Macro F10.1496
6
Dialogue Response GenerationP4G (test)
Accuracy89
3
Interactive Persuasive DialogueP4G (interactive)
Competence4.21
2
Showing 6 of 6 rows