Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Role-playing evaluation

Benchmarks

Task NameDataset NameSOTA ResultTrend
Role-playingRole-playing evaluation (Main characters)
ROUGE-L (Haruhi)83.88
12
Role-playingRole-playing evaluation (Minor characters)
K-On! ROUGE-L21.21
5
Showing 2 of 2 rows