HOTVCOM: Generating Buzzworthy Comments for Videos
About
In the era of social media video platforms, popular ``hot-comments'' play a crucial role in attracting user impressions of short-form videos, making them vital for marketing and branding purpose. However, existing research predominantly focuses on generating descriptive comments or ``danmaku'' in English, offering immediate reactions to specific video moments. Addressing this gap, our study introduces \textsc{HotVCom}, the largest Chinese video hot-comment dataset, comprising 94k diverse videos and 137 million comments. We also present the \texttt{ComHeat} framework, which synergistically integrates visual, auditory, and textual data to generate influential hot-comments on the Chinese video dataset. Empirical evaluations highlight the effectiveness of our framework, demonstrating its excellence on both the newly constructed and existing datasets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Captioning | MSR-VTT (test) | CIDEr104.2 | 121 | |
| Video Captioning | MSVD (test) | CIDEr66.3 | 111 | |
| Video Comment Generation | TikTok English | Info Score83.46 | 8 | |
| Video Comment Generation | Livebot (test) | R@120.34 | 7 | |
| Video Comment Generation | HOTVCOM original (test) | Informativeness93.54 | 6 | |
| Video Comment Generation | VideoIC (test) | R@137.24 | 6 | |
| Video Comment Retrieval | MovieLC | R@110.34 | 5 | |
| Video Captioning | HOTVCOM 1.0 (test) | BLEU41.21 | 4 |