Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SumGraph: Video Summarization via Recursive Graph Modeling

About

The goal of video summarization is to select keyframes that are visually diverse and can represent a whole story of an input video. State-of-the-art approaches for video summarization have mostly regarded the task as a frame-wise keyframe selection problem by aggregating all frames with equal weight. However, to find informative parts of the video, it is necessary to consider how all the frames of the video are related to each other. To this end, we cast video summarization as a graph modeling problem. We propose recursive graph modeling networks for video summarization, termed SumGraph, to represent a relation graph, where frames are regarded as nodes and nodes are connected by semantic relationships among frames. Our networks accomplish this through a recursive approach to refine an initially estimated graph to correctly classify each node as a keyframe by reasoning the graph representation via graph convolutional networks. To leverage SumGraph in a more practical environment, we also present a way to adapt our graph modeling in an unsupervised fashion. With SumGraph, we achieved state-of-the-art performance on several benchmarks for video summarization in both supervised and unsupervised manners.

Jungin Park, Jiyoung Lee, Ig-Jae Kim, Kwanghoon Sohn• 2020

Related benchmarks

TaskDatasetResultRank
Video SummarizationTVSum
F-Measure65.8
213
Video SummarizationSumMe--
130
Video SummarizationTVSum
Kendall's Tau0.094
55
Showing 3 of 3 rows

Other info

Follow for update