Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Large-scale, Fast and Accurate Shot Boundary Detection through Spatio-temporal Convolutional Neural Networks

About

Shot boundary detection (SBD) is an important pre-processing step for video manipulation. Here, each segment of frames is classified as either sharp, gradual or no transition. Current SBD techniques analyze hand-crafted features and attempt to optimize both detection accuracy and processing speed. However, the heavy computations of optical flow prevents this. To achieve this aim, we present an SBD technique based on spatio-temporal Convolutional Neural Networks (CNN). Since current datasets are not large enough to train an accurate SBD CNN, we present a new dataset containing more than 3.5 million frames of sharp and gradual transitions. The transitions are generated synthetically using image compositing models. Our dataset contain additional 70,000 frames of important hard-negative no transitions. We perform the largest evaluation to date for one SBD algorithm, on real and synthetic data, containing more than 4.85 million frames. In comparison to the state of the art, we outperform dissolve gradual detection, generate competitive performance for sharp detections and produce significant improvement in wipes. In addition, we are up to 11 times faster than the state of the art.

Ahmed Hassanien, Mohamed Elgharib, Ahmed Selim, Sung-Ho Bae, Mohamed Hefeeda, Wojciech Matusik• 2017

Related benchmarks

TaskDatasetResultRank
Shot Boundary DetectionTRECVID 2007 (test)
Abrupt F-score0.9749
18
Shot Boundary DetectionRAI dataset
F1 Score94
14
Shot Boundary DetectionRAI (test)
F1 Score0.939
10
Shot Boundary Detection (Gradual)TRECVID 2007 (test)
Precision0.826
8
Shot Boundary DetectionClipShots
F1 Score75.9
5
Shot Boundary DetectionBBC
F1 Score92.6
5
Shot Boundary DetectionTRECVID 4,096s video (test)
Speed-up Factor19.3
4
Cut Transition DetectionClipShots
Precision76.5
4
Gradual Transition DetectionClipShots
Precision0.837
4
Shot Transition DetectionClipShots (test)
F1 Score75.9
4
Showing 10 of 21 rows

Other info

Follow for update