Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Memory Based Video Scene Parsing

About

Video scene parsing is a long-standing challenging task in computer vision, aiming to assign pre-defined semantic labels to pixels of all frames in a given video. Compared with image semantic segmentation, this task pays more attention on studying how to adopt the temporal information to obtain higher predictive accuracy. In this report, we introduce our solution for the 1st Video Scene Parsing in the Wild Challenge, which achieves a mIoU of 57.44 and obtained the 2nd place (our team name is CharlesBLWX).

Zhenchao Jin, Dongdong Yu, Kai Su, Zehuan Yuan, Changhu Wang• 2021

Related benchmarks

TaskDatasetResultRank
Video Semantic SegmentationVSPW (val)
mIoU61.44
92
Video Semantic SegmentationVSPW old codalab (test)
mIoU (%)57.44
5
Showing 2 of 2 rows

Other info

Follow for update