Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CDGNet: Class Distribution Guided Network for Human Parsing

About

The objective of human parsing is to partition a human in an image into constituent parts. This task involves labeling each pixel of the human image according to the classes. Since the human body comprises hierarchically structured parts, each body part of an image can have its sole position distribution characteristic. Probably, a human head is less likely to be under the feet, and arms are more likely to be near the torso. Inspired by this observation, we make instance class distributions by accumulating the original human parsing label in the horizontal and vertical directions, which can be utilized as supervision signals. Using these horizontal and vertical class distribution labels, the network is guided to exploit the intrinsic position distribution of each class. We combine two guided features to form a spatial guidance map, which is then superimposed onto the baseline network by multiplication and concatenation to distinguish the human parts precisely. We conducted extensive experiments to demonstrate the effectiveness and superiority of our method on three well-known benchmarks: LIP, ATR, and CIHP databases.

Kunliang Liu, Ouk Choi, Jianming Wang, Wonjun Hwang• 2021

Related benchmarks

TaskDatasetResultRank
Human ParsingLIP (val)
mIoU60.3
111
Human ParsingLIP
mIoU60.3
39
Human ParsingLIP (test)
mIoU60.3
25
Human ParsingCIHP
mIoU65.56
18
Human ParsingATR (test)
Overall Accuracy97.39
14
Human ParsingLIP 62
mIoU60.3
13
Human ParsingLIP 62 (test)
mIoU60.3
13
Human ParsingCIHP (val)
mIoU65.56
12
Human ParsingCIHP 99
mIoU65.56
11
Human ParsingCIHP 99 (test)
mIoU65.56
11
Showing 10 of 10 rows

Other info

Code

Follow for update