Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Semantic Neural Tree for Human Parsing

About

The majority of existing human parsing methods formulate the task as semantic segmentation, which regard each semantic category equally and fail to exploit the intrinsic physiological structure of human body, resulting in inaccurate results. In this paper, we design a novel semantic neural tree for human parsing, which uses a tree architecture to encode physiological structure of human body, and designs a coarse to fine process in a cascade manner to generate accurate results. Specifically, the semantic neural tree is designed to segment human regions into multiple semantic subregions (e.g., face, arms, and legs) in a hierarchical way using a new designed attention routing module. Meanwhile, we introduce the semantic aggregation module to combine multiple hierarchical features to exploit more context information for better performance. Our semantic neural tree can be trained in an end-to-end fashion by standard stochastic gradient descent (SGD) with back-propagation. Several experiments conducted on four challenging datasets for both single and multiple human parsing, i.e., LIP, PASCAL-Person-Part, CIHP and MHP-v2, demonstrate the effectiveness of the proposed method. Code can be found at https://isrc.iscas.ac.cn/gitlab/research/sematree.

Ruyi Ji, Dawei Du, Libo Zhang, Longyin Wen, Yanjun Wu, Chen Zhao, Feiyue Huang, Siwei Lyu• 2019

Related benchmarks

TaskDatasetResultRank
Human ParsingLIP (val)
mIoU54.73
111
Human Part ParsingPASCAL-Person-Part (test)
mIoU71.59
68
Human ParsingMHP v2.0 (val)
APp5034.4
27
Human ParsingLIP 62
mIoU54.73
13
Human ParsingLIP 62 (test)
mIoU54.73
13
Human ParsingCIHP (val)
mIoU60.87
12
Human ParsingCIHP 99
mIoU60.87
11
Human ParsingCIHP 99 (test)
mIoU60.87
11
Hierarchical Human ParsingPASCAL-Person-Part (test)
Head Accuracy89.15
9
Multi-Human ParsingMHP v2.0 (val)
Inference Time (ms)3.55e+3
9
Showing 10 of 10 rows

Other info

Follow for update