Semi-supervised Hand Appearance Recovery via Structure Disentanglement and Dual Adversarial Discrimination

About

Enormous hand images with reliable annotations are collected through marker-based MoCap. Unfortunately, degradations caused by markers limit their application in hand appearance reconstruction. A clear appearance recovery insight is an image-to-image translation trained with unpaired data. However, most frameworks fail because there exists structure inconsistency from a degraded hand to a bare one. The core of our approach is to first disentangle the bare hand structure from those degraded images and then wrap the appearance to this structure with a dual adversarial discrimination (DAD) scheme. Both modules take full advantage of the semi-supervised learning paradigm: the structure disentanglement benefits from the modeling ability of ViT, and the translator is enhanced by the dual discrimination on both translation processes and translation results. Comprehensive evaluations have been conducted to prove that our framework can robustly recover photo-realistic hand appearance from diverse marker-contained and even object-occluded datasets. It provides a novel avenue to acquire bare hand appearance data for other downstream learning problems. The code will be publicly available at https://www.yangangwang.com

Zimeng Zhao, Binghui Zuo, Zhiyu Long, Yangang Wang • 2023

Related benchmarks

Task	Dataset	Metric	Result	Rank
Hand Appearance Recovery	A1 → B (marker-contained to bare hand)	FID (Initial)	60.37	6
Hand Appearance Recovery	A2 → B (object-occluded to bare hand)	FID (Input)	41.53	6
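
The FID values in the table are Fréchet distances between Gaussian fits of image features, where lower means the two image sets are closer. The standard metric uses Inception network features and full covariance matrices, d² = ||μ_a − μ_b||² + Tr(Σ_a + Σ_b − 2(Σ_a Σ_b)^(1/2)). The sketch below is a simplified illustration only: it assumes diagonal covariances so that it needs nothing beyond the standard library, and the function names `feature_stats` and `frechet_distance_diag` are hypothetical. It is not the evaluation code behind the numbers above.

```python
import math


def feature_stats(features):
    """Per-dimension mean and population variance of a list of feature vectors."""
    n = len(features)
    dim = len(features[0])
    mu = [sum(f[d] for f in features) / n for d in range(dim)]
    var = [sum((f[d] - mu[d]) ** 2 for f in features) / n for d in range(dim)]
    return mu, var


def frechet_distance_diag(mu_a, var_a, mu_b, var_b):
    """Squared Frechet distance between two Gaussians.

    ASSUMPTION: both covariances are diagonal, so the trace term
    Tr(S_a + S_b - 2 (S_a S_b)^(1/2)) reduces to a per-dimension sum.
    Real FID uses full covariances and a matrix square root.
    """
    if not (len(mu_a) == len(var_a) == len(mu_b) == len(var_b)):
        raise ValueError("all inputs must have the same length")
    mean_term = sum((ma - mb) ** 2 for ma, mb in zip(mu_a, mu_b))
    cov_term = sum(
        va + vb - 2.0 * math.sqrt(va * vb) for va, vb in zip(var_a, var_b)
    )
    return mean_term + cov_term


# Toy 2-D "features" standing in for network activations (made-up values).
set_a = [[0.0, 1.0], [2.0, 3.0]]
set_b = [[1.0, 1.0], [3.0, 5.0]]

mu_a, var_a = feature_stats(set_a)
mu_b, var_b = feature_stats(set_b)
distance = frechet_distance_diag(mu_a, var_a, mu_b, var_b)
print(distance)
```

Comparing a set with itself gives a distance of zero, which is why a lower FID is read as the recovered images being statistically closer to the bare-hand set.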
