Mesh Graphormer

About

We present a graph-convolution-reinforced transformer, named Mesh Graphormer, for 3D human pose and mesh reconstruction from a single image. Recently both transformers and graph convolutional neural networks (GCNNs) have shown promising progress in human mesh reconstruction. Transformer-based approaches are effective in modeling non-local interactions among 3D mesh vertices and body joints, whereas GCNNs are good at exploiting neighborhood vertex interactions based on a pre-specified mesh topology. In this paper, we study how to combine graph convolutions and self-attentions in a transformer to model both local and global interactions. Experimental results show that our proposed method, Mesh Graphormer, significantly outperforms the previous state-of-the-art methods on multiple benchmarks, including Human3.6M, 3DPW, and FreiHAND datasets. Code and pre-trained models are available at https://github.com/microsoft/MeshGraphormer

Kevin Lin, Lijuan Wang, Zicheng Liu• 2021

Related benchmarks

Task	Dataset	Result
3D Human Pose Estimation	Human3.6M (test)	--	570
3D Human Pose Estimation	3DPW (test)	PA-MPJPE45.6	514
3D Human Mesh Recovery	3DPW (test)	MPJPE74.7	341
3D Human Pose Estimation	Human3.6M	MPJPE51.2	193
3D Hand Reconstruction	FreiHAND (test)	F@15mm98.7	165
Human Mesh Recovery	3DPW	PA-MPJPE45.6	159
3D Human Pose and Shape Estimation	3DPW (test)	MPJPE-PA45.6	158
3D Human Mesh Recovery	Human3.6M (test)	PA-MPJPE34.5	145
3D Human Pose Estimation	3DPW	PA-MPJPE45.6	137
3D Human Pose and Shape Estimation	Human3.6M (test)	PA-MPJPE34.5	119

Showing 10 of 52 rows

Other info

Code

Follow for update

@wizwand_team Discord