Decoupled Iterative Refinement Framework for Interacting Hands Reconstruction from a Single RGB Image

Ren, Pengfei; Wen, Chao; Zheng, Xiaozheng; Xue, Zhou; Sun, Haifeng; Qi, Qi; Wang, Jingyu; Liao, Jianxin


arXiv:2302.02410 (cs)

[Submitted on 5 Feb 2023 (v1), last revised 21 Aug 2023 (this version, v2)]

Abstract: Reconstructing interacting hands from a single RGB image is a very challenging task. On the one hand, severe mutual occlusion and similar local appearance between the two hands confuse the extraction of visual features, resulting in misalignment between the estimated hand meshes and the image. On the other hand, there are complex spatial relationships between interacting hands, which significantly enlarge the solution space of hand poses and increase the difficulty of network learning. In this paper, we propose a decoupled iterative refinement framework to achieve pixel-aligned hand reconstruction while efficiently modeling the spatial relationship between hands. Specifically, we define two feature spaces with different characteristics, namely a 2D visual feature space and a 3D joint feature space. First, we obtain joint-wise features from the visual feature map and utilize a graph convolution network and a transformer to perform intra- and inter-hand information interaction in the 3D joint feature space, respectively. Then, we project the joint features with global information back into the 2D visual feature space in an obfuscation-free manner and utilize 2D convolution for pixel-wise enhancement. By performing multiple alternate enhancements in the two feature spaces, our method achieves accurate and robust reconstruction of interacting hands. Our method outperforms all existing two-hand reconstruction methods by a large margin on the InterHand2.6M dataset.
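
The alternation between the two feature spaces described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: every function name below is hypothetical, neighbour averaging stands in for the graph convolution network, a single-head attention without learned weights stands in for the transformer, and a nearest-pixel write-back stands in for the paper's obfuscation-free projection and 2D convolution. The per-iteration update of joint positions is also omitted. It only shows the data flow: sample joint-wise features from the 2D map, mix them within and across hands, then write them back.

```python
import math

def bilinear_sample(fmap, x, y):
    """Sample a C-dim feature from fmap[H][W][C] at continuous pixel (x, y)."""
    h, w = len(fmap), len(fmap[0])
    x = min(max(x, 0.0), w - 1.0)
    y = min(max(y, 0.0), h - 1.0)
    x0, y0 = int(math.floor(x)), int(math.floor(y))
    x1, y1 = min(x0 + 1, w - 1), min(y0 + 1, h - 1)
    ax, ay = x - x0, y - y0
    return [
        (1 - ay) * ((1 - ax) * fmap[y0][x0][c] + ax * fmap[y0][x1][c])
        + ay * ((1 - ax) * fmap[y1][x0][c] + ax * fmap[y1][x1][c])
        for c in range(len(fmap[0][0]))
    ]

def graph_mix(feats, edges):
    """Intra-hand step (assumed stand-in for a GCN layer):
    average each joint with itself and its skeleton neighbours."""
    nbrs = [[i] for i in range(len(feats))]
    for a, b in edges:
        nbrs[a].append(b)
        nbrs[b].append(a)
    dim = len(feats[0])
    return [[sum(feats[j][c] for j in nb) / len(nb) for c in range(dim)]
            for nb in nbrs]

def cross_attend(queries, keys):
    """Inter-hand step (assumed stand-in for a transformer layer):
    each joint of one hand attends to all joints of the other hand."""
    d = len(queries[0])
    out = []
    for q in queries:
        scores = [sum(qc * kc for qc, kc in zip(q, k)) / math.sqrt(d)
                  for k in keys]
        m = max(scores)
        weights = [math.exp(s - m) for s in scores]
        z = sum(weights)
        ctx = [sum(wi / z * k[c] for wi, k in zip(weights, keys))
               for c in range(d)]
        out.append([qc + cc for qc, cc in zip(q, ctx)])  # residual connection
    return out

def write_back(fmap, joints_2d, feats):
    """Add each joint feature to the nearest pixel of the 2D feature map."""
    h, w = len(fmap), len(fmap[0])
    for (x, y), f in zip(joints_2d, feats):
        xi = min(max(int(round(x)), 0), w - 1)
        yi = min(max(int(round(y)), 0), h - 1)
        for c, v in enumerate(f):
            fmap[yi][xi][c] += v

def refine(fmap, left_2d, right_2d, edges, steps=2):
    """Alternate between joint-space mixing and 2D write-back `steps` times."""
    for _ in range(steps):
        left = graph_mix([bilinear_sample(fmap, x, y) for x, y in left_2d], edges)
        right = graph_mix([bilinear_sample(fmap, x, y) for x, y in right_2d], edges)
        left, right = cross_attend(left, right), cross_attend(right, left)
        write_back(fmap, left_2d, left)
        write_back(fmap, right_2d, right)
    return fmap
```

Here `fmap` is a nested list indexed as `fmap[row][col][channel]`, `left_2d` and `right_2d` are lists of `(x, y)` joint locations in pixel coordinates, and `edges` is a list of joint-index pairs describing a hand skeleton; all of these conventions are assumptions made for the sketch.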

Comments:	Accepted to ICCV 2023 (Oral)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2302.02410 [cs.CV]
	(or arXiv:2302.02410v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2302.02410

Submission history

From: Pengfei Ren
[v1] Sun, 5 Feb 2023 15:46:57 UTC (2,799 KB)
[v2] Mon, 21 Aug 2023 03:46:50 UTC (2,320 KB)
