SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction

Swamy, Anilkumar; Leroy, Vincent; Weinzaepfel, Philippe; Baradel, Fabien; Galaaoui, Salma; Bregier, Romain; Armando, Matthieu; Franco, Jean-Sebastien; Rogez, Gregory

arXiv:2309.10748 (cs)

[Submitted on 19 Sep 2023]

Abstract: Recent hand-object interaction datasets show limited real object variability and rely on fitting the MANO parametric model to obtain ground-truth hand shapes. To go beyond these limitations and spur further research, we introduce the SHOWMe dataset, which consists of 96 videos annotated with real and detailed hand-object 3D textured meshes. Following recent work, we consider a rigid hand-object scenario, in which the pose of the hand with respect to the object remains constant during the whole video sequence. This assumption allows us to register sub-millimetre-precise ground-truth 3D scans to the image sequences in SHOWMe. Although simpler, this hypothesis makes sense for applications where accuracy and level of detail are important, e.g., object hand-over in human-robot collaboration, object scanning, or manipulation and contact point analysis. Importantly, the rigidity of the hand-object system allows us to tackle video-based 3D reconstruction of unknown hand-held objects using a 2-stage pipeline consisting of a rigid registration step followed by a multi-view reconstruction (MVR) step. We carefully evaluate a set of non-trivial baselines for these two stages and show that it is possible to achieve promising object-agnostic 3D hand-object reconstructions by using an SfM toolbox or a hand pose estimator to recover the rigid transforms, together with off-the-shelf MVR algorithms. However, these methods remain sensitive to the initial camera pose estimates, which may be imprecise due to a lack of texture on the objects or heavy occlusion of the hands, leaving room for improvement in the reconstruction. Code and dataset are available at this https URL
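
As an illustration of the rigid registration step mentioned in the abstract, the following is a minimal sketch of the standard Kabsch (SVD-based) least-squares alignment between two sets of corresponding 3D points, for example hand keypoints predicted in a reference frame and in another frame of the same video. It is not taken from the paper or its code release: the function name `estimate_rigid_transform`, the choice of 21 keypoints, and the synthetic data are assumptions made purely for illustration, and the baselines evaluated by the authors may recover the transforms differently.

```python
import numpy as np


def estimate_rigid_transform(src, dst):
    """Least-squares rigid transform (R, t) with dst ~= src @ R.T + t.

    Hypothetical helper implementing the Kabsch algorithm; `src` and `dst`
    are (N, 3) arrays of corresponding 3D points.
    """
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)

    # Centre both point sets on their centroids.
    src_centroid = src.mean(axis=0)
    dst_centroid = dst.mean(axis=0)

    # 3x3 cross-covariance between the centred point sets.
    H = (src - src_centroid).T @ (dst - dst_centroid)
    U, _, Vt = np.linalg.svd(H)

    # Flip the last axis if needed so that R is a proper rotation (det = +1).
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = dst_centroid - R @ src_centroid
    return R, t


if __name__ == "__main__":
    # Synthetic stand-in for 21 hand keypoints (an assumed keypoint count).
    rng = np.random.default_rng(0)
    joints_ref = rng.normal(size=(21, 3))

    # Ground-truth motion of the rigid hand-object system between two frames.
    angle = 0.3
    R_true = np.array([
        [np.cos(angle), -np.sin(angle), 0.0],
        [np.sin(angle), np.cos(angle), 0.0],
        [0.0, 0.0, 1.0],
    ])
    t_true = np.array([0.1, -0.2, 0.5])
    joints_cur = joints_ref @ R_true.T + t_true

    R_est, t_est = estimate_rigid_transform(joints_ref, joints_cur)
    print("rotation error:", np.abs(R_est - R_true).max())
    print("translation error:", np.abs(t_est - t_true).max())
```

Because the hand is assumed rigid with respect to the object, a transform estimated from hand keypoints alone also describes the motion of the object, which is what lets a multi-view reconstruction method treat the video frames as views of a static scene. With noisy keypoints the estimate degrades, which is consistent with the abstract's remark that the reconstruction stage is sensitive to the initial pose estimates.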

Comments:	Paper and appendix; accepted at the ACVR workshop at ICCV
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2309.10748 [cs.CV]
	(or arXiv:2309.10748v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2309.10748

Submission history

From: Anilkumar Swamy
[v1] Tue, 19 Sep 2023 16:48:29 UTC (34,757 KB)
