GenHeld: Generating and Editing Handheld Objects

Min, Chaerin; Sridhar, Srinath

Computer Science > Computer Vision and Pattern Recognition

arXiv:2406.05059v2 (cs)

[Submitted on 7 Jun 2024 (v1), revised 10 Jun 2024 (this version, v2), latest version 15 Jun 2024 (v3)]

Title:GenHeld: Generating and Editing Handheld Objects

Authors:Chaerin Min, Srinath Sridhar

View PDF HTML (experimental)

Abstract:Grasping is an important human activity that has long been studied in robotics, computer vision, and cognitive science. Most existing works study grasping from the perspective of synthesizing hand poses conditioned on 3D or 2D object representations. We propose GenHeld to address the inverse problem of synthesizing held objects conditioned on 3D hand model or 2D image. Given a 3D model of hand, GenHeld 3D can select a plausible held object from a large dataset using compact object representations called object this http URL selected object is then positioned and oriented to form a plausible grasp without changing hand pose. If only a 2D hand image is available, GenHeld 2D can edit this image to add or replace a held object. GenHeld 2D operates by combining the abilities of GenHeld 3D with diffusion-based image editing. Results and experiments show that we outperform baselines and can generate plausible held objects in both 2D and 3D. Our experiments demonstrate that our method achieves high quality and plausibility of held object synthesis in both 3D and 2D.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.05059 [cs.CV]
	(or arXiv:2406.05059v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.05059

Submission history

From: Chaerin Min [view email]
[v1] Fri, 7 Jun 2024 16:31:41 UTC (16,966 KB)
[v2] Mon, 10 Jun 2024 17:23:32 UTC (16,979 KB)
[v3] Sat, 15 Jun 2024 03:10:59 UTC (16,976 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:GenHeld: Generating and Editing Handheld Objects

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:GenHeld: Generating and Editing Handheld Objects

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators