Computer Science > Computer Vision and Pattern Recognition
[Submitted on 3 Mar 2022 (this version), latest version 3 Jan 2024 (v4)]
Title:HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction
View PDFAbstract:We present HOI4D, a large-scale 4D egocentric dataset with rich annotations, to catalyze the research of category-level human-object interaction. HOI4D consists of 3M RGB-D egocentric video frames over 5000 sequences collected by 9 participants interacting with 1000 different object instances from 20 categories over 610 different indoor rooms. Frame-wise annotations for panoptic segmentation, motion segmentation, 3D hand pose, category-level object pose and hand action have also been provided, together with reconstructed object meshes and scene point clouds. With HOI4D, we establish three benchmarking tasks to promote category-level HOI from 4D visual signals including semantic segmentation of 4D dynamic point cloud sequences, category-level object pose tracking, and egocentric action segmentation with diverse interaction targets. In-depth analysis shows HOI4D poses great challenges to existing methods and produces great research opportunities. We will release the dataset soon.
Submission history
From: Yunze Liu [view email][v1] Thu, 3 Mar 2022 09:02:52 UTC (23,834 KB)
[v2] Tue, 29 Mar 2022 06:51:56 UTC (24,281 KB)
[v3] Fri, 8 Apr 2022 08:34:00 UTC (24,664 KB)
[v4] Wed, 3 Jan 2024 14:31:13 UTC (12,330 KB)
References & Citations
Bibliographic and Citation Tools
Bibliographic Explorer (What is the Explorer?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Code, Data and Media Associated with this Article
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)
Demos
Recommenders and Search Tools
Influence Flower (What are Influence Flowers?)
Connected Papers (What is Connected Papers?)
CORE Recommender (What is CORE?)
arXivLabs: experimental projects with community collaborators
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.