Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine Perception

Pan, Xiaqing; Charron, Nicholas; Yang, Yongqian; Peters, Scott; Whelan, Thomas; Kong, Chen; Parkhi, Omkar; Newcombe, Richard; Ren, Carl Yuheng

Computer Science > Computer Vision and Pattern Recognition

arXiv:2306.06362 (cs)

[Submitted on 10 Jun 2023 (v1), last revised 13 Jun 2023 (this version, v2)]

Title:Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine Perception

Authors:Xiaqing Pan, Nicholas Charron, Yongqian Yang, Scott Peters, Thomas Whelan, Chen Kong, Omkar Parkhi, Richard Newcombe, Carl Yuheng Ren

View PDF

Abstract:We introduce the Aria Digital Twin (ADT) - an egocentric dataset captured using Aria glasses with extensive object, environment, and human level ground truth. This ADT release contains 200 sequences of real-world activities conducted by Aria wearers in two real indoor scenes with 398 object instances (324 stationary and 74 dynamic). Each sequence consists of: a) raw data of two monochrome camera streams, one RGB camera stream, two IMU streams; b) complete sensor calibration; c) ground truth data including continuous 6-degree-of-freedom (6DoF) poses of the Aria devices, object 6DoF poses, 3D eye gaze vectors, 3D human poses, 2D image segmentations, image depth maps; and d) photo-realistic synthetic renderings. To the best of our knowledge, there is no existing egocentric dataset with a level of accuracy, photo-realism and comprehensiveness comparable to ADT. By contributing ADT to the research community, our mission is to set a new standard for evaluation in the egocentric machine perception domain, which includes very challenging research problems such as 3D object detection and tracking, scene reconstruction and understanding, sim-to-real learning, human pose prediction - while also inspiring new machine perception tasks for augmented reality (AR) applications. To kick start exploration of the ADT research use cases, we evaluated several existing state-of-the-art methods for object detection, segmentation and image translation tasks that demonstrate the usefulness of ADT as a benchmarking dataset.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2306.06362 [cs.CV]
	(or arXiv:2306.06362v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.06362

Submission history

From: Xiaqing Pan [view email]
[v1] Sat, 10 Jun 2023 06:46:32 UTC (40,261 KB)
[v2] Tue, 13 Jun 2023 06:38:47 UTC (40,261 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine Perception

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine Perception

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators