EPIC Fields: Marrying 3D Geometry and Video Understanding

Tschernezki, Vadim; Darkhalil, Ahmad; Zhu, Zhifan; Fouhey, David; Laina, Iro; Larlus, Diane; Damen, Dima; Vedaldi, Andrea

arXiv:2306.08731 (cs)

[Submitted on 14 Jun 2023 (v1), last revised 1 Feb 2024 (this version, v2)]

Abstract: Neural rendering is fuelling a unification of learning, 3D geometry and video understanding that has been waiting for more than two decades. Progress, however, is still hampered by a lack of suitable datasets and benchmarks. To address this gap, we introduce EPIC Fields, an augmentation of EPIC-KITCHENS with 3D camera information. Like other datasets for neural rendering, EPIC Fields removes the complex and expensive step of reconstructing cameras using photogrammetry, and allows researchers to focus on modelling problems. We illustrate the challenges of photogrammetry in egocentric videos of dynamic actions and propose innovations to address them. Compared to other neural rendering datasets, EPIC Fields is better tailored to video understanding because it is paired with labelled action segments and the recent VISOR segment annotations. To further motivate the community, we also evaluate two benchmark tasks in neural rendering and segmenting dynamic objects, with strong baselines that showcase what is not possible today. We also highlight the advantage of geometry in semi-supervised video object segmentation on the VISOR annotations. EPIC Fields reconstructs 96% of videos in EPIC-KITCHENS, registering 19M frames in 99 hours recorded in 45 kitchens.

Comments:	Published at NeurIPS 2023. 24 pages, 15 figures. Project Webpage: this http URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2306.08731 [cs.CV]
	(or arXiv:2306.08731v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.08731

Submission history

From: Dima Damen
[v1] Wed, 14 Jun 2023 20:33:49 UTC (34,663 KB)
[v2] Thu, 1 Feb 2024 09:59:34 UTC (33,391 KB)
