RT-Pose: A 4D Radar Tensor-based 3D Human Pose Estimation and Localization Benchmark

Ho, Yuan-Hao; Cheng, Jen-Hao; Kuan, Sheng Yao; Jiang, Zhongyu; Chai, Wenhao; Huang, Hsiang-Wei; Lin, Chih-Lung; Hwang, Jenq-Neng

Computer Science > Computer Vision and Pattern Recognition

arXiv:2407.13930 (cs)

[Submitted on 18 Jul 2024]

Title:RT-Pose: A 4D Radar Tensor-based 3D Human Pose Estimation and Localization Benchmark

Authors:Yuan-Hao Ho, Jen-Hao Cheng, Sheng Yao Kuan, Zhongyu Jiang, Wenhao Chai, Hsiang-Wei Huang, Chih-Lung Lin, Jenq-Neng Hwang

View PDF HTML (experimental)

Abstract:Traditional methods for human localization and pose estimation (HPE), which mainly rely on RGB images as an input modality, confront substantial limitations in real-world applications due to privacy concerns. In contrast, radar-based HPE methods emerge as a promising alternative, characterized by distinctive attributes such as through-wall recognition and privacy-preserving, rendering the method more conducive to practical deployments. This paper presents a Radar Tensor-based human pose (RT-Pose) dataset and an open-source benchmarking framework. The RT-Pose dataset comprises 4D radar tensors, LiDAR point clouds, and RGB images, and is collected for a total of 72k frames across 240 sequences with six different complexity-level actions. The 4D radar tensor provides raw spatio-temporal information, differentiating it from other radar point cloud-based datasets. We develop an annotation process using RGB images and LiDAR point clouds to accurately label 3D human skeletons. In addition, we propose HRRadarPose, the first single-stage architecture that extracts the high-resolution representation of 4D radar tensors in 3D space to aid human keypoint estimation. HRRadarPose outperforms previous radar-based HPE work on the RT-Pose benchmark. The overall HRRadarPose performance on the RT-Pose dataset, as reflected in a mean per joint position error (MPJPE) of 9.91cm, indicates the persistent challenges in achieving accurate HPE in complex real-world scenarios. RT-Pose is available at this https URL.

Comments:	ECCV 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Signal Processing (eess.SP)
Cite as:	arXiv:2407.13930 [cs.CV]
	(or arXiv:2407.13930v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.13930

Submission history

From: Jen-Hao Cheng [view email]
[v1] Thu, 18 Jul 2024 22:46:35 UTC (2,401 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:RT-Pose: A 4D Radar Tensor-based 3D Human Pose Estimation and Localization Benchmark

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:RT-Pose: A 4D Radar Tensor-based 3D Human Pose Estimation and Localization Benchmark

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators