UniT: Data Efficient Tactile Representation with Generalization to Unseen Objects

Xu, Zhengtong; Uppuluri, Raghava; Zhang, Xinwei; Fitch, Cael; Crandall, Philip Glen; Shou, Wan; Wang, Dongyi; She, Yu

Computer Science > Robotics

arXiv:2408.06481 (cs)

[Submitted on 12 Aug 2024 (v1), last revised 1 Apr 2025 (this version, v2)]

Title:UniT: Data Efficient Tactile Representation with Generalization to Unseen Objects

Authors:Zhengtong Xu, Raghava Uppuluri, Xinwei Zhang, Cael Fitch, Philip Glen Crandall, Wan Shou, Dongyi Wang, Yu She

View PDF HTML (experimental)

Abstract:UniT is an approach to tactile representation learning, using VQGAN to learn a compact latent space and serve as the tactile representation. It uses tactile images obtained from a single simple object to train the representation with generalizability. This tactile representation can be zero-shot transferred to various downstream tasks, including perception tasks and manipulation policy learning. Our benchmarkings on in-hand 3D pose and 6D pose estimation tasks and a tactile classification task show that UniT outperforms existing visual and tactile representation learning methods. Additionally, UniT's effectiveness in policy learning is demonstrated across three real-world tasks involving diverse manipulated objects and complex robot-object-environment interactions. Through extensive experimentation, UniT is shown to be a simple-to-train, plug-and-play, yet widely effective method for tactile representation learning. For more details, please refer to our open-source repository this https URL and the project website this https URL.

Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2408.06481 [cs.RO]
	(or arXiv:2408.06481v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2408.06481

Submission history

From: Zhengtong Xu [view email]
[v1] Mon, 12 Aug 2024 20:29:09 UTC (46,668 KB)
[v2] Tue, 1 Apr 2025 18:26:36 UTC (39,926 KB)

Computer Science > Robotics

Title:UniT: Data Efficient Tactile Representation with Generalization to Unseen Objects

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:UniT: Data Efficient Tactile Representation with Generalization to Unseen Objects

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators