Integrating Visual Foundation Models for Enhanced Robot Manipulation and Motion Planning: A Layered Approach

Yang, Chen; Zhou, Peng; Qi, Jiaming

Computer Science > Robotics

arXiv:2309.11244 (cs)

[Submitted on 20 Sep 2023]

Title:Integrating Visual Foundation Models for Enhanced Robot Manipulation and Motion Planning: A Layered Approach

Authors:Chen Yang, Peng Zhou, Jiaming Qi

View PDF

Abstract:This paper presents a novel layered framework that integrates visual foundation models to improve robot manipulation tasks and motion planning. The framework consists of five layers: Perception, Cognition, Planning, Execution, and Learning. Using visual foundation models, we enhance the robot's perception of its environment, enabling more efficient task understanding and accurate motion planning. This approach allows for real-time adjustments and continual learning, leading to significant improvements in task execution. Experimental results demonstrate the effectiveness of the proposed framework in various robot manipulation tasks and motion planning scenarios, highlighting its potential for practical deployment in dynamic environments.

Comments:	3 pages, 2 figures, IEEE Workshop
Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2309.11244 [cs.RO]
	(or arXiv:2309.11244v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2309.11244

Submission history

From: Chen Yang [view email]
[v1] Wed, 20 Sep 2023 12:11:48 UTC (1,167 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.RO

< prev | next >

new | recent | 2023-09

Change to browse by:

References & Citations

export BibTeX citation

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Robotics

Title:Integrating Visual Foundation Models for Enhanced Robot Manipulation and Motion Planning: A Layered Approach

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Robotics

Title:Integrating Visual Foundation Models for Enhanced Robot Manipulation and Motion Planning: A Layered Approach

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators