Unsupervised Reward Shaping for a Robotic Sequential Picking Task from Visual Observations in a Logistics Scenario

Giammarino, Vittorio; Meyer, Andrew J; Biegun, Kai

arXiv:2209.12350 (cs)

[Submitted on 25 Sep 2022 (v1), last revised 27 May 2023 (this version, v3)]

Abstract: We focus on an unloading problem, typical of the logistics sector, modeled as a sequential pick-and-place task. In this type of task, modern machine learning techniques have been shown to work better than classic systems, since they are more adaptable to stochasticity and better able to cope with large uncertainties. More specifically, supervised and imitation learning have achieved outstanding results in this regard, with the shortcoming of requiring some form of supervision that is not always obtainable in all settings. On the other hand, reinforcement learning (RL) requires a much milder form of supervision but remains impracticable due to its inefficiency. In this paper, we propose and theoretically motivate a novel Unsupervised Reward Shaping algorithm from expert's observations, which relaxes the level of supervision required by the agent and works to improve RL performance in our task.

Subjects:	Robotics (cs.RO); Machine Learning (cs.LG); Systems and Control (eess.SY)
Cite as:	arXiv:2209.12350 [cs.RO]
	(or arXiv:2209.12350v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2209.12350

Submission history

From: Vittorio Giammarino
[v1] Sun, 25 Sep 2022 23:30:14 UTC (807 KB)
[v2] Mon, 7 Nov 2022 23:50:44 UTC (879 KB)
[v3] Sat, 27 May 2023 14:29:17 UTC (904 KB)
