Symmetric Replay Training: Enhancing Sample Efficiency in Deep Reinforcement Learning for Combinatorial Optimization

Kim, Hyeonah; Kim, Minsu; Ahn, Sungsoo; Park, Jinkyoo

arXiv:2306.01276 (cs)

[Submitted on 2 Jun 2023 (v1), last revised 17 Jul 2024 (this version, v4)]

Abstract: Deep reinforcement learning (DRL) has significantly advanced the field of combinatorial optimization (CO). However, its practicality is hindered by the need for a large number of reward evaluations, especially when each evaluation is computationally expensive. To improve sample efficiency, we propose a simple but effective method, called symmetric replay training (SRT), which can be easily integrated into various DRL methods. Our method leverages high-reward samples to encourage exploration of under-explored symmetric regions for free, that is, without additional online interactions. Through replay training, the policy is trained to maximize the likelihood of the symmetric trajectories of the discovered high-reward samples. Experimental results demonstrate that our method consistently improves sample efficiency across diverse DRL methods applied to real-world tasks, such as molecular optimization and hardware design.
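
The replay step described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes a tour-like problem in which every rotation and the reversal of a sequence encode the same solution, and it uses a hypothetical tabular softmax policy (`logits[a][b]` scores moving from city `a` to city `b`). The names `symmetric_variants`, `log_likelihood`, and `replay_step` are made up for this illustration. The point it shows is that a high-reward sample can be replayed through a symmetric variant, so the policy is updated without any new reward evaluation.

```python
import math
import random


def symmetric_variants(tour):
    """Return all rotations of a cyclic tour and of its reversal.

    Assumption for this sketch: all of these sequences describe the same
    solution and therefore share the same reward.
    """
    n = len(tour)
    rev = tour[::-1]
    return [seq[i:] + seq[:i] for seq in (tour, rev) for i in range(n)]


def log_likelihood(logits, tour):
    """Log-probability of `tour` under a tabular next-city softmax policy.

    Already visited cities are masked out. The first city is treated as
    given, so it contributes no term.
    """
    visited = {tour[0]}
    total = 0.0
    for prev, nxt in zip(tour, tour[1:]):
        allowed = [c for c in range(len(logits)) if c not in visited]
        log_z = math.log(sum(math.exp(logits[prev][c]) for c in allowed))
        total += logits[prev][nxt] - log_z
        visited.add(nxt)
    return total


def replay_step(logits, tour, lr=0.5, rng=random):
    """One gradient-ascent step on the log-likelihood of a symmetric variant.

    No reward is evaluated here: the variant inherits the reward of `tour`.
    Each city appears once as `prev`, so every row of `logits` is updated
    at most once and the in-place update equals a single batch step.
    """
    variant = rng.choice(symmetric_variants(tour))
    visited = {variant[0]}
    for prev, nxt in zip(variant, variant[1:]):
        allowed = [c for c in range(len(logits)) if c not in visited]
        z = sum(math.exp(logits[prev][c]) for c in allowed)
        probs = {c: math.exp(logits[prev][c]) / z for c in allowed}
        for c in allowed:
            # Gradient of log-softmax w.r.t. a logit: indicator minus probability.
            grad = (1.0 if c == nxt else 0.0) - probs[c]
            logits[prev][c] += lr * grad
        visited.add(nxt)
    return variant
```

In a full training loop, a step like this would presumably be interleaved with the usual reward-maximizing updates of whichever DRL method is being used, with `tour` drawn from the best samples found so far; how the two phases are scheduled is left open in this sketch.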

Comments:	International Conference on Machine Learning
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2306.01276 [cs.LG]
	(or arXiv:2306.01276v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.01276

Submission history

From: Hyeonah Kim
[v1] Fri, 2 Jun 2023 05:34:01 UTC (1,011 KB)
[v2] Wed, 11 Oct 2023 08:57:34 UTC (2,062 KB)
[v3] Mon, 5 Feb 2024 04:12:30 UTC (3,444 KB)
[v4] Wed, 17 Jul 2024 05:55:45 UTC (3,443 KB)
