Learning in Domain Randomization via Continuous Time Non-Stochastic Control

Li, Jingwei; Dong, Jing; Chang, Can; Wang, Baoxiang; Zhang, Jingzhao

Mathematics > Optimization and Control

arXiv:2306.01952 (math)

[Submitted on 2 Jun 2023 (v1), last revised 14 Dec 2023 (this version, v2)]

Title:Learning in Domain Randomization via Continuous Time Non-Stochastic Control

Authors:Jingwei Li, Jing Dong, Can Chang, Baoxiang Wang, Jingzhao Zhang

View PDF HTML (experimental)

Abstract:Domain randomization is a popular method for robustly training agents to adapt to diverse environments and real-world tasks. In this paper, we examine how to train an agent in domain randomization environments from a nonstochastic control perspective. We first theoretically study online control of continuous-time linear systems under nonstochastic noises. We present a novel two-level online algorithm, by integrating a higher-level learning strategy and a lower-level feedback control strategy. This method offers a practical solution, and for the first time achieves sublinear regret in continuous-time nonstochastic systems. Compared to standard online learning algorithms, our algorithm features a stack and skip procedure. By applying stack and skip to the SAC (Soft Actor-Critic) algorithm, we achieved improved results in multiple reinforcement learning tasks within domain randomization environments. Our work provides new insights into nonasymptotic analyses of controlling continuous-time systems. Further, our work justifies the importance of stacked and skipped in controller learning under nonstochastic environments.

Subjects:	Optimization and Control (math.OC); Systems and Control (eess.SY)
Cite as:	arXiv:2306.01952 [math.OC]
	(or arXiv:2306.01952v2 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2306.01952

Submission history

From: Jingwei Li [view email]
[v1] Fri, 2 Jun 2023 23:26:41 UTC (141 KB)
[v2] Thu, 14 Dec 2023 07:24:21 UTC (1,956 KB)

✅2024-10-01: arxiv.org is back to normal.✅

Mathematics > Optimization and Control

Title:Learning in Domain Randomization via Continuous Time Non-Stochastic Control

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

✅2024-10-01: arxiv.org is back to normal.✅

Mathematics > Optimization and Control

Title:Learning in Domain Randomization via Continuous Time Non-Stochastic Control

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators