Stochastic Optimal Control Matching

Domingo-Enrich, Carles; Han, Jiequn; Amos, Brandon; Bruna, Joan; Chen, Ricky T. Q.

Mathematics > Optimization and Control

arXiv:2312.02027v1 (math)

[Submitted on 4 Dec 2023 (this version), latest version 11 Oct 2024 (v5)]

Title:Stochastic Optimal Control Matching

Authors:Carles Domingo-Enrich, Jiequn Han, Brandon Amos, Joan Bruna, Ricky T. Q. Chen

View PDF

Abstract:Stochastic optimal control, which has the goal of driving the behavior of noisy systems, is broadly applicable in science, engineering and artificial intelligence. Our work introduces Stochastic Optimal Control Matching (SOCM), a novel Iterative Diffusion Optimization (IDO) technique for stochastic optimal control that stems from the same philosophy as the conditional score matching loss for diffusion models. That is, the control is learned via a least squares problem by trying to fit a matching vector field. The training loss, which is closely connected to the cross-entropy loss, is optimized with respect to both the control function and a family of reparameterization matrices which appear in the matching vector field. The optimization with respect to the reparameterization matrices aims at minimizing the variance of the matching vector field. Experimentally, our algorithm achieves lower error than all the existing IDO techniques for stochastic optimal control for four different control settings. The key idea underlying SOCM is the path-wise reparameterization trick, a novel technique that is of independent interest, e.g., for generative modeling.

Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG); Numerical Analysis (math.NA); Probability (math.PR); Machine Learning (stat.ML)
Cite as:	arXiv:2312.02027 [math.OC]
	(or arXiv:2312.02027v1 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2312.02027

Submission history

From: Carles Domingo-Enrich [view email]
[v1] Mon, 4 Dec 2023 16:49:43 UTC (6,048 KB)
[v2] Thu, 14 Dec 2023 06:44:31 UTC (4,889 KB)
[v3] Wed, 17 Apr 2024 21:39:34 UTC (10,053 KB)
[v4] Fri, 28 Jun 2024 22:37:36 UTC (10,054 KB)
[v5] Fri, 11 Oct 2024 12:39:38 UTC (10,366 KB)

Mathematics > Optimization and Control

Title:Stochastic Optimal Control Matching

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Stochastic Optimal Control Matching

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators