Ensemble and Mixture-of-Experts DeepONets For Operator Learning

Sharma, Ramansh; Shankar, Varun

Computer Science > Machine Learning

arXiv:2405.11907v3 (cs)

[Submitted on 20 May 2024 (v1), revised 30 Sep 2024 (this version, v3), latest version 2 Oct 2024 (v4)]

Title:Ensemble and Mixture-of-Experts DeepONets For Operator Learning

Authors:Ramansh Sharma, Varun Shankar

View PDF HTML (experimental)

Abstract:We present a novel deep operator network (DeepONet) architecture for operator learning, the ensemble DeepONet, that allows for enriching the trunk network of a single DeepONet with multiple distinct trunk networks. This trunk enrichment allows for greater expressivity and generalization capabilities over a range of operator learning problems. We also present a spatial mixture-of-experts (MoE) DeepONet trunk network architecture that utilizes a partition-of-unity (PoU) approximation to promote spatial locality and model sparsity in the operator learning problem. We first prove that both the ensemble and PoU-MoE DeepONets are universal approximators. We then demonstrate that ensemble DeepONets containing a trunk ensemble of a standard trunk, the PoU-MoE trunk, and/or a proper orthogonal decomposition (POD) trunk can achieve 2-4x lower relative $\ell_2$ errors than standard DeepONets and POD-DeepONets on both standard and challenging new operator learning problems involving partial differential equations (PDEs) in two and three dimensions. Our new PoU-MoE formulation provides a natural way to incorporate spatial locality and model sparsity into any neural network architecture, while our new ensemble DeepONet provides a powerful and general framework for incorporating basis enrichment in scientific machine learning architectures for operator learning.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2405.11907 [cs.LG]
	(or arXiv:2405.11907v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.11907

Submission history

From: Ramansh Sharma [view email]
[v1] Mon, 20 May 2024 09:42:44 UTC (8,411 KB)
[v2] Tue, 21 May 2024 08:27:26 UTC (8,374 KB)
[v3] Mon, 30 Sep 2024 05:46:55 UTC (16,348 KB)
[v4] Wed, 2 Oct 2024 02:44:55 UTC (16,349 KB)

Computer Science > Machine Learning

Title:Ensemble and Mixture-of-Experts DeepONets For Operator Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Ensemble and Mixture-of-Experts DeepONets For Operator Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators