DeepONet: Learning nonlinear operators for identifying differential equations based on the universal approximation theorem of operators

About

While it is widely known that neural networks are universal approximators of continuous functions, a less known and perhaps more powerful result is that a neural network with a single hidden layer can approximate accurately any nonlinear continuous operator. This universal approximation theorem is suggestive of the potential application of neural networks in learning nonlinear operators from data. However, the theorem guarantees only a small approximation error for a sufficient large network, and does not consider the important optimization and generalization errors. To realize this theorem in practice, we propose deep operator networks (DeepONets) to learn operators accurately and efficiently from a relatively small dataset. A DeepONet consists of two sub-networks, one for encoding the input function at a fixed number of sensors $x_i, i=1,\dots,m$ (branch net), and another for encoding the locations for the output functions (trunk net). We perform systematic simulations for identifying two types of operators, i.e., dynamic systems and partial differential equations, and demonstrate that DeepONet significantly reduces the generalization error compared to the fully-connected networks. We also derive theoretically the dependence of the approximation error in terms of the number of sensors (where the input function is defined) as well as the input function type, and we verify the theorem with computational results. More importantly, we observe high-order error convergence in our computational tests, namely polynomial rates (from half order to fourth order) and even exponential convergence with respect to the training dataset size.

Lu Lu, Pengzhan Jin, George Em Karniadakis• 2019

Related benchmarks

Task	Dataset	Result
Constitutive modeling	hyperelasticity (test)	Relative L2 Error11.998	33
Solving PDE	Burgers	Relative Error5.59	24
Fluid-solid interaction modeling	Fluid-solid interaction NS+EW Re=4000 (test)	L2 Loss0.259	24
Fluid dynamics modeling	Fluid dynamics (NS) Re=400 (test)	L2 Loss0.107	24
Fluid-solid interaction modeling	Fluid-solid interaction NS+EW Re=400 (test)	L2 Loss0.107	24
Solution Reconstruction	Inverse Burgers' Equation (test)	Relative MAE7.34	21
Learning PDE Solution Operators	2D Shallow Water	Mean L2 Relative Error1.11	20
Inverse Problem Subdomain Completion	Inverse Burgers' Equation Subdomain (test)	Relative MAE2.51	15
Forward PDE solving	Navier-Stokes	Relative L2 Error19.68	15
Forward PDE solving	Poisson	Relative L2 Error5.46	15

Showing 10 of 67 rows

Other info

Follow for update

@wizwand_team Discord