
Improved Operator Learning by Orthogonal Attention

About

Neural operators, as efficient surrogate models for learning the solutions of PDEs, have received extensive attention in the field of scientific machine learning. Among them, attention-based neural operators have become one of the mainstream approaches in related research. However, existing approaches overfit the limited training data due to the considerable number of parameters in the attention mechanism. To address this, we develop an orthogonal attention based on the eigendecomposition of the kernel integral operator and the neural approximation of eigenfunctions. The orthogonalization naturally imposes a proper regularization effect on the resulting neural operator, which aids in resisting overfitting and boosting generalization. Experiments on six standard neural operator benchmark datasets, comprising both regular and irregular geometries, show that our method outperforms competing baselines by decent margins.
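The idea above can be sketched numerically. An attention layer viewed as a kernel integral operator K(u)(x) = ∫ κ(x, y) u(y) dy can be approximated through an eigendecomposition κ(x, y) ≈ Σ_k λ_k ψ_k(x) ψ_k(y), where the eigenfunctions ψ_k are produced by a neural network and orthonormalized. The following is a minimal NumPy sketch of that structure, not the authors' implementation: the feature matrix `psi`, the spectrum `lam`, and the QR-based orthonormalization are illustrative assumptions.

```python
import numpy as np

def orthogonal_attention(psi, u, lam):
    """Sketch of a kernel integral via an approximate eigendecomposition.

    psi: (n, k) hypothetical neural features at n query points
         (stand-in for learned eigenfunctions)
    u:   (n, d) input function values at the same points
    lam: (k,)   learnable spectrum (eigenvalues)
    """
    # Orthonormalize the feature columns (QR, i.e. Gram-Schmidt).
    # This is the "orthogonal" constraint that regularizes the operator.
    q, _ = np.linalg.qr(psi)             # q has orthonormal columns
    # Monte-Carlo estimate of the inner products <psi_k, u>.
    coeffs = q.T @ u / len(u)
    # Reconstruct K(u)(x) = sum_k lam_k * psi_k(x) * <psi_k, u>.
    return q @ (lam[:, None] * coeffs)
```

Because the operator is linear in `u` and the orthonormal columns of `q` are fixed by `psi`, the layer's capacity is controlled mainly by the k-dimensional spectrum `lam`, which is where the regularization effect comes from in this view.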

Zipeng Xiao, Zhongkai Hao, Bokai Lin, Zhijie Deng, Hang Su • 2023

Related benchmarks

Task | Dataset | Metric | Result | Rank
PDE solving | Darcy-Flow 2d (test) | Relative MSE | 0.0094 | 33
PDE solving | Navier-Stokes Regular Grid (test) | Relative L2 Error | 0.1195 | 25
PDE solving | Darcy Regular Grid (test) | Relative L2 Error | 0.0076 | 25
PDE solving | Airfoil Structured Mesh (test) | Relative L2 Error | 0.0061 | 23
PDE solving | Pipe Structured Mesh (test) | Relative L2 Error | 0.0052 | 23
Forward PDE solving | Airfoil | Relative L2 | 0.61 | 21
Forward PDE solving | Plasticity | Relative L2 Error | 0.0048 | 21
Forward PDE solving | Pipe | Relative L2 Error | 0.0052 | 20
Forward PDE solving | Elasticity | Relative L2 Error | 0.0118 | 19
PDE solving | Navier-Stokes 2D (test) | Relative MSE Loss | 0.793 | 18

(Showing 10 of 25 rows.)
