Functional Attention: From Pairwise Affinities to Functional Correspondences
About
Learning mappings between infinite-dimensional function spaces, or operator learning, is essential for many machine learning applications. Although transformer-based operators are popular, they often rely on token-wise attention. These methods treat continuous fields as discrete tokens and usually ignore the global functional structure. We introduce \emph{Functional Attention}, which reinterprets attention as a functional correspondence between adaptive bases. Inspired by geometric functional maps, our method replaces softmax affinities with structured linear operators. This yields a compact, generalizable, resolution-invariant representation that explicitly captures global dependencies. Experiments demonstrate that \emph{Functional Attention} can match state-of-the-art performance in many operator learning tasks, including solving PDEs, 3D segmentation, and regression, while remaining robust to varying discretizations. Project page is available at https://github.com/xjffff/FUNCATTN.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| PDE solving | Darcy | Relative L2 Error0.0042 | 46 | |
| Forward PDE solving | Elasticity | Relative L2 Error0.005 | 44 | |
| Forward PDE solving | Airfoil | Relative L20.43 | 36 | |
| Forward PDE solving | Plasticity | Relative L2 Error0.11 | 36 | |
| Forward PDE solving | Pipe | Relative L2 Error0.0029 | 35 | |
| PDE solving | Navier-Stokes | Relative L2 Loss8 | 16 | |
| Aerodynamic property prediction | AirfRANS OOD Reynolds | Relative Error (CL)23.4 | 11 | |
| Aerodynamic property prediction | AirfRANS (OOD Angles) | Relative Error CL13.3 | 11 | |
| RNA point cloud segmentation | RNA point cloud (test) | Accuracy89 | 7 | |
| PDE solving | 2D Darcy flow triangular notch domain | Relative L2 Error0.64 | 6 |