BCAT: A Block Causal Transformer for PDE Foundation Models for Fluid Dynamics
About
We introduce BCAT, a PDE foundation model designed for autoregressive prediction of solutions to two dimensional fluid dynamics problems. Our approach uses a block causal transformer architecture to model next frame predictions, leveraging previous frames as contextual priors rather than relying solely on sub-frames or pixel-based inputs commonly used in image generation methods. This block causal framework more effectively captures the spatial dependencies inherent in nonlinear spatiotemporal dynamics and physical phenomena. In an ablation study, next frame prediction demonstrated a 3.5x accuracy improvement over next token prediction. BCAT is trained on a diverse range of fluid dynamics datasets, including incompressible and compressible Navier-Stokes equations across various geometries and parameter regimes, as well as the shallow-water equations. The model's performance was evaluated on 6 distinct downstream prediction tasks and tested on about 8K trajectories to measure robustness on a variety of fluid dynamics simulations. BCAT achieved an average relative error of 1.18% across all evaluation tasks, outperforming prior approaches on standard benchmarks. With fine-tuning on a turbulence dataset, we show that the method adapts to new settings with more than 40% better accuracy over prior methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Inverse (temporal conditioning) | Rayleigh-Benard (test) | Relative MSE0.191 | 14 | |
| Forward (initial value problem) | Rayleigh-Benard (test) | Relative MSE0.106 | 14 | |
| Forward (initial value problem) | Active Matter (test) | Relative MSE0.218 | 13 | |
| Inverse (temporal conditioning) | Active Matter (test) | Relative MSE0.195 | 13 | |
| Forward forecasting | Active Matter forward task (test) | Relative MSE0.218 | 13 | |
| Inverse forecasting | Active Matter inverse task (test) | Relative MSE0.195 | 13 | |
| Forward (initial value problem) | Gray-Scott (test) | Relative MSE0.0882 | 12 | |
| Inverse (temporal conditioning) | Gray-Scott (test) | Relative MSE0.219 | 12 | |
| Forward time-stepping | MHD 3D spatio-temporal | Rel. MSE (1st step)0.329 | 4 | |
| Forward time-stepping | TGC 3D spatio-temporal | Relative MSE (1st Step)0.124 | 4 |