BCAT: A Block Causal Transformer for PDE Foundation Models for Fluid Dynamics

About

We introduce BCAT, a PDE foundation model designed for autoregressive prediction of solutions to two dimensional fluid dynamics problems. Our approach uses a block causal transformer architecture to model next frame predictions, leveraging previous frames as contextual priors rather than relying solely on sub-frames or pixel-based inputs commonly used in image generation methods. This block causal framework more effectively captures the spatial dependencies inherent in nonlinear spatiotemporal dynamics and physical phenomena. In an ablation study, next frame prediction demonstrated a 3.5x accuracy improvement over next token prediction. BCAT is trained on a diverse range of fluid dynamics datasets, including incompressible and compressible Navier-Stokes equations across various geometries and parameter regimes, as well as the shallow-water equations. The model's performance was evaluated on 6 distinct downstream prediction tasks and tested on about 8K trajectories to measure robustness on a variety of fluid dynamics simulations. BCAT achieved an average relative error of 1.18% across all evaluation tasks, outperforming prior approaches on standard benchmarks. With fine-tuning on a turbulence dataset, we show that the method adapts to new settings with more than 40% better accuracy over prior methods.

Yuxuan Liu, Jingmin Sun, Hayden Schaeffer• 2025

Related benchmarks

Task	Dataset	Result
Inverse (temporal conditioning)	Rayleigh-Benard (test)	Relative MSE0.191	14
Forward (initial value problem)	Rayleigh-Benard (test)	Relative MSE0.106	14
Forward (initial value problem)	Active Matter (test)	Relative MSE0.218	13
Inverse (temporal conditioning)	Active Matter (test)	Relative MSE0.195	13
Forward forecasting	Active Matter forward task (test)	Relative MSE0.218	13
Inverse forecasting	Active Matter inverse task (test)	Relative MSE0.195	13
Forward (initial value problem)	Gray-Scott (test)	Relative MSE0.0882	12
Inverse (temporal conditioning)	Gray-Scott (test)	Relative MSE0.219	12
Forward time-stepping	MHD 3D spatio-temporal	Rel. MSE (1st step)0.329	4
Forward time-stepping	TGC 3D spatio-temporal	Relative MSE (1st Step)0.124	4

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord