Learning Deep Hybrid Models with Sharpness-Aware Minimization
About
Hybrid modeling, the combination of machine learning models and scientific mathematical models, enables flexible and robust data-driven prediction with partial interpretability. However, effectively the scientific models may be ignored in prediction due to the flexibility of the machine learning model, making the idea of hybrid modeling pointless. Typically some regularization is applied to hybrid model learning to avoid such a failure case, but the formulation of the regularizer strongly depends on model architectures and domain knowledge. In this paper, we propose to focus on the flatness of loss minima in learning hybrid models, aiming to make the model as simple as possible. We employ the idea of sharpness-aware minimization and adapt it to the hybrid modeling setting. Numerical experiments show that the SAM-based method works well across different choices of models and datasets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Hybrid Modeling (Parameter Estimation and Prediction) | DUFFING OSCILLATOR (test) | Theta Error (x10^-2)0.0132 | 6 | |
| Hybrid Modeling (Parameter Estimation and Prediction) | PENDULUM IMAGES (test) | Theta Error (x10^-2)5.04 | 6 | |
| Hybrid Modeling (Parameter Estimation and Prediction) | WIND TUNNEL (test) | Theta Error (x1e-2)0.019 | 6 | |
| Hybrid Modeling (Parameter Estimation and Prediction) | LIGHT TUNNEL (test) | Theta Cosine Similarity0.98 | 6 | |
| Hybrid Modeling (Parameter Estimation and Prediction) | PENDULUM TIME-SERIES (test) | Theta Error0.0034 | 6 | |
| Hybrid Modeling (Parameter Estimation and Prediction) | Reaction-Diffusion (test) | Theta Error2.79e-4 | 6 |