Sparse Flows: Pruning Continuous-depth Models
About
Continuous deep learning architectures enable learning of flexible probabilistic models for predictive modeling as neural ordinary differential equations (ODEs), and for generative modeling as continuous normalizing flows. In this work, we design a framework to decipher the internal dynamics of these continuous depth models by pruning their network architectures. Our empirical results suggest that pruning improves generalization for neural ODEs in generative modeling. We empirically show that the improvement is because pruning helps avoid mode-collapse and flatten the loss surface. Moreover, pruning finds efficient neural ODE representations with up to 98% less parameters compared to the original network, without loss of accuracy. We hope our results will invigorate further research into the performance-size trade-offs of modern continuous-depth models.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Gene regulatory network inference | SIM350 5% noise (test) | Sparsity96 | 12 | |
| Gene regulatory network inference | Breast cancer in pseudotime | Sparsity95.7 | 12 | |
| Gene regulatory network inference | Yeast cell cycle | Sparsity95.22 | 12 | |
| Gene regulatory dynamics prediction | SIM350 5% noise (test) | MSE3.6 | 12 | |
| Gene expression dynamics prediction | Hematopoesis Erythroid lineage (test) | Sparsity0.9444 | 12 |