Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Sparsity in Continuous-Depth Neural Networks

About

Neural Ordinary Differential Equations (NODEs) have proven successful in learning dynamical systems in terms of accurately recovering the observed trajectories. While different types of sparsity have been proposed to improve robustness, the generalization properties of NODEs for dynamical systems beyond the observed data are underexplored. We systematically study the influence of weight and feature sparsity on forecasting as well as on identifying the underlying dynamical laws. Besides assessing existing methods, we propose a regularization technique to sparsify "input-output connections" and extract relevant features during training. Moreover, we curate real-world datasets consisting of human motion capture and human hematopoiesis single-cell RNA-seq data to realistically analyze different levels of out-of-distribution (OOD) generalization in forecasting and dynamics identification respectively. Our extensive empirical evaluation on these challenging benchmarks suggests that weight sparsity improves generalization in the presence of noise or irregular sampling. However, it does not prevent learning spurious feature dependencies in the inferred dynamics, rendering them impractical for predictions under interventions, or for inferring the true underlying dynamics. Instead, feature sparsity can indeed help with recovering sparse ground-truth dynamics compared to unregularized NODEs.

Hananeh Aliee, Till Richter, Mikhail Solonin, Ignacio Ibarra, Fabian Theis, Niki Kilbertus• 2022

Related benchmarks

TaskDatasetResultRank
Gene expression dynamics predictionHematopoesis Erythroid lineage (test)
Sparsity0.1394
12
Gene regulatory network inferenceSIM350 5% noise (test)
Sparsity56.5
12
Gene regulatory network inferenceBreast cancer in pseudotime
Sparsity14.09
12
Gene regulatory network inferenceYeast cell cycle
Sparsity12.09
12
Gene regulatory dynamics predictionSIM350 5% noise (test)
MSE8
12
System IdentificationSynthetic second-order ODE (train)
MSE6.30e-5
6
System IdentificationSynthetic second-order ODE Extrapolation
MSE3.90e-4
6
Human motion forecastingHuman Motion Capture Walk (train)
MSE2.90e-4
5
Human motion forecastingHuman Motion Capture Golf (train)
MSE3.30e-4
5
Human motion forecastingHuman Motion Capture Walk (test)
MSE0.002
5
Showing 10 of 12 rows

Other info

Code

Follow for update