Nonlinear Sufficient Dimension Reduction with a Stochastic Neural Network
About
Sufficient dimension reduction is a powerful tool to extract core information hidden in the high-dimensional data and has potentially many important applications in machine learning tasks. However, the existing nonlinear sufficient dimension reduction methods often lack the scalability necessary for dealing with large-scale data. We propose a new type of stochastic neural network under a rigorous probabilistic framework and show that it can be used for sufficient dimension reduction for large-scale data. The proposed stochastic neural network is trained using an adaptive stochastic gradient Markov chain Monte Carlo algorithm, whose convergence is rigorously studied in the paper as well. Through extensive experiments on real-world classification and regression problems, we show that the proposed method compares favorably with the existing state-of-the-art sufficient dimension reduction methods and is computationally more efficient for large-scale data.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-Label Classification | sub-MNIST (test) | Misclassification Rate4.56 | 25 | |
| Nonlinear Sufficient Dimension Reduction | Relative location of CT slices on axial axis (test) | MSE2.00e-4 | 21 | |
| Binary Classification | flaresolar (test) | Error Rate32.36 | 16 | |
| Binary Classification | Heart (test) | Error Rate16.25 | 16 | |
| Classification | Breast Cancer (test) | Misclassification Rate25.78 | 16 | |
| Binary Classification | UCI Waveform (test) | Misclassification Rate10.79 | 16 | |
| Binary Classification | thyroid (test) | Misclassification Rate6.87 | 16 | |
| Classification | German (test) | Error Rate23.56 | 16 |