Invariant Scattering Convolution Networks
About
A wavelet scattering network computes a translation invariant image representation, which is stable to deformations and preserves high frequency information for classification. It cascades wavelet transform convolutions with non-linear modulus and averaging operators. The first network layer outputs SIFT-type descriptors whereas the next layers provide complementary invariant information which improves classification. The mathematical analysis of wavelet scattering networks explains important properties of deep convolution networks for classification. A scattering representation of stationary processes incorporates higher order moments and can thus discriminate textures having the same Fourier power spectrum. State of the art classification results are obtained for handwritten digits and texture discrimination, using a Gaussian kernel SVM and a generative PCA classifier.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | MNIST rotated (test) | Test Error (%)7.48 | 105 | |
| Image Classification | MNIST standard (test) | Error Rate0.43 | 40 | |
| Texture Classification | CUReT (test) | Error Rate0.2 | 11 | |
| Classification | MNIST bg-img-rot (test) | Error Rate (%)0.5048 | 11 | |
| Face Recognition | MultiPIE Illum. (test) | Recognition Rate20.88 | 11 | |
| Face Recognition | MultiPIE Exps. (test) | Recognition Rate66.67 | 11 | |
| Face Recognition | MultiPIE Pose (test) | Recognition Rate71.46 | 11 | |
| Face Recognition | MultiPIE Exps.+Pose (test) | Recognition Rate54.37 | 11 | |
| Face Recognition | MultiPIE Illum.+Exps. (test) | Recognition Rate14.51 | 11 | |
| Face Recognition | MultiPIE Illum.+Pose (test) | Recognition Rate15 | 11 |