Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks
About
Spiking Neural Networks (SNNs) are biologically-inspired deep neural networks that efficiently extract temporal information while offering promising gains in terms of energy efficiency and latency when deployed on neuromorphic devices. SNN parameters are sensitive to temporal resolution, leading to significant performance drops when the temporal resolution of target data during deployment is not the same as that of the source data used for training, especially when fine-tuning with the target data is not possible during deployment. To address this challenge, we propose three novel domain adaptation methods for adapting neuron parameters to account for the change in time resolution without re-training on target time resolution. The proposed methods are based on a mapping between neuron dynamics in SNNs and State Space Models (SSMs) and are applicable to general neuron models. We evaluate the proposed methods under spatio-temporal data tasks, namely the audio keyword spotting datasets SHD and MSWC, and the neuromorphic image NMINST dataset. Our methods provide an alternative to-and in most cases significantly outperform-the existing reference method that consists of scaling only the time constant. Notably, when the temporal resolution of the target data is double that of the source data, applying one of our proposed methods instead of the benchmark achieves classification accuracy of 89.5% instead of 53.0% on SHD, 93.6% instead of 38.8% on MSWC and 98.5% instead of 97.2% aon NMNIST. Moreover, our results show that high accuracy on high temporal resolution data can be obtained by time-efficient training on lower temporal resolution data.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Classification | SHD (test) | Accuracy89.5 | 48 | |
| Spiking Image Classification | NMNIST bT=1 (test) | Accuracy98.5 | 40 | |
| Classification | NMNIST (bS=1, bT=2) 1.0 (test) | Accuracy98.3 | 10 | |
| Spoken Word Classification | MSWC bS=2 to bT=1 | Accuracy0.949 | 10 | |
| Spoken Word Classification | MSWC bS=3 to bT=1 | Accuracy0.94 | 10 | |
| Spoken Word Classification | MSWC bS=4 to bT=1 | Accuracy0.938 | 10 | |
| Spoken Word Classification | MSWC bS=10 to bT=1 | Accuracy86.8 | 10 | |
| Spoken Word Classification | MSWC (bS=1, bT=2) (test) | Accuracy59.4 | 10 | |
| Spoken Word Classification | MSWC bS=1, bT=3 (test) | Accuracy25.9 | 10 | |
| Spoken Word Classification | MSWC (bS=1, bT=4) (test) | Accuracy0.131 | 10 |