Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks

About

Spiking Neural Networks (SNNs) are biologically-inspired deep neural networks that efficiently extract temporal information while offering promising gains in terms of energy efficiency and latency when deployed on neuromorphic devices. SNN parameters are sensitive to temporal resolution, leading to significant performance drops when the temporal resolution of target data during deployment is not the same as that of the source data used for training, especially when fine-tuning with the target data is not possible during deployment. To address this challenge, we propose three novel domain adaptation methods for adapting neuron parameters to account for the change in time resolution without re-training on target time resolution. The proposed methods are based on a mapping between neuron dynamics in SNNs and State Space Models (SSMs) and are applicable to general neuron models. We evaluate the proposed methods under spatio-temporal data tasks, namely the audio keyword spotting datasets SHD and MSWC, and the neuromorphic image NMINST dataset. Our methods provide an alternative to-and in most cases significantly outperform-the existing reference method that consists of scaling only the time constant. Notably, when the temporal resolution of the target data is double that of the source data, applying one of our proposed methods instead of the benchmark achieves classification accuracy of 89.5% instead of 53.0% on SHD, 93.6% instead of 38.8% on MSWC and 98.5% instead of 97.2% aon NMNIST. Moreover, our results show that high accuracy on high temporal resolution data can be obtained by time-efficient training on lower temporal resolution data.

Sanja Karilanova, Maxime Fabre, Emre Neftci, Ay\c{c}a \"Oz\c{c}elikkale• 2024

Related benchmarks

Task	Dataset	Result
Classification	SHD (test)	Accuracy89.5	81
Spiking Image Classification	NMNIST bT=1 (test)	Accuracy98.5	40
Classification	NMNIST (bS=1, bT=2) 1.0 (test)	Accuracy98.3	10
Spoken Word Classification	MSWC bS=2 to bT=1	Accuracy0.949	10
Spoken Word Classification	MSWC bS=3 to bT=1	Accuracy0.94	10
Spoken Word Classification	MSWC bS=4 to bT=1	Accuracy0.938	10
Spoken Word Classification	MSWC bS=10 to bT=1	Accuracy86.8	10
Spoken Word Classification	MSWC (bS=1, bT=2) (test)	Accuracy59.4	10
Spoken Word Classification	MSWC bS=1, bT=3 (test)	Accuracy25.9	10
Spoken Word Classification	MSWC (bS=1, bT=4) (test)	Accuracy0.131	10

Showing 10 of 18 rows

Other info

Follow for update

@wizwand_team Discord