
QKFormer: Hierarchical Spiking Transformer using Q-K Attention

About

Spiking Transformers, which integrate Spiking Neural Networks (SNNs) with Transformer architectures, have attracted significant attention due to their potential for energy efficiency and high performance. However, existing models in this domain still suffer from suboptimal performance. We introduce several innovations to improve the performance: i) We propose a novel spike-form Q-K attention mechanism, tailored for SNNs, which efficiently models the importance of token or channel dimensions through binary vectors with linear complexity. ii) We incorporate a hierarchical structure, which significantly benefits the performance of both the brain and artificial neural networks, into spiking transformers to obtain multi-scale spiking representations. iii) We design a versatile and powerful patch embedding module with a deformed shortcut specifically for spiking transformers. Together, these yield QKFormer, a directly trained hierarchical spiking transformer based on Q-K attention. QKFormer shows significantly superior performance over existing state-of-the-art SNN models on various mainstream datasets. Notably, with a size comparable to Spikformer (66.34 M, 74.81%), QKFormer (64.96 M) achieves a top-1 accuracy of 85.65% on ImageNet-1K, outperforming Spikformer by 10.84 percentage points. To the best of our knowledge, this is the first time that directly trained SNNs have exceeded 85% accuracy on ImageNet-1K. The code and models are publicly available at https://github.com/zhouchenlin2096/QKFormer
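To make the "binary vectors with linear complexity" idea concrete, here is a minimal NumPy sketch of a token-wise Q-K attention step. It is an illustration under stated assumptions, not the authors' implementation (see the linked repository for that): it uses a single time step, replaces the spiking neuron with a plain threshold, and the function names and the threshold value are hypothetical.

```python
import numpy as np

def threshold_spike(x, threshold):
    # Assumption: a bare threshold stands in for a spiking neuron
    # (no membrane potential, leak, or reset).
    return (x >= threshold).astype(np.float32)

def qk_token_attention(q, k, threshold=2.0):
    """Hypothetical token-wise Q-K attention for one time step.

    q, k: binary spike arrays of shape (N tokens, D channels).
    Returns the masked K and the binary token mask of shape (N, 1).
    """
    # Sum each token's Q spikes over the channel axis: shape (N, 1).
    token_score = q.sum(axis=1, keepdims=True)
    # Binarize the scores into a per-token importance vector.
    token_mask = threshold_spike(token_score, threshold)
    # Broadcast the mask over K's channels; no N x N matrix is formed,
    # so the cost is O(N * D) rather than O(N^2 * D).
    return token_mask * k, token_mask

q = np.array([[1, 0, 1],
              [0, 0, 0],
              [0, 1, 0],
              [1, 1, 1]], dtype=np.float32)
k = np.array([[0, 1, 1],
              [1, 1, 0],
              [1, 0, 1],
              [1, 0, 0]], dtype=np.float32)

out, mask = qk_token_attention(q, k)
print(mask.ravel())  # tokens 0 and 3 are kept
print(out)
```

In this sketch only tokens whose Q spike count reaches the threshold pass their K rows through; summing over the token axis instead would give the channel-wise variant the abstract mentions.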

Chenlin Zhou, Han Zhang, Zhaokun Zhou, Liutao Yu, Liwei Huang, Xiaopeng Fan, Li Yuan, Zhengyu Ma, Huihui Zhou, Yonghong Tian • 2024

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Image Classification | ImageNet-1K | Top-1 Acc | 82.04 | 1239 |
| Image Classification | CIFAR-10 (test) | Accuracy | 96.31 | 882 |
| Image Classification | CIFAR-10 | Accuracy | 96.18 | 875 |
| Image Classification | CIFAR-100 (test) | Top-1 Accuracy | 80.98 | 395 |
| Image Classification | CIFAR-100 | Accuracy | 80.27 | 357 |
| Image Classification | CIFAR100 | Accuracy | 81.15 | 347 |
| Image Classification | CIFAR100 | Accuracy | 81.15 | 301 |
| Image Classification | CIFAR10 | Accuracy (%) | 96.18 | 282 |
| Image Classification | SVHN | Top-1 Accuracy | 97.13 | 186 |
| Classification | CIFAR10-DVS | Accuracy | 80 | 164 |
Showing 10 of 55 rows
