
GET: Group Event Transformer for Event-Based Vision

About

Event cameras are a type of novel neuromorphic sensor that has been gaining increasing attention. Existing event-based backbones mainly rely on image-based designs to extract spatial information within the image transformed from events, overlooking important event properties like time and polarity. To address this issue, we propose a novel Group-based vision Transformer backbone for Event-based vision, called Group Event Transformer (GET), which decouples temporal-polarity information from spatial information throughout the feature extraction process. Specifically, we first propose a new event representation for GET, named Group Token, which groups asynchronous events based on their timestamps and polarities. Then, GET applies the Event Dual Self-Attention block and the Group Token Aggregation module to facilitate effective feature communication and integration in both the spatial and temporal-polarity domains. After that, GET can be integrated with different downstream tasks by connecting it with various heads. We evaluate our method on four event-based classification datasets (Cifar10-DVS, N-MNIST, N-CARS, and DVS128Gesture) and two event-based object detection datasets (1Mpx and Gen1), and the results demonstrate that GET outperforms other state-of-the-art methods. The code is available at https://github.com/Peterande/GET-Group-Event-Transformer.

Yansong Peng, Yueyi Zhang, Zhiwei Xiong, Xiaoyan Sun, Feng Wu • 2023
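
The grouping idea in the abstract (bucketing asynchronous events by timestamp and polarity) can be illustrated with a minimal sketch. It assumes events arrive as (timestamp, x, y, polarity) tuples with polarity in {0, 1} and uses uniform time bins; `group_events` and its parameters are hypothetical names chosen for illustration. This is not the paper's Group Token embedding, only a simple per-group event count that shows how time and polarity can be kept separate from spatial position. The repository linked in the abstract holds the authors' implementation.

```python
from typing import List, Tuple

# Assumed event layout: (timestamp, x, y, polarity), polarity in {0, 1}.
Event = Tuple[float, int, int, int]


def group_events(
    events: List[Event],
    height: int,
    width: int,
    num_time_bins: int,
) -> List[List[List[int]]]:
    """Count events per (time bin, polarity) group at each pixel.

    Returns counts[group][y][x] with group = time_bin * 2 + polarity,
    so there are 2 * num_time_bins groups in total.
    """
    num_groups = 2 * num_time_bins
    counts = [[[0] * width for _ in range(height)] for _ in range(num_groups)]
    if not events:
        return counts

    t_start = min(e[0] for e in events)
    t_end = max(e[0] for e in events)
    duration = (t_end - t_start) or 1.0  # avoid division by zero

    for t, x, y, p in events:
        # Map the timestamp to a uniform bin in [0, num_time_bins - 1].
        time_bin = min(
            int((t - t_start) / duration * num_time_bins),
            num_time_bins - 1,
        )
        counts[time_bin * 2 + p][y][x] += 1
    return counts


# Four toy events on a 2x2 sensor, split into 2 time bins x 2 polarities.
events = [(0.0, 0, 0, 1), (0.4, 1, 0, 0), (0.6, 1, 1, 1), (1.0, 0, 1, 0)]
groups = group_events(events, height=2, width=2, num_time_bins=2)
```

Each of the four groups here ends up holding exactly one event, so temporal-polarity membership is recoverable from the group index while the spatial location stays in the (y, x) grid.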

Related benchmarks

Task                  Dataset                 Result            Rank
Image Classification  CIFAR10-DVS (test)      Accuracy 84.8     80
Image Classification  N-MNIST (test)          Accuracy 99.7     69
Object Classification N-CARS (test)           Accuracy 96.7     53
Object Detection      Gen1 (test)             mAP 47.9          36
Gesture Recognition   DVS128-Gesture (test)   Accuracy 97.9     30
Object Detection      1 Mpx (test)            mAP 48.4          26
Action Recognition    DailyDVS-200 (test)     Top-1 Acc 37.28   17
Action Recognition    DailyDVS-200            Top-1 Acc 37.28   16
Action Recognition    DVS128Gesture           Accuracy 97.9     15
Action Recognition    HARDVS (test)           Top-1 Acc 0.4646  15
Showing 10 of 15 rows
