Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection for SLAM

About

Event-based keypoint detection and matching holds significant potential, enabling the integration of event sensors into highly optimized Visual SLAM systems developed for frame cameras over decades of research. Unfortunately, existing approaches struggle with the motion-dependent appearance of keypoints and the complex noise prevalent in event streams, resulting in severely limited feature matching capabilities and poor performance on downstream tasks. To mitigate this problem, we propose SuperEvent, a data-driven approach to predict stable keypoints with expressive descriptors. Due to the absence of event datasets with ground truth keypoint labels, we leverage existing frame-based keypoint detectors on readily available event-aligned and synchronized gray-scale frames for self-supervision: we generate temporally sparse keypoint pseudo-labels considering that events are a product of both scene appearance and camera motion. Combined with our novel, information-rich event representation, we enable SuperEvent to effectively learn robust keypoint detection and description in event streams. Finally, we demonstrate the usefulness of SuperEvent by its integration into a modern sparse keypoint and descriptor-based SLAM framework originally developed for traditional cameras, surpassing the state-of-the-art in event-based SLAM by a wide margin. Source code is available at https://ethz-mrl.github.io/SuperEvent/.

Yannick Burkhardt, Simon Schaefer, Stefan Leutenegger• 2025

Related benchmarks

TaskDatasetResultRank
Visual Place RecognitionFast-and-Slow 1,293 places (Q-high1)
Recall@195
11
Visual Place RecognitionFast-and-Slow Q-low1 1,310 places
Recall@193
11
Visual Place RecognitionFast-and-Slow (Q-med1)
R@194
11
Visual Place RecognitionBrisbane-Event-VPR Sunset2 as reference (Sunset1)
R@186
11
Visual Place RecognitionBrisbane-Event-VPR Sunset2 (Morning)
Recall@153
11
Visual Place RecognitionBrisbane-Event-VPR Sunset2 as reference (Daytime)
R@137
11
Visual Place RecognitionNSAVP (Query) vs R0-FA0 (Reference) (R0-FS0)
Recall@134
11
Visual Place RecognitionNSAVP (Query) vs R0-FA0 (Reference) (R0-FN0)
Recall@15
11
Simultaneous Localization and Mapping (SLAM)SLAM sequences
AUCv6.27
3
Showing 9 of 9 rows

Other info

Follow for update