Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CoMamba: Real-time Cooperative Perception Unlocked with State Space Models

About

Cooperative perception systems play a vital role in enhancing the safety and efficiency of vehicular autonomy. Although recent studies have highlighted the efficacy of vehicle-to-everything (V2X) communication techniques in autonomous driving, a significant challenge persists: how to efficiently integrate multiple high-bandwidth features across an expanding network of connected agents such as vehicles and infrastructure. In this paper, we introduce CoMamba, a novel cooperative 3D detection framework designed to leverage state-space models for real-time onboard vehicle perception. Compared to prior state-of-the-art transformer-based models, CoMamba enjoys being a more scalable 3D model using bidirectional state space models, bypassing the quadratic complexity pain-point of attention mechanisms. Through extensive experimentation on V2X/V2V datasets, CoMamba achieves superior performance compared to existing methods while maintaining real-time processing capabilities. The proposed framework not only enhances object detection accuracy but also significantly reduces processing time, making it a promising solution for next-generation cooperative perception systems in intelligent transportation networks.

Jinlong Li, Xinyu Liu, Baolu Li, Runsheng Xu, Jiachen Li, Hongkai Yu, Zhengzhong Tu• 2024

Related benchmarks

TaskDatasetResultRank
3D Object DetectionOPV2V
AP@0.5074.3
146
3D Object DetectionDAIR-V2X
AP@0.5063.5
117
3D Object DetectionV2XSet
AP@0.5072.6
70
Showing 3 of 3 rows

Other info

Follow for update