Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GG-SSMs: Graph-Generating State Space Models

About

State Space Models (SSMs) are powerful tools for modeling sequential data in computer vision and time series analysis domains. However, traditional SSMs are limited by fixed, one-dimensional sequential processing, which restricts their ability to model non-local interactions in high-dimensional data. While methods like Mamba and VMamba introduce selective and flexible scanning strategies, they rely on predetermined paths, which fails to efficiently capture complex dependencies. We introduce Graph-Generating State Space Models (GG-SSMs), a novel framework that overcomes these limitations by dynamically constructing graphs based on feature relationships. Using Chazelle's Minimum Spanning Tree algorithm, GG-SSMs adapt to the inherent data structure, enabling robust feature propagation across dynamically generated graphs and efficiently modeling complex dependencies. We validate GG-SSMs on 11 diverse datasets, including event-based eye-tracking, ImageNet classification, optical flow estimation, and six time series datasets. GG-SSMs achieve state-of-the-art performance across all tasks, surpassing existing methods by significant margins. Specifically, GG-SSM attains a top-1 accuracy of 84.9% on ImageNet, outperforming prior SSMs by 1%, reducing the KITTI-15 error rate to 2.77%, and improving eye-tracking detection rates by up to 0.33% with fewer parameters. These results demonstrate that dynamic scanning based on feature relationships significantly improves SSMs' representational power and efficiency, offering a versatile tool for various applications in computer vision and beyond.

Nikola Zubi\'c, Davide Scaramuzza• 2024

Related benchmarks

TaskDatasetResultRank
Multivariate Time-series ForecastingETTm2
MSE0.1725
389
Multivariate ForecastingETTh2
MSE0.2823
350
Multivariate Time-series ForecastingWeather
MSE0.1473
340
Image ClassificationImageNet-1k (val)
Top-1 Acc84.9
303
Multivariate Time-series ForecastingTraffic
MSE0.3486
264
Multivariate Time-series ForecastingExchange
MAE0.2073
181
Optical Flow EstimationSintel Final (test)
EPE1.9
133
Optical Flow EstimationSintel clean (test)
EPE0.97
120
Optical Flow EstimationKITTI 2015
Fl-all2.77
60
Optical Flow EstimationMPI Sintel Final Pass
Overall AEE1.58
29
Showing 10 of 13 rows

Other info

Code

Follow for update