Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

GG-SSMs: Graph-Generating State Space Models

About

State Space Models (SSMs) are powerful tools for modeling sequential data in computer vision and time series analysis domains. However, traditional SSMs are limited by fixed, one-dimensional sequential processing, which restricts their ability to model non-local interactions in high-dimensional data. While methods like Mamba and VMamba introduce selective and flexible scanning strategies, they rely on predetermined paths, which fails to efficiently capture complex dependencies. We introduce Graph-Generating State Space Models (GG-SSMs), a novel framework that overcomes these limitations by dynamically constructing graphs based on feature relationships. Using Chazelle's Minimum Spanning Tree algorithm, GG-SSMs adapt to the inherent data structure, enabling robust feature propagation across dynamically generated graphs and efficiently modeling complex dependencies. We validate GG-SSMs on 11 diverse datasets, including event-based eye-tracking, ImageNet classification, optical flow estimation, and six time series datasets. GG-SSMs achieve state-of-the-art performance across all tasks, surpassing existing methods by significant margins. Specifically, GG-SSM attains a top-1 accuracy of 84.9% on ImageNet, outperforming prior SSMs by 1%, reducing the KITTI-15 error rate to 2.77%, and improving eye-tracking detection rates by up to 0.33% with fewer parameters. These results demonstrate that dynamic scanning based on feature relationships significantly improves SSMs' representational power and efficiency, offering a versatile tool for various applications in computer vision and beyond.

Nikola Zubi\'c, Davide Scaramuzza• 2024

Related benchmarks

TaskDatasetResultRank
Multivariate ForecastingETTh2
MSE0.2823
341
Multivariate Time-series ForecastingETTm2
MSE0.1725
334
Image ClassificationImageNet-1k (val)
Top-1 Acc84.9
287
Multivariate Time-series ForecastingWeather
MSE0.1473
276
Multivariate Time-series ForecastingTraffic
MSE0.3486
200
Multivariate Time-series ForecastingExchange
MAE0.2073
165
Optical Flow EstimationSintel clean (test)
EPE0.97
103
Optical Flow EstimationSintel Final (test)
EPE1.9
101
Optical Flow EstimationMPI Sintel Final Pass
Overall AEE1.58
29
Optical Flow EstimationKITTI 2015
Fl-all2.77
28
Showing 10 of 13 rows

Other info

Code

Follow for update