Rotary Masked Autoencoders are Versatile Learners

About

Applying Transformers to irregular time-series typically requires specializations to their baseline architecture, which can result in additional computational overhead and increased method complexity. We present the Rotary Masked Autoencoder (RoMAE), which utilizes the popular Rotary Positional Embedding (RoPE) method for continuous positions. RoMAE is an extension to the Masked Autoencoder (MAE) that enables interpolation and representation learning with multidimensional continuous positional information while avoiding any time-series-specific architectural specializations. We showcase RoMAE's performance on a variety of modalities including irregular and multivariate time-series, images, and audio, demonstrating that RoMAE surpasses specialized time-series architectures on difficult datasets such as the DESC ELAsTiCC Challenge while maintaining MAE's usual performance across other modalities. In addition, we investigate RoMAE's ability to reconstruct the embedded continuous positions, demonstrating that including learned embeddings in the input sequence breaks RoPE's relative position property.
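The relative position property mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names, the frequency base of 10000, and the toy vectors are all assumptions chosen for illustration. It applies a standard RoPE-style rotation at real-valued (continuous) positions and checks that the query-key score depends only on the difference between the two positions.

```python
import numpy as np

def rope_rotate(x, position, base=10000.0):
    """Rotate consecutive feature pairs of x by angles proportional to a
    real-valued position (hypothetical helper, standard RoPE-style rotation)."""
    d = x.shape[-1]
    assert d % 2 == 0, "feature dimension must be even"
    # One rotation frequency per feature pair (assumed base, as in common RoPE setups)
    freqs = base ** (-np.arange(0, d, 2) / d)
    angles = position * freqs  # position may be any float, e.g. an irregular timestamp
    cos, sin = np.cos(angles), np.sin(angles)
    x_even, x_odd = x[..., 0::2], x[..., 1::2]
    out = np.empty_like(x)
    out[..., 0::2] = x_even * cos - x_odd * sin
    out[..., 1::2] = x_even * sin + x_odd * cos
    return out

def attention_score(q, k, pos_q, pos_k):
    """Unnormalized attention score between a rotated query and a rotated key."""
    return float(np.dot(rope_rotate(q, pos_q), rope_rotate(k, pos_k)))

rng = np.random.default_rng(0)
q = rng.normal(size=8)
k = rng.normal(size=8)

# Two pairs of irregular timestamps with the same offset between query and key
score_a = attention_score(q, k, 0.37, 1.92)
score_b = attention_score(q, k, 0.37 + 5.5, 1.92 + 5.5)

# The scores agree up to floating-point error: only the offset matters
print(abs(score_a - score_b) < 1e-9)
```

Because each feature pair is rotated by an angle proportional to the position, shifting both positions by the same amount leaves the score unchanged. This sketch covers only the rotation itself; it does not model the effect of learned embeddings in the input sequence that the abstract reports.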

Uros Zivanovic, Serafina Di Gioia, Andre Scaffidi, Martín de los Rios, Gabriella Contardo, Roberto Trotta • 2025

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Audio Classification | ESC-50 | Accuracy 84.7 | 441 |
| Multivariate Time Series Classification | UEA Multivariate Time Series Classification Archive | -- | 26 |
| Regression | Pendulum | MSE 3.32 | 11 |
| Interpolation | 2-dimensional spirals | RMSE 0.0183 | 7 |
| Light curve classification | ELAsTiCC | F1 Score 80.29 | 6 |
| Multivariate Time Series Classification | UEA Multivariate Time-series Archive CT | Accuracy 98.82 | 5 |
| Multivariate Time Series Classification | UEA Multivariate Time-series Archive EP | Accuracy 95.17 | 5 |
| Multivariate Time Series Classification | UEA Multivariate Time-series Archive LSST | Accuracy 62.25 | 5 |
| Multivariate Time Series Classification | UEA Multivariate Time-series Archive HB | Accuracy 74.47 | 5 |
| Interpolation | PhysioNet 50% masking | MSE 0.467 | 2 |
Showing 10 of 11 rows
