SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning

About

Modern deep reinforcement learning (DRL) methods have made significant advances in handling continuous action spaces. However, real-world control systems, especially those requiring precise and reliable performance, often demand interpretability in the sense of a-priori assessments of agent behavior to identify safe or failure-prone interactions with environments. To address this limitation, this work proposes SALSA-RL (Stability Analysis in the Latent Space of Actions), a novel RL framework that models control actions as dynamic, time-dependent variables evolving within a latent space. By employing a pre-trained encoder-decoder and a state-dependent linear system, this approach enables interpretability through local stability analysis, where instantaneous growth in action-norms can be predicted before their execution. It is demonstrated that SALSA-RL can be deployed in a non-invasive manner for assessing the local stability of actions from pretrained RL agents without compromising on performance across diverse benchmark environments. By enabling a more interpretable analysis of action generation, SALSA-RL provides a powerful tool for advancing the design, analysis, and theoretical understanding of RL systems.

Xuyang Li, Romit Maulik• 2025

Related benchmarks

Task	Dataset	Result
Reinforcement Learning	LunarLanderContinuous v2	Mean Reward268.3	65
Reinforcement Learning	cartpole	Average Reward1.00e+3	29
Reinforcement Learning	Pendulum	Avg Episode Reward-149.2	26
Reinforcement Learning	BipedalWalker	Average Episode Reward280.9	26

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord