Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Memory-Augmented Potential Field Theory: A Framework for Adaptive Control in Non-Convex Domains

About

Stochastic optimal control methods often struggle in complex non-convex landscapes, frequently becoming trapped in local optima due to their inability to learn from historical trajectory data. This paper introduces Memory-Augmented Potential Field Theory, a unified mathematical framework that integrates historical experience into stochastic optimal control. Our approach dynamically constructs memory-based potential fields that identify and encode key topological features of the state space, enabling controllers to automatically learn from past experiences and adapt their optimization strategy. We provide a theoretical analysis showing that memory-augmented potential fields possess non-convex escape properties, asymptotic convergence characteristics, and computational efficiency. We implement this theoretical framework in a Memory-Augmented Model Predictive Path Integral (MPPI) controller that demonstrates significantly improved performance in challenging non-convex environments. The framework represents a generalizable approach to experience-based learning within control systems (especially robotic dynamics), enhancing their ability to navigate complex state spaces without requiring specialized domain knowledge or extensive offline training.

Dongzhe Zheng, Wenjie Mei• 2025

Related benchmarks

TaskDatasetResultRank
Continuous ControlBipedalWalker v3
Episodic Cumulative Reward298.4
15
Continuous ControlHalfCheetah v4
Max Average Return5.89e+3
12
Continuous ControlPendulum v1
Average Cumulative Reward-152.4
7
Continuous ControlHumanoid v4
Average Cumulative Reward4.98e+3
7
Robotic ControlPendulum v1
Local Optima Escape Rate89.2
7
Robotic ControlBipedalWalker v3
Local Optima Escape Rate83.5
7
Robotic ControlHalfCheetah v4
Local Optima Escape Rate76.8
7
Robotic ControlHumanoid v4
Local Optima Escape Rate72.3
7
Power System ControlIEEE 39-bus New England test system critical disturbances simulation
Constraint Violations (%)2.3
6
UAV Obstacle AvoidanceUAV Obstacle Avoidance environment 100 trials (test)
Success Rate94.3
6
Showing 10 of 10 rows

Other info

Follow for update