Resilient Decentralized Ergodic Coverage for Scalable Multi-Robot Systems in Unknown Time-Varying Environments

About

Maintaining situational awareness in high-stakes multi-robot applications requires balancing exploration of unobserved regions with sustained monitoring of changing Regions of Interest (ROIs), often under unknown and time-varying distributions, partial observability, and limited communication. We propose a decentralized multi-agent coverage framework that serves as a high-level planning strategy, in which each agent computes an adaptive ergodic policy, implemented via a Markov-chain, that tracks an updated belief over the underlying importance map. Beliefs are maintained online via Gaussian Process (GP) regression from local noisy observations exchanged with neighbors. The resulting policy drives agents to spend time in ROIs in proportion to their estimated importance, while preserving sufficient exploration to detect and adapt to time-varying environmental changes. Unlike existing approaches that assume known importance maps, centralized coordination, or a static environment, our framework addresses the combined challenges of unknown, time-varying distributions under a decentralized, partially observable setting. We further show that our framework is robust to communication and memory degradation, robot loss, and can scale up to hundreds of robots.

Maria G. Mendoza, Victoria Marie Tuck, Chinmay Maheshwari, Shankar Sastry• 2026

Related benchmarks

Task	Dataset	Result
Regions of Interest Discovery	5x5 Grid	Timestep14.3	9
Full Map Exploration	5x5 Grid Environment	Success Rate100	3
Full Map Exploration	10x10 Grid Environment	Success Rate100	3
Regions of Interest Discovery	10x10 Grid	Timesteps36.8	3

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord