Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Sentinel: Embodied Cooperative Spatial Reasoning and Planning

About

In this work, we study Cooperative Spatial Intelligence, the ability of decentralized embodied agents to coordinate effectively under dynamic environmental constraints across city-scale outdoor domains. We introduce Sentinel Challenge, a benchmark where multiple decentralized embodied agents must communicate in natural language to agree on a mutually safe and convenient meeting point within large, city-scale outdoor environments. Each agent must then navigate safely while avoiding dynamic sentinels patrolling the area, using a tool that provides coarse spatial information. To address this, we propose CoSaR (Cooperative Spatial Reasoning and Planning), a framework that bridges the high-level communication and planning abilities of foundation models with the precision of classical spatial navigation algorithms. CoSaR enables agents to exchange situational updates, reason over evolving spatial constraints, and collaboratively replan trajectories. Evaluated across 14 city-level scenes with 3-5 agents, CoSaR consistently leads to faster gathering, shorter path lengths, and improved safety. Our results demonstrate that integrating dynamic communication with spatial reasoning is essential for robust multi-agent cooperation. By formalizing this new setting and providing a scalable benchmark, we aim to build a foundation for advancing cooperative spatial intelligence in embodied multi-agent systems. Code and challenge are available at https://github.com/UMass-Embodied-AGI/Sentinel.

Xiangye Lin, Hongxin Zhang, Ruxi Deng, Qinhong Zhou, Chuang Gan• 2026

Related benchmarks

TaskDatasetResultRank
Multi-agent coordinationSentinel Challenge 10 Stationary Sentinels
Success Rate53.57
13
Multi-agent coordinationSentinel Challenge 10 Patrolling Sentinels
Success Rate38.1
13
Multi-Agent Evasion and NavigationSentinel Challenge 20 Stationary Sentinels
Success Rate35.71
11
Multi-agent navigation and evasionSentinel Challenge 10 Patrolling Sentinels (avg 14 scenes, 2 runs)
Success Rate64.29
11
Multi-Agent Evasion and NavigationSentinel Challenge 20 Patrolling Sentinels
Success Rate32.14
11
Multi-agent navigation and evasionSentinel Challenge 10 Stationary Sentinels (avg 14 scenes, 2 runs)
Success Rate57.14
11
Embodied Cooperative Spatial Reasoning and PlanningSentinel Challenge 5 Stationary Sentinels
Success Rate39.29
6
Embodied Cooperative Spatial Reasoning and PlanningSentinel Challenge 5 Patrolling Sentinels
Success Rate35.71
6
Embodied Cooperative Spatial Reasoning and PlanningSentinel Challenge Oracle Perception 5 Stationary Sentinels (average over 14 scenes and 2 runs)
Success Rate67.86
5
Embodied Cooperative Spatial Reasoning and PlanningSentinel Challenge Oracle Perception 5 Patrolling Sentinels (average over 14 scenes and 2 runs)
Success Rate57.14
5
Showing 10 of 10 rows

Other info

Follow for update