Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems

About

Multi-modal learning has emerged as a key technique for improving performance across domains such as autonomous driving, robotics, and reasoning. However, in certain scenarios, particularly in resource-constrained environments, some modalities available during training may be absent during inference. While existing frameworks effectively utilize multiple data sources during training and enable inference with reduced modalities, they are primarily designed for single-agent settings. This poses a critical limitation in dynamic environments such as connected autonomous vehicles (CAV), where incomplete data coverage can lead to decision-making blind spots. Conversely, some works explore multi-agent collaboration but without addressing missing modality at test time. To overcome these limitations, we propose Collaborative Auxiliary Modality Learning (CAML), a novel multi-modal multi-agent framework that enables agents to collaborate and share multi-modal data during training, while allowing inference with reduced modalities during testing. Experimental results in collaborative decision-making for CAV in accident-prone scenarios demonstrate that CAML achieves up to a 58.1% improvement in accident detection. Additionally, we validate CAML on real-world aerial-ground robot data for collaborative semantic segmentation, achieving up to a 10.6% improvement in mIoU.

Rui Liu, Yu Shen, Peng Gao, Pratap Tokekar, Ming Lin• 2025

Related benchmarks

TaskDatasetResultRank
Multi-agent Target NavigationMatterport3D Studio scene
Steps21.59
10
Target NavigationMaze Scene
Distance Traveled6.77
5
Multi-agent Target NavigationMatterport3D Corridor scene
Steps116.3
5
Multi-agent Target NavigationMatterport3D Ranch scene
Steps396.3
5
Target NavigationStudio Scene
Distance2.91
5
Target NavigationApartment Scene
Distance3.93
5
Target NavigationCorridor Scene
Distance9.75
5
Target NavigationRanch Scene
Distance6.87
5
Multi-agent Target NavigationMatterport3D Maze scene
Steps624.5
5
Showing 9 of 9 rows

Other info

Follow for update