ReST: A Reconfigurable Spatial-Temporal Graph Model for Multi-Camera Multi-Object Tracking

About

Multi-Camera Multi-Object Tracking (MC-MOT) utilizes information from multiple views to better handle problems with occlusion and crowded scenes. Recently, the use of graph-based approaches to solve tracking problems has become very popular. However, many current graph-based methods do not effectively utilize information regarding spatial and temporal consistency. Instead, they rely on single-camera trackers as input, which are prone to fragmentation and ID switch errors. In this paper, we propose a novel reconfigurable graph model that first associates all detected objects across cameras spatially before reconfiguring it into a temporal graph for Temporal Association. This two-stage association approach enables us to extract robust spatial and temporal-aware features and address the problem with fragmented tracklets. Furthermore, our model is designed for online tracking, making it suitable for real-world applications. Experimental results show that the proposed graph model is able to extract more discriminating features for object tracking, and our model achieves state-of-the-art performance on several public datasets.

Cheng-Che Cheng, Min-Xuan Qiu, Chen-Kuo Chiang, Shang-Hong Lai• 2023

Related benchmarks

Task	Dataset	Result
Multi-Camera Multi-Object Tracking	Wildtrack	IDF186.7	28
Multi-view Multi-person Tracking	Wildtrack	MOTA84.9	27
Multi-Object Tracking	PETS 2009 (S2.L1)	MOTA92.3	26
Multiple Object Tracking	WILDTRACK (test)	IDF185.7	22
Multi-view Crowd Tracking	Wildtrack	MOTA81.6	16
Multi-Camera Multi-Object Tracking	CAMPUS (Garden 1)	MOTA77.6	10
Multi-Camera Multi-Object Tracking	CAMPUS Parkinglot	MOTA77.7	10
Multi-Camera Multi-Object Tracking	CAMPUS Garden 2	MOTA86	9
Multi-Camera Multi-Object Tracking	CAMPUS Auditorium	MOTA81.2	9

Showing 9 of 9 rows

Other info

Code

Follow for update

@wizwand_team Discord