Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2022

About

This report presents a brief description of our winning solution to the AVA Active Speaker Detection (ASD) task at ActivityNet Challenge 2022. Our underlying model UniCon+ continues to build on our previous work, the Unified Context Network (UniCon) and Extended UniCon which are designed for robust scene-level ASD. We augment the architecture with a simple GRU-based module that allows information of recurring identities to flow across scenes through read and update operations. We report a best result of 94.47% mAP on the AVA-ActiveSpeaker test set, which continues to rank first on this year's challenge leaderboard and significantly pushes the state-of-the-art.

Yuanhang Zhang, Susan Liang, Shuang Yang, Shiguang Shan• 2022

Related benchmarks

TaskDatasetResultRank
Active Speaker DetectionAVA-ActiveSpeaker v1.0 (val)
mAP94.7
27
Active Speaker DetectionAVA-ActiveSpeaker v1.0 (test)
mAP94.5
13
Showing 2 of 2 rows

Other info

Code

Follow for update