Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Denoising-Contrastive Alignment for Continuous Sign Language Recognition

About

Continuous sign language recognition (CSLR) aims to recognize signs in untrimmed sign language videos to textual glosses. A key challenge of CSLR is achieving effective cross-modality alignment between video and gloss sequences to enhance video representation. However, current cross-modality alignment paradigms often neglect the role of textual grammar to guide the video representation in learning global temporal context, which adversely affects recognition performance. To tackle this limitation, we propose a Denoising-Contrastive Alignment (DCA) paradigm. DCA creatively leverages textual grammar to enhance video representations through two complementary approaches: modeling the instance correspondence between signs and glosses from a discrimination perspective and aligning their global context from a generative perspective. Specifically, DCA accomplishes flexible instance-level correspondence between signs and glosses using a contrastive loss. Building on this, DCA models global context alignment between the video and gloss sequences by denoising the gloss representation from noise, guided by video representation. Additionally, DCA introduces gradient modulation to optimize the alignment and recognition gradients, ensuring a more effective learning process. By integrating gloss-wise and global context knowledge, DCA significantly enhances video representations for CSLR tasks. Experimental results across public benchmarks validate the effectiveness of DCA and confirm its video representation enhancement feasibility.

Leming Guo, Wanli Xue, Shengyong Chen• 2023

Related benchmarks

TaskDatasetResultRank
Continuous Sign Language RecognitionPHOENIX 2014 (dev)
Word Error Rate17.3
188
Continuous Sign Language RecognitionPHOENIX-2014 (test)
WER17.7
185
Continuous Sign Language RecognitionCSL-Daily (dev)
Word Error Rate (WER)25.6
98
Continuous Sign Language RecognitionCSL-Daily (test)
WER25.3
91
Continuous Sign Language RecognitionPHOENIX14-T (dev)
WER17
75
Continuous Sign Language RecognitionPHOENIX-2014T (test)
WER18.5
43
Showing 6 of 6 rows

Other info

Follow for update