CPEP: Contrastive Pose-EMG Pre-training Enhances Gesture Generalization on EMG Signals

About

Hand gesture classification using high-quality structured data such as videos, images, and hand skeletons is a well-explored problem in computer vision. Leveraging low-power, cost-effective biosignals, e.g. surface electromyography (sEMG), allows for continuous gesture prediction on wearables. In this paper, we demonstrate that learning representations from weak-modality data that are aligned with those from structured, high-quality data can improve representation quality and enables zero-shot classification. Specifically, we propose a Contrastive Pose-EMG Pre-training (CPEP) framework to align EMG and pose representations, where we learn an EMG encoder that produces high-quality and pose-informative representations. We assess the gesture classification performance of our model through linear probing and zero-shot setups. Our model outperforms emg2pose benchmark models by up to 21% on in-distribution gesture classification and 72% on unseen (out-of-distribution) gesture classification.

Wenhui Cui, Christopher Sandino, Hadi Pouransari, Ran Liu, Juri Minxha, Ellen Zippi, Aman Verma, Anna Sedlackova, Erdrin Azemi, Behrooz Mahasseni• 2025

Related benchmarks

Task	Dataset	Result
EMG-to-text retrieval	NinaPro DB2 (NP-class split)	Recall67.92	9
EMG-to-text retrieval	NinaPro-DB2 NP-rand	Recall86.19	9
EMG-to-text retrieval	NinaPro DB2 (NP-user split)	Recall63	9
EMG-to-text retrieval	EMG2Pose E2P-user	Recall61.7	9
EMG-to-text retrieval	EMG2Pose (E2P-rand)	Recall58.62	9
EMG-to-text retrieval	NinaPro-DB2 to NinaPro-DB3 cross-dataset	Recall45.74	9
EMG-to-text retrieval	EMG2Pose E2P-class	Recall62.95	9

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord