Beyond Physical Labels: Redefining Domains for Robust WiFi-based Gesture Recognition
About
In this paper, we propose GesFi, a novel WiFi-based gesture recognition system that introduces WiFi latent domain mining to redefine domains directly from the data itself. GesFi first processes raw sensing data collected from WiFi receivers using CSI-ratio denoising, the Short-Time Fourier Transform, and visualization techniques to generate standardized input representations. It then employs class-wise adversarial learning to suppress gesture semantics and leverages unsupervised clustering to automatically uncover the latent domain factors responsible for distributional shifts. These latent domains are then aligned through adversarial learning to support robust cross-domain generalization. Finally, the system is applied to the target environment for robust gesture inference. We deployed GesFi under both single-pair and multi-pair settings using commodity WiFi transceivers, and evaluated it across multiple public datasets and real-world environments. Compared to state-of-the-art baselines, GesFi achieves up to 78% and 50% performance improvements over existing adversarial methods, and consistently outperforms prior generalization approaches across most cross-domain tasks.
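The preprocessing and domain-mining steps named in the abstract can be illustrated with a minimal NumPy sketch. It is not the GesFi implementation. It assumes the CSI ratio is taken between two antennas that share the same random phase offset, uses a simple Hann-windowed FFT as the time-frequency transform, and substitutes plain k-means with farthest-point initialization for the unsupervised clustering, which the abstract does not specify. All function names, parameters, and the synthetic signals are hypothetical.

```python
import numpy as np

def csi_ratio(csi_a, csi_b, eps=1e-8):
    """Divide one antenna's complex CSI by another's to cancel shared phase offsets."""
    return csi_a / (csi_b + eps)

def stft_magnitude(x, win_len=64, hop=16):
    """Magnitude spectrogram from a Hann-windowed FFT; shape (win_len, n_frames)."""
    window = np.hanning(win_len)
    n_frames = 1 + (len(x) - win_len) // hop
    frames = np.stack([x[i * hop:i * hop + win_len] * window for i in range(n_frames)])
    return np.abs(np.fft.fft(frames, axis=1)).T

def kmeans_labels(features, k, n_iter=20):
    """Plain k-means (assumed stand-in); cluster indices act as pseudo domain labels."""
    # Deterministic farthest-point initialization, starting from the first sample.
    centers = [features[0]]
    for _ in range(k - 1):
        d = np.min([np.linalg.norm(features - c, axis=1) for c in centers], axis=0)
        centers.append(features[d.argmax()])
    centers = np.array(centers, dtype=float)
    for _ in range(n_iter):
        dists = np.linalg.norm(features[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = features[labels == j].mean(axis=0)
    return labels

# Synthetic demo: a static path plus a 125 Hz Doppler component, sampled at 1 kHz.
fs = 1000
t = np.arange(1000) / fs
rng = np.random.default_rng(0)
phase_noise = np.exp(1j * rng.uniform(-np.pi, np.pi, size=t.size))  # shared by both antennas
clean = 1.0 + 0.5 * np.exp(2j * np.pi * 125 * t)
csi_a = clean * phase_noise
csi_b = phase_noise                      # reference antenna sees only the static path
ratio = csi_ratio(csi_a, csi_b)          # random phase cancels in the ratio
spec = stft_magnitude(ratio - ratio.mean())  # remove the static component first

# Stand-in feature vectors from two hypothetical latent domains.
features = np.vstack([rng.normal(0.0, 0.1, (10, 4)), rng.normal(10.0, 0.1, (10, 4))])
domain_labels = kmeans_labels(features, k=2)
```

In this toy setting the ratio recovers the noise-free signal and the spectrogram energy concentrates in the FFT bin that corresponds to 125 Hz. In a pipeline like the one the abstract describes, the clustered features would presumably be learned representations with gesture semantics already suppressed, and the resulting pseudo domain labels would feed the adversarial alignment stage.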
Related benchmarks
| Task | Dataset | Accuracy | Rank |
|---|---|---|---|
| Gesture Recognition | Widar3 Cross-Loc | 99.26 | 18 |
| Gesture Recognition | Widar3 (In-domain) | 99.73 | 18 |
| Gesture Recognition | Widar3 Cross-Ori | 96.45 | 18 |
| Gesture Recognition | Widar3 (Cross-Env) | 99.32 | 17 |
| Gesture Recognition | Widar3 (Cross-User) | 99.37 | 11 |
| Gesture Recognition | Widar3 Cross-Loc-Env | 98.82 | 7 |
| Gesture Recognition | Widar3 Cross-Ori-Env | 95.66 | 7 |
| Gesture Recognition | XRF55 Cross-Env | 62.15 | 6 |
| Gesture Recognition | ARIL Cross-Loc | 75.52 | 5 |
| Gesture Recognition | XRF55 In-Domain | 92 | 5 |