FacePhys: State of the Heart Learning
About
Vital sign measurement using cameras presents opportunities for comfortable, ubiquitous health monitoring. Remote photoplethysmography (rPPG), a foundational technology, enables cardiac measurement through minute changes in light reflected from the skin. However, practical deployment is limited by the computational constraints of performing analysis on front-end devices and the accuracy degradation of transmitting data through compressive channels that reduce signal quality. We propose a memory efficient rPPG algorithm - \emph{FacePhys} - built on temporal-spatial state space duality, which resolves the trilemma of model scalability, cross-dataset generalization, and real-time operation. Leveraging a transferable heart state, FacePhys captures subtle periodic variations across video frames while maintaining a minimal computational overhead, enabling training on extended video sequences and supporting low-latency inference. FacePhys establishes a new state-of-the-art, with a substantial 49\% reduction in error. Our solution enables real-time inference with a memory footprint of 3.6 MB and per-frame latency of 9.46 ms -- surpassing existing methods by 83\% to 99\%. These results translate into reliable real-time performance in practical deployments, and a live demo is available at https://www.facephys.com/.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Heart Rate estimation | UBFC | MAE0.43 | 40 | |
| Heart Rate estimation | PURE | MAE0.24 | 33 | |
| Pulse Rate Estimation | MMPD | MAE5.3 | 22 | |
| Remote Photoplethysmography (rPPG) | Mobile/ARM Hardware Efficiency | Frame Latency (ms)9.46 | 13 | |
| Heart Rate estimation | VitalVideo | MAE0.77 | 12 | |
| HR estimation | MMPD (five-fold cross-validation) | MAE5.58 | 6 | |
| HR estimation | VitalVideo (five-fold cross-val) | MAE0.64 | 6 | |
| HR estimation | UBFC (five-fold cross-validation) | MAE0.46 | 6 | |
| HR estimation | PURE (five-fold cross-validation) | MAE0.26 | 6 |