Less is More: Micro-expression Recognition from Video using Apex Frame
About
Despite recent interest and advances in facial micro-expression research, there is still plenty room for improvement in terms of micro-expression recognition. Conventional feature extraction approaches for micro-expression video consider either the whole video sequence or a part of it, for representation. However, with the high-speed video capture of micro-expressions (100-200 fps), are all frames necessary to provide a sufficiently meaningful representation? Is the luxury of data a bane to accurate recognition? A novel proposition is presented in this paper, whereby we utilize only two images per video: the apex frame and the onset frame. The apex frame of a video contains the highest intensity of expression changes among all frames, while the onset is the perfect choice of a reference frame with neutral expression. A new feature extractor, Bi-Weighted Oriented Optical Flow (Bi-WOOF) is proposed to encode essential expressiveness of the apex frame. We evaluated the proposed method on five micro-expression databases: CAS(ME)$^2$, CASME II, SMIC-HS, SMIC-NIR and SMIC-VIS. Our experiments lend credence to our hypothesis, with our proposed technique achieving a state-of-the-art F1-score recognition performance of 61% and 62% in the high frame rate CASME II and SMIC-HS databases respectively.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Micro-expression recognition | CASME II | UF178.1 | 25 | |
| Micro-expression recognition | SMIC | UF10.573 | 20 | |
| Micro-expression recognition | SAMM | UF152.1 | 19 | |
| Micro-expression recognition | CASME II (LOSO) | UF10.7805 | 13 | |
| Micro-expression recognition | SAMM (LOSO) | UF152.11 | 13 | |
| Micro-expression recognition | Full (LOSO) | UF162.96 | 13 | |
| Micro-expression recognition | SMIC (LOSO) | UF157.27 | 13 | |
| Micro-expression recognition | MEGC Full 2019 | UF10.63 | 12 |