A Multimodal Anomaly Detector for Robot-Assisted Feeding Using an LSTM-based Variational Autoencoder
About
The detection of anomalous executions is valuable for reducing potential hazards in assistive manipulation. Multimodal sensory signals can be helpful for detecting a wide range of anomalies. However, the fusion of high-dimensional and heterogeneous modalities is a challenging problem. We introduce a long short-term memory based variational autoencoder (LSTM-VAE) that fuses signals and reconstructs their expected distribution. We also introduce an LSTM-VAE-based detector using a reconstruction-based anomaly score and a state-based threshold. For evaluations with 1,555 robot-assisted feeding executions including 12 representative types of anomalies, our detector had a higher area under the receiver operating characteristic curve (AUC) of 0.8710 than 5 other baseline detectors from the literature. We also show the multimodal fusion through the LSTM-VAE is effective by comparing our detector with 17 raw sensory signals versus 4 hand-engineered features.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Anomaly Detection | SMD | F1 Score46.24 | 217 | |
| Anomaly Detection | SWaT | F1 Score76.57 | 174 | |
| Anomaly Detection | MSL | Precision14.61 | 39 | |
| Anomaly Detection | SWaT (test) | -- | 34 | |
| Time Series Anomaly Detection | SMAP | Precision85.51 | 32 | |
| Anomaly Detection | SMD (test) | -- | 25 | |
| Multivariate Time Series Anomaly Detection | MSL | Precision27.23 | 24 | |
| Time Series Anomaly Detection | UCR-AD archive | Top-1 Accuracy19.8 | 23 | |
| Multivariate Time Series Anomaly Detection | SWaT | Precision97.07 | 19 | |
| Multivariate Time Series Anomaly Detection | SMAP | Precision29.65 | 19 |