CTIN: Robust Contextual Transformer Network for Inertial Navigation
About
Recently, data-driven inertial navigation approaches have demonstrated their capability of using well-trained neural networks to obtain accurate position estimates from inertial measurement units (IMU) measurements. In this paper, we propose a novel robust Contextual Transformer-based network for Inertial Navigation~(CTIN) to accurately predict velocity and trajectory. To this end, we first design a ResNet-based encoder enhanced by local and global multi-head self-attention to capture spatial contextual information from IMU measurements. Then we fuse these spatial representations with temporal knowledge by leveraging multi-head attention in the Transformer decoder. Finally, multi-task learning with uncertainty reduction is leveraged to improve learning efficiency and prediction accuracy of velocity and trajectory. Through extensive experiments over a wide range of inertial datasets~(e.g. RIDI, OxIOD, RoNIN, IDOL, and our own), CTIN is very robust and outperforms state-of-the-art models.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Pedestrian Inertial Odometry | Ronin | ATE4.878 | 13 | |
| Pedestrian Inertial Odometry | IMUNet | ATE5.589 | 13 | |
| Pedestrian Inertial Odometry | RIDI | ATE2.325 | 13 | |
| Pedestrian Inertial Odometry | RNIN | ATE4.732 | 13 | |
| Pedestrian Inertial Odometry | OxIOD | Absolute Trajectory Error (ATE)4.846 | 13 | |
| Inertial Odometry | RoNIN (unseen) | ATE (m)5.25 | 10 | |
| Inertial Odometry | GrandTour | ATE (m)6.67 | 10 | |
| Inertial Odometry | RoNIN (seen) | ATE (m)4.34 | 10 | |
| Inertial Odometry | Go2 | ATE (m)3.54 | 10 |