Unsupervised Monocular Visual-inertial Odometry Network

Peng Wei; Guoliang Hua; Weibo Huang; Fanyang Meng; Hong Liu

doi:10.24963/ijcai.2020/325

Unsupervised Monocular Visual-inertial Odometry Network

Peng Wei, Guoliang Hua, Weibo Huang, Fanyang Meng, Hong Liu

Long video

Short video

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence

Main track. Pages 2347-2354. https://doi.org/10.24963/ijcai.2020/325

PDF BibTeX

Recently, unsupervised methods for monocular visual odometry (VO), with no need for quantities of expensive labeled ground truth, have attracted much attention. However, these methods are inadequate for long-term odometry task, due to the inherent limitation of only using monocular visual data and the inability to handle the error accumulation problem. By utilizing supplemental low-cost inertial measurements, and exploiting the multi-view geometric constraint and sequential constraint, an unsupervised visual-inertial odometry framework (UnVIO) is proposed in this paper. Our method is able to predict the per-frame depth map, as well as extracting and self-adaptively fusing visual-inertial motion features from image-IMU stream to achieve long-term odometry task. A novel sliding window optimization strategy, which consists of an intra-window and an inter-window optimization, is introduced for overcoming the error accumulation and scale ambiguity problem. The intra-window optimization restrains the geometric inferences within the window through checking the photometric consistency. And the inter-window optimization checks the 3D geometric consistency and trajectory consistency among predictions of separate windows. Extensive experiments have been conducted on KITTI and Malaga datasets to demonstrate the superiority of UnVIO over other state-of-the-art VO / VIO methods. The codes are open-source.

Keywords:

Machine Learning: Unsupervised Learning

Robotics: Localization, Mapping, State Estimation

Robotics: Vision and Perception