DOI QR코드

DOI QR Code

Real-time Simultaneous Localization and Mapping (SLAM) for Vision-based Autonomous Navigation

영상기반 자동항법을 위한 실시간 위치인식 및 지도작성

  • Lim, Hyon (Dept. of Mechanical and Aerospace Engineering, Seoul Nat'l Univ.) ;
  • Lim, Jongwoo (Division of Computer Science and Engineering, Hanyang Univ.) ;
  • Kim, H. Jin (Dept. of Mechanical and Aerospace Engineering, Seoul Nat'l Univ.)
  • 임현 (서울대학교 우주항공공학과) ;
  • 임종우 (한양대학교 컴퓨터공학과) ;
  • 김현진 (서울대학교 우주항공공학과)
  • Received : 2014.07.03
  • Accepted : 2015.03.12
  • Published : 2015.05.01

Abstract

In this paper, we propose monocular visual simultaneous localization and mapping (SLAM) in the large-scale environment. The proposed method continuously computes the current 6-DoF camera pose and 3D landmarks position from video input. The proposed method successfully builds consistent maps from challenging outdoor sequences using a monocular camera as the only sensor. By using a binary descriptor and metric-topological mapping, the system demonstrates real-time performance on a large-scale outdoor dataset without utilizing GPUs or reducing input image size. The effectiveness of the proposed method is demonstrated on various challenging video sequences.

본 논문에서는 넓은 지역에서의 영상기반 자동 항법을 위한 실시간 위치인식 및 지도작성 방법을 제안한다. 한 대의 카메라에서 입력된 영상으로부터 제안된 방법은 6 자유도 카메라 자세와 3 차원 특징점 위치를 연속적으로 계산한다. 제안된 방법은 넓은 지역을 주행하며 촬영된 영상에 적용하여 그 위치와 환경지도를 성공적으로 작성하였이다. 본 논문에서는 이진기술자(binary descriptor)와 수치-위상(metric-topological)지도 표현법을 사용하여 GPU 나 영상의 축소 없이 실시간 성능과 광범위한 지역에서의 회귀점 검출(loop detection)을 하였다. 제안된 방법은 여러 환경에서 촬영된 영상과, 해당 영상의 GPS 기준값과 비교하여 평가하였다.

Keywords

References

  1. Geiger, A., Lenz, P. and Urtasun, R., 2012, "Are We Ready for Autonomous Driving? the Kitti Vision Benchmark Suite," in CVPR. IEEE, pp. 3354-3361.
  2. Davison, A. J., 2003, "Real-time Simultaneous Localisation and Mapping with a Single Camera," in ICCV. IEEE, pp. 1403-1410.
  3. Klein, G. and Murray, D., 2007, "Parallel Tracking and Mapping for Small AR Workspaces," in ISMAR. IEEE.
  4. Strasdat, H., Montiel, J. and Davison, A., 2010, "Scale Drift-Aware Large Scale Monocular Slam," in Proceedings of Robotics: Science and Systems (RSS), p. 5.
  5. Strasdat, H., Montiel, J. and Davison, A. J., 2010, "Real-time Monocular Slam: Why Filter?" in ICRA. IEEE, pp. 2657-2664.
  6. Nister, D., Naroditsky, O. and Bergen, J., 2004, "Visual Odometry," in CVPR IEEE, 2004., Vol. 1, pp. 1-652.
  7. Mouragnon, E., Lhuillier, M., Dhome, M., Dekeyser, F. and Sayd, P., 2006, "Real Time Localization and 3D Reconstruction," in CVPR IEEE, Vol. 1, pp. 363-370.
  8. Konolige, K. and Agrawal, M., 2008, "Frameslam: from Bundle Adjustment to Real-time Visual Mapping," Robotics, IEEE Transactions on, Vol. 24, No. 5, pp. 1066-1077. https://doi.org/10.1109/TRO.2008.2004832
  9. Snavely, N., Seitz, S. M. and Szeliski, R., 2008, "Skeletal Graphs for Efficient Structure from Motion," in CVPR, Vol. 2.
  10. Kuemmerle, R., Grisetti, G., Strasdat, H., Konolige, K. and Burgard, W., 2011, "g2o: A General Framework for Graph Optimization," in ICRA.
  11. Lim, J., Frahm, J.-M. and Pollefeys, M., 2011, "Online Environment Mapping," in CVPR. IEEE, pp. 3489-3496.
  12. Strasdat, H., Davison, A. J., Montiel, J. and Konolige, K., 2011, "Double Window Optimisation for Constant Time Visual Slam," in ICCV. IEEE, pp. 2352-2359.
  13. Engels, C., Stewenius, H. and Nister, D., 2006, "Bundle Adjustment Rules," Photogrammetric Computer Vision, Vol. 2.
  14. Cummins, M. and Newman, P., 2008, "Fab-map: Probabilistic Localization and Mapping in the Space of Appearance," The International Journal of Robotics Research, Vol. 27, No. 6, pp. 647-665. https://doi.org/10.1177/0278364908090961
  15. Nister, D. and Stewenius, H., 2006, "Scalable Recognition with a Vocabulary Tree," in CVPR. IEEE.
  16. Calonder, M., Lepetit, V., Strecha, C. and Fua, P., 2010, "Brief: Binary Robust Independent Elementary Features," Computer Vision-ECCV 2010, pp. 778-792.
  17. Galvez-Lopez, D. and Tardos, J. D., 2011, "Realtime Loop Detection with Bags of Binary Words," in IROS. IEEE, pp. 51-58.
  18. Williams, B. and Reid, I., 2010, "On Combining Visual Slam and Visual Odometry," in Robotics and Automation (ICRA), 2010 IEEE International Conference on. IEEE, pp. 3494-3500.
  19. Mei, C., Sibley, G., Cummins, M., Newman, P. and Reid, I., 2011, "Rslam: A System for Large-Scale Mapping in Constant-time Using Stereo," International Journal of Computer Vision, Vol. 94, No. 2, pp. 198-214. https://doi.org/10.1007/s11263-010-0361-7
  20. Rosten, E. and Drummond, T., 2006, "Machine Learning for High-speed Corner Detection," in ECCV. Springer,
  21. Lowe, D. G., 2004, "Distinctive Image Features from Scale-Invariant Keypoints," IJCV, Vol. 60, No. 2, pp. 91-110. https://doi.org/10.1023/B:VISI.0000029664.99615.94
  22. Bay, H., Tuytelaars, T. and Van Gool, L., 2006, "Surf: Speeded up Robust Features," in ECCV. Springer, pp. 404-417.
  23. Lucas, B. D., Kanade, T., 1981, "An Iterative Image Registration Technique with an Application to Stereo Vision." in IJCAI, Vol. 81, pp. 674-679.
  24. Nister, D., 2004, "An Efficient Solution to the Five-point Relative Pose Problem," PAMI, Vol. 26, No. 6, pp. 756-770. https://doi.org/10.1109/TPAMI.2004.17
  25. Fischler, M. A. and Bolles, R. C., 1981, "Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography," Communications of the ACM, Vol. 24, No. 6, pp. 381-395. https://doi.org/10.1145/358669.358692
  26. Kneip, L., Scaramuzza, D. and Siegwart, R., 2011, "A Novel Parametrization of the Perspective-Threepoint Problem for a Direct Computation of Absolute Camera Position and Orientation," in CVPR. IEEE, pp. 2969-2976.
  27. Lim, J., Frahm, J.-M. and Pollefeys, M., 2012, "Online Environment Mapping Using Metrictopological Maps," The International Journal of Robotics Research, Vol. 31, No. 12, pp. 1394-1408. https://doi.org/10.1177/0278364912461455
  28. Agarwal, S., Mierle, K. and Others, "Ceres Solver," https://code.google.com/p/ceres-solver/.
  29. Geiger, A., Ziegler, J. and Stiller, C., 2011, "Stereoscan: Dense 3d Reconstruction in Real-time," in Intelligent Vehicles Symposium (IV), IEEE, pp. 963-968.