DOI QR코드

DOI QR Code

Efficient 3D Scene Labeling using Object Detectors & Location Prior Maps

물체 탐지기와 위치 사전 확률 지도를 이용한 효율적인 3차원 장면 레이블링

  • Kim, Joo-Hee (Artificial Intelligence Laboratory, Kyonggi University) ;
  • Kim, In-Cheol (Artificial Intelligence Laboratory, Kyonggi University)
  • 김주희 (경기대학교 컴퓨터과학과 인공지능연구실) ;
  • 김인철 (경기대학교 컴퓨터과학과 인공지능연구실)
  • Received : 2015.08.24
  • Accepted : 2015.10.22
  • Published : 2015.11.01

Abstract

In this paper, we present an effective system for the 3D scene labeling of objects from RGB-D videos. Our system uses a Markov Random Field (MRF) over a voxel representation of the 3D scene. In order to estimate the correct label of each voxel, the probabilistic graphical model integrates both scores from sliding window-based object detectors and also from object location prior maps. Both the object detectors and the location prior maps are pre-trained from manually labeled RGB-D images. Additionally, the model integrates the scores from considering the geometric constraints between adjacent voxels in the label estimation. We show excellent experimental results for the RGB-D Scenes Dataset built by the University of Washington, in which each indoor scene contains tabletop objects.

Keywords

References

  1. B. Douillard, D. Fox, F. Ramos, and H. Durrantwhyte, "Classification and semantic mapping of urban environments," The International Journal of Robotics Research, vol. 30, no. 1, pp. 5-32, 2011. https://doi.org/10.1177/0278364910373409
  2. C. Cheng, A. Koschan, C. H. Chen, D. L. Page, and M. Abidi, "Outdoor scene image segmentation based on background recognition and perceptual organization," IEEE Transactions on Image Processing, vol. 21, no. 3, pp. 1007-1019, 2012. https://doi.org/10.1109/TIP.2011.2169268
  3. I. Posner, M. Cummins, and P. Newman, "A generative framework for fast urban labeling using spatial and temporal context," Autonomous Robots, vol. 26, pp. 153-170, 2009 https://doi.org/10.1007/s10514-009-9110-6
  4. D. Anguelov, B. Taskar, V. Chatalbashev, D. Koller, D. Gupta, G. Heitz, and A. Ng, "Discriminative learning of markov random fields for segmentation of 3D scan data," Proc. of IEEE on Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 169-176, 2005.
  5. C. Farabet, C. Couprie, L. Najman, and Y. LeCun, "Learning hierarchical features for scene labeling," Proc. of IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35, no. 8, pp. 1915-1929, 2013. https://doi.org/10.1109/TPAMI.2012.231
  6. H. S. Koppula, A. Anand, T. Joachims, and A. Saxena, "Semantic labeling of 3d point clouds for indoor scenes," Advances in Neural Information Processing Systems, pp. 244-252, 2011.
  7. N. Silberman and R. Fergus, "Indoor scene segmentation using a structured light sensor," Proc. of IEEE on International Conference on Computer Vision Workshops, pp. 601-608, 2011.
  8. K. Lai, L. Bo, X. Ren, and D. Fox, "Detection-based object labeling in 3D scenes," Proc. of IEEE on International Conference on Robotics and Automation, pp. 1330-1337, 2012.
  9. S. Helmer, D. Meger, M. Muja, J. J. Little, and D. G. Lowe, "Multiple viewpoint recognition and localization," Proc. of Asian Conference on Computer Vision, pp. 464-477, Springer Berlin Heidelberg, 2010.
  10. J. Malik, "Scene understanding from RGB-D images," Proc. of Scene Understanding Workshop, vol. 112, no. 2, pp. 133-149, 2015.
  11. R. Triebel, R. Schmidt, O. Martinez Mozos, and W. Burgard, "Instance-based amn classification for improved object recognition in 2d and 3d laser range data," Proc. of International Joint Conferences on Artificial Intelligence, Morgan Kaufmann Publishers Inc., 2007.
  12. A. Collet, M. Martinez, and S. Srinivasa, "Object recognition and full pose registration from a single image for robotic manipulation," Proc. of IEEE on International Conference on Robotics and Automation, pp. 48-55, 2009.
  13. N. Dalal and B. Triggs, "Histograms of oriented gradients for human detection," Proc. of IEEE Computer Society Conf. on Computer Vision and Pattern Recognition, vol. 1, pp. 886-893, 2005.
  14. K. Lai, L. Bo, X. Ren, and D. Fox "A large-scale hierarchical multiview RGB-D object dataset," Proc. of IEEE on International Conference on Robotics and Automation, 2011.
  15. D. S. Yoo, S. H. Kim, J. Y. Lee, and S. J. Lee, "Development of hazardous objects detection technology based on metal/non-metal detector," Journal of Institute of Control, Robotics and Systems (in Korean), vol. 20, no. 2, pp. 120-125, 2014. https://doi.org/10.5302/J.ICROS.2014.13.9003
  16. J. K. Park and J. B. Park, "An object recognition method based on depth information for an indoor mobile robot," Journal of Institute of Control, Robotics and Systems (in Korean), vol. 21, no. 10, pp. 958-964, 2015. https://doi.org/10.5302/J.ICROS.2015.15.0027