DOI QR코드

DOI QR Code

Combining Shape and SIFT Features for 3-D Object Detection and Pose Estimation

효과적인 3차원 객체 인식 및 자세 추정을 위한 외형 및 SIFT 특징 정보 결합 기법

  • 탁윤식 (고려대학교 전자컴퓨터공학과) ;
  • 황인준 (고려대학교 전기전자전파공학과)
  • Published : 2010.02.01

Abstract

Three dimensional (3-D) object detection and pose estimation from a single view query image has been an important issue in various fields such as medical applications, robot vision, and manufacturing automation. However, most of the existing methods are not appropriate in a real time environment since object detection and pose estimation requires extensive information and computation. In this paper, we present a fast 3-D object detection and pose estimation scheme based on surrounding camera view-changed images of objects. Our scheme has two parts. First, we detect images similar to the query image from the database based on the shape feature, and calculate candidate poses. Second, we perform accurate pose estimation for the candidate poses using the scale invariant feature transform (SIFT) method. We earned out extensive experiments on our prototype system and achieved excellent performance, and we report some of the results.

Keywords

References

  1. Y. Tak and E. Hwang, "Indexing and Matching Scheme for Recognizing 3D Objects from Single 2D Image," International Conference on Internet and Multimedia Systems and Applications (IMSA'09), pp. 60-67, 2009.
  2. L. Vacchetti, V. Lepetit and P. Fua, "Stable real-time 3D tracking using online and offline information," IEEE Transactions on PAMI, vol. 26, no. 10, pp. 1385-1391, 2004 https://doi.org/10.1109/TPAMI.2004.92
  3. J. Zhang, S.K. Zhou, L. McMilan and D. Comaniciu, "Joint Real-time Object Detection and Pose Estimation Using Probabilistic Boosting Network," CVPR'07, pp.1-8, 2007
  4. M. Saito and K. Kitaguchi, "Appearance based object pose estimation using regression models," SICE Annual Conference, pp.1926-1929, 2008
  5. D. G. Lowe, "Distinctive Image Features from Scale-Invariant Keypoints," IJCV, vol. 60, no. 2, pp. 91-110, Nov. 2004 https://doi.org/10.1023/B:VISI.0000029664.99615.94
  6. G. Panin and A. Knoll, "Fully Automatic Real-Time 3D Object Tracking using Active Contour and Appearance Models," Journal of Multimedia, vol. 1, no. 7, pp. 62-70, 2006.
  7. D. Wagner, G. Reitmayr, A. Mulloni, T. Drummond and D. Schmalstieg, "Pose Tracking from Natural Features on Mobile Phones," IEEE/ACM International Symposium on Mixed and Augmented Reality, pp. 125-134, 2008.
  8. Y. Nam, E. Hwang and D. Kim, "A smilarity-based leaf image retrieval scheme: Joining shape and venation features," Computer Vision and Image Understanding, vol. 110, no. 2, pp.245-259, 2008. https://doi.org/10.1016/j.cviu.2007.08.002
  9. E. Keogh and C. Ratanamahatana, "Exact indexing of dynamic time warping," Knowledge and Information Systems, Vol. 7, pp. 358-386, 2005. https://doi.org/10.1007/s10115-004-0154-9
  10. C. Faloutsos and K. Lin, "FastMap: a fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets," ACM SIGMOD, pp.163-174. 1995.
  11. H. Kim, Y. Tak and E. Hwang, "A Shape-Based Indexing Scheme for Camera View-Invariant 3-D Object Retrieval," Multimedia Tools and Applications, Accepted.
  12. C. Faloutsos, M. Ranganathan and Y. Manolopoulos, "Fast subsequence matching in time-series databases," ACM SIGMOD, pp.419-429, 1993
  13. Y. Tak and E. Hwang, "An indexing scheme for efficient camera angle invariant image retrieval," International Conference on Computer and Information Technology, pp.143-148, 2008