Real-time Multi-Objects Recognition and Tracking Scheme

  • Kim, Dae-Hoon (School of Electrical Engineering, Korea University) ;
  • Rho, Seung-Min (Division of Information and Communication, Baekseok University) ;
  • Hwang, Een-Jun (School of Electrical Engineering, Korea University)
  • Received : 2012.03.29
  • Accepted : 2012.04.30
  • Published : 2012.04.30

Abstract

In this paper, we propose an efficient multi-object recognition and tracking scheme based on the interest points of objects and their feature descriptors. To this end, we first define a set of object types of interest and collect sample images of each. For the sample images, we detect interest points and construct their feature descriptors using SURF. Next, we perform a statistical analysis of the local features to select representative points among them. Intuitively, the representative points of an object are the interest points that best characterize it. In addition, we generate movement vectors for the interest points by matching their SURF descriptors and track the object using these vectors. Since our scheme treats all objects independently, it can recognize and track multiple objects simultaneously. Through experiments, we show that the proposed scheme achieves reasonable performance.
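
The selection of representative points can be illustrated with a minimal sketch. The abstract does not specify the statistical analysis, so the criterion used here is an assumption: a descriptor counts as representative when a similar descriptor recurs in a sufficient fraction of the sample images. The function names, the `radius` and `min_support` parameters, and the two-dimensional toy descriptors (real SURF descriptors have 64 or 128 dimensions) are all hypothetical.

```python
import math


def select_representative(sample_descs, radius=0.2, min_support=0.6):
    """Pick descriptors that recur across sample images of one object type.

    sample_descs: list of samples, each a list of descriptor tuples.
    A descriptor is kept when at least `min_support` of the samples contain
    a descriptor within `radius` of it (assumed criterion, not the paper's).
    """
    n = len(sample_descs)
    reps = []
    for descs in sample_descs:
        for d in descs:
            # Count the samples that contain a descriptor close to d.
            support = sum(
                1
                for other in sample_descs
                if any(math.dist(d, o) <= radius for o in other)
            )
            # Skip near-duplicates of an already selected representative.
            is_new = not any(math.dist(d, r) <= radius for r in reps)
            if support / n >= min_support and is_new:
                reps.append(d)
    return reps


def recognize(query_descs, reps, radius=0.2, min_hits=0.5):
    """Report the object as present when enough representatives are matched."""
    if not reps:
        return False
    hits = sum(
        1 for r in reps if any(math.dist(r, q) <= radius for q in query_descs)
    )
    return hits / len(reps) >= min_hits


# Toy descriptors from three sample images of the same object type.
samples = [
    [(1.0, 0.0), (0.0, 1.0), (5.0, 5.0)],
    [(1.05, 0.0), (0.0, 0.95), (9.0, 2.0)],
    [(0.95, 0.05), (3.0, 7.0)],
]
reps = select_representative(samples)
print(reps)
print(recognize([(1.02, 0.01), (7.0, 7.0)], reps))
```

Because each object type would keep its own list of representatives, the same check can be run once per type on a frame, which matches the abstract's point that objects are handled independently.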

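In this paper, we propose a scheme for effectively recognizing and tracking multiple objects of interest in images and videos using local feature descriptors of the objects' interest points. To this end, we first collect a variety of images containing the target objects and apply the SURF algorithm to generate the objects' interest points and their local feature descriptors. Through a statistical analysis of the local features, we select from the interest points the representative points that best express the characteristics of each object, and we recognize the objects present in an image based on these points. In addition, we apply local feature descriptor matching to generate a movement vector for each SURF point and track objects in real time based on these vectors. Because the proposed scheme handles all objects independently, it can recognize and track multiple objects simultaneously. Through various experiments, we show that the scheme can quickly determine the presence and type of objects in a video and can effectively track objects of interest.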
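
The tracking step, in which movement vectors are derived from descriptor matches between frames, can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: the nearest-neighbour ratio test and the median aggregation of vectors are choices made here, and the function names, the `ratio` parameter, and the toy two-dimensional descriptors are hypothetical.

```python
import math
from statistics import median


def match_descriptors(prev_desc, curr_desc, ratio=0.7):
    """Match each previous-frame descriptor to its nearest current-frame one.

    A match is kept only if the nearest distance is clearly smaller than the
    second nearest (ratio test, an assumption made for this sketch).
    """
    matches = []
    for i, d in enumerate(prev_desc):
        dists = sorted((math.dist(d, c), j) for j, c in enumerate(curr_desc))
        if not dists:
            continue
        if len(dists) == 1 or dists[0][0] < ratio * dists[1][0]:
            matches.append((i, dists[0][1]))
    return matches


def movement_vectors(prev_pts, curr_pts, matches):
    """Displacement (dx, dy) of every matched interest point."""
    return [
        (curr_pts[j][0] - prev_pts[i][0], curr_pts[j][1] - prev_pts[i][1])
        for i, j in matches
    ]


def track_box(box, vectors):
    """Shift an (x, y, w, h) box by the median movement vector."""
    if not vectors:
        return box
    dx = median(v[0] for v in vectors)
    dy = median(v[1] for v in vectors)
    x, y, w, h = box
    return (x + dx, y + dy, w, h)


# Toy data: three interest points of one object in two consecutive frames.
prev_pts = [(10, 10), (20, 15), (30, 12)]
prev_desc = [(1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
curr_pts = [(24, 16), (14, 11), (34, 13)]  # detected in a different order
curr_desc = [(0.0, 0.95), (0.98, 0.0), (1.0, 1.02)]

matches = match_descriptors(prev_desc, curr_desc)
vectors = movement_vectors(prev_pts, curr_pts, matches)
print(track_box((5, 5, 30, 15), vectors))  # -> (9, 6, 30, 15)
```

The median is used instead of the mean so that a few wrong matches do not drag the box away. Running this per object, each with its own points and box, would allow several objects to be tracked in the same frame.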

Cited by

  1. Design and Implementation of an Optical Flow Estimator for Moving Object Detection in Advanced Driver Assistance Systems, vol.19, pp.6, 2012, https://doi.org/10.12673/jant.2015.19.6.544