DOI QR코드

DOI QR Code

A Shape Based Image Retrieval Method using Phase of ART

ART의 위상 정보를 이용한 형태기반 영상 검색 방법

  • 이종민 (한양대학교 전자컴퓨터통신공학부) ;
  • 김회율 (한양대학교 전자컴퓨터통신공학부)
  • Received : 2011.08.11
  • Accepted : 2011.12.07
  • Published : 2012.01.30

Abstract

Since shape of an object in an image carries important information in contents based image retrieval (CBIR), many shape description methods have been proposed to retrieve images using shape information. Among the existing shape based image retrieval methods, the method which employs invariant Zernike moment desciptor (IZMD) showed better performance compared to other methods which employ traditional Zernike moments descriptor in CBIR. In this paper, we propose a new image retrieval method which applies invariant angular radial transform descriptor (IARTD) to obtain higher performance than the method which employs IZMD in CBIR. IARTD is a rotationally invariant feature which consists of magnitudes and alligned phases of angular radial transform coefficients. To produce rotationally invariant phase coefficients, a phase correction scheme is performed while extracting the IARTD. The distance between two IARTDs is defined by combining the differences of the magnitudes and the aligned phases. Through the experiment using MPEG-7 shape dataset, the average bull's eye performance (BEP) of the proposed method is 0.5806 while the average BEPs of the exsiting methods which employ IZMD and traditional ART are 0.4234 and 0.3574, respectively.

영상에 포함된 객체의 형태는 내용 기반 영상검색에 있어서 중요한 정보를 가지고 있기 때문에, 이를 이용하여 영상을 검색하는 방법들이 활발히 연구되어 왔다. 그중에서도 최근에 제안된 저니키 모멘트의 위상과 크기를 이용하는 회전불변 서술자(IZMD: Invariant Zernike moment descriptor)을 이용한 영상 검색 방법은 기존의 크기 정보만을 이용한 저니키 모멘트 서술자보다 높은 영상 검색 성능을 보인다. 본 논문에서는 IZMD를 이용한 방법 보다 향상된 영상 검색 성능을 얻기 위해서 ART(Angular Radial Transform)의 크기와 위상을 이용한 회전 불변 특징 서술자(IARTD: Invariant Angular Radial Transform Descriptor)와 이를 이용해서 영상을 정합하는 방법을 제안한다. IARTD는 ART 기저함수의 특성을 이용해서 정렬된 ART 계수의 위상과 크기로 구성된 특징벡터이다. 영상의 검색은 두 IARTD의 크기차이와 위상차이의 곱을 이용하여 정의된 거리 계산 방법을 이용해서 수행한다. MPEG-7 데이터셋을 이용한 실험 결과, 제안하는 방법의 평균 BEP(Bull's Eye performance)는 0.5806으로서, ARTD나 IZMD를 이용한 영상 검색 결과의 평균 BEP 0.3574, 0.4234보다 우수한 검색 성능을 제공하는 것을 확인하였다.

Keywords

References

  1. S. Chang, J. Smith, M. Beigi and A. Benitez, "Visual Information Retrieval from Large Distributed Online Repositories," Communications of ACM, Vol. 12, pp. 12-20, 1997.
  2. Y. Rui, T. Huang, and S. Chang, Image retrieval: current techniques, promising directions and open issues, J. of Visual Communication and Image Representation, vol. 10, no.4, 39-62, 1999. https://doi.org/10.1006/jvci.1999.0413
  3. Y. Mingqiang, K. Kidiyo, and R. Joseph, (2008) "A Survey of Shape Feature Extraction Techniques," on Pattern Recognition Techniques, Technology and Applications, Vienna: i-Tech, pp.626.
  4. R. J. Prokop and A. P. Reeves, "A survey of moment-based techniques for unoccluded object representation and recognitino," Graphical Models and Image Processing, vol. 54, no. 5, pp. 438-460, Sep. 1992. https://doi.org/10.1016/1049-9652(92)90027-U
  5. C. H. The and R. T. Chin, "On image analysis by the method of moments," IEEE Trans, on Pattern Analysis and Machine Intelligence, vol. 10, no. 4, pp. 496-513, July. 1998. https://doi.org/10.1109/34.3913
  6. A. Khotnazard and Y. H. Hong, "Invariant image recognition by Zernike moments," IEEE Trans, on Pattern Analysis and Machine Intelligence, vol. 12, no. 5, pp. 489-497, May. 1990. https://doi.org/10.1109/34.55109
  7. M. Teague, "Image analysis via the general theory of moments," Journal of the Optical Society of America, Vo. 70. pp. 920-930, Aug. 1980. https://doi.org/10.1364/JOSA.70.000920
  8. Y. S. Kim and W. Y. Kim, "Content-based trademark retrieval system using visually salient feature," Journal of Image and Vision Computing, vol. 16, pp. 931-939, Aug. 1998 https://doi.org/10.1016/S0262-8856(98)00060-2
  9. Shan Li, Moon-Chuen Lee, and Chi-Man Pun, "Complex Zernike Moments Features for Shape-Based Image Retrieval," IEEE Trans. on Systems, Man, And Cybernetics-Part A: Sstems and Humans, vol. 3, no. 1, Jan. 2009
  10. J. Revaud et al., "Improving Zernike moments comparison for optimal similarity and rotation angle retrieval," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 31, no. 4, pp. 627-637, Apr. 2009. https://doi.org/10.1109/TPAMI.2008.115
  11. Z. Chen and S.K. Sun, "A Zernike moment phase-based descriptor for local image representation and matching," IEEE Trans. Image processing, vol. 19, no. 1, Jan 2010.
  12. A. V. Oppenheim and J. S. Lim, "The importance of phase in signals," Proc. IEEE, vol. 69, no. 5, pp. 529-550, 1981. https://doi.org/10.1109/PROC.1981.12022
  13. S. Jeannin, "Mpeg-7 Visual part of eXperimentation Model Version 9.0," in ISO/IEC JTC1/SC29/WG11/N3914, 55th Mpeg Meeting, Pisa, Italia, Jan. 2001.
  14. J Ricard, D Coeurjolly, A. Baskurt, "Generalization of angular radial transform for 2D and 3D shape retrieval," Pattern Recognition Letters, vol. 26, no. 14, pp. 2174-2186, Oct. 2005. https://doi.org/10.1016/j.patrec.2005.03.030
  15. W. Y. Kim and Y. S. Kim, "A new region-based shape descirptor: The ART (Angular Radial Transform) Descriptor," ISO/IEC MPEG99/M5472, Maui, Dec. 1999.