Web Image Classification using Semantically Related Tags and Image Content

  • Soosun Cho (Department of Computer and Information Engineering, Chungju National University)
  • Received : 2010.02.10
  • Accepted : 2010.03.14
  • Published : 2010.06.30

Abstract

In this paper, we propose an image classification method that combines the semantic relations among tags with the content of the images themselves, with the aim of improving user satisfaction with image retrieval on large online image-sharing sites. For image retrieval and classification algorithms to be useful on large image-sharing sites such as Flickr, they must be applicable to real tagged Web images. The proposed algorithm classifies Web images by their 'bag of visual words' content representation: images initially retrieved with semantically related tags serve as training data for the category model, and test images are then classified by applying probabilistic latent semantic analysis (PLSA). In experiments on Flickr Web images, the proposed method achieved higher precision and recall than the existing method that uses tag information.

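The pipeline summarized in the abstract (train a category model on tag-retrieved images, then classify test images with PLSA) can be illustrated with a minimal sketch. This is not the paper's implementation: the synthetic visual-word histograms, the vocabulary size, the two categories, the soft initialization of P(z|d) from the retrieval tag, and the helper names `normalize`, `plsa_em`, and `sample_images` are all assumptions made for illustration. In practice the histograms would come from quantized local descriptors of real images.

```python
import numpy as np

def normalize(a, axis):
    """Scale non-negative entries so they sum to 1 along `axis`."""
    return a / np.maximum(a.sum(axis=axis, keepdims=True), 1e-12)

def plsa_em(counts, p_z_d, p_w_z=None, n_iter=50):
    """PLSA EM on an (images x visual words) count matrix.

    p_z_d: (images x topics) initial P(z|d).
    p_w_z: (topics x words) P(w|z); if given, it stays fixed (fold-in).
    """
    fixed = p_w_z is not None
    if not fixed:
        # Initial M-step: P(w|z) proportional to sum_d n(d,w) P(z|d)
        p_w_z = normalize(p_z_d.T @ counts + 1e-9, axis=1)
    for _ in range(n_iter):
        # E-step: P(z|d,w) proportional to P(z|d) P(w|z)
        post = normalize(p_z_d[:, :, None] * p_w_z[None, :, :], axis=1)
        weighted = counts[:, None, :] * post  # n(d,w) P(z|d,w)
        # M-step: re-estimate P(z|d), and P(w|z) unless folding in
        p_z_d = normalize(weighted.sum(axis=2), axis=1)
        if not fixed:
            p_w_z = normalize(weighted.sum(axis=0), axis=1)
    return p_z_d, p_w_z

# --- Synthetic stand-in for bag-of-visual-words histograms (assumption) ---
rng = np.random.default_rng(0)
n_words = 20  # hypothetical visual vocabulary size
word_probs = np.array([
    normalize(np.r_[np.full(10, 9.0), np.full(10, 1.0)], axis=0),
    normalize(np.r_[np.full(10, 1.0), np.full(10, 9.0)], axis=0),
])

def sample_images(labels, n_features=200):
    """Draw one visual-word histogram per label."""
    return np.stack([rng.multinomial(n_features, word_probs[c]) for c in labels])

# Training images: the label stands for the tag used in the first retrieval.
train_labels = np.repeat([0, 1], 30)
test_labels = np.repeat([0, 1], 10)
train = sample_images(train_labels)
test = sample_images(test_labels)

# Soft initialization of P(z|d) from the retrieval tag (assumption),
# so that topic k lines up with category k.
init = np.full((len(train_labels), 2), 0.1)
init[np.arange(len(train_labels)), train_labels] = 0.9
_, p_w_z = plsa_em(train, init)

# Classify test images by fold-in: P(w|z) fixed, only P(z|d) is estimated.
uniform = np.full((len(test_labels), 2), 0.5)
p_z_test, _ = plsa_em(test, uniform, p_w_z=p_w_z)
pred = p_z_test.argmax(axis=1)

# Precision and recall for category 0, plus overall accuracy.
tp = np.sum((pred == 0) & (test_labels == 0))
precision = tp / max(np.sum(pred == 0), 1)
recall = tp / max(np.sum(test_labels == 0), 1)
accuracy = (pred == test_labels).mean()
print(f"precision={precision:.2f} recall={recall:.2f} accuracy={accuracy:.2f}")
```

Initializing P(z|d) from the retrieval tag is one simple way to tie latent topics to categories. A fully unsupervised PLSA would instead need a separate step that maps learned topics to category labels.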
