DOI QR코드

DOI QR Code

A Novel Video Image Text Detection Method

  • Zhou, Lin (Zhengzhou Information Science and Technology Institute) ;
  • Ping, Xijian (Zhengzhou Information Science and Technology Institute) ;
  • Gao, Haolin (Zhengzhou Information Science and Technology Institute) ;
  • Xu, Sen (Yancheng Institute of Technology)
  • Received : 2011.11.14
  • Accepted : 2012.03.05
  • Published : 2012.03.30

Abstract

A novel and universal method of video image text detection is proposed. A coarse-to-fine text detection method is implemented. Firstly, the spectral clustering (SC) method is adopted to coarsely detect text regions based on the stationary wavelet transform (SWT). In order to make full use of the information, multi-parameters kernel function which combining the features similarity information and spatial adjacency information is employed in the SC method. Secondly, 28 dimension classifying features are proposed and support vector machine (SVM) is implemented to classify text regions with non-text regions. Experimental results on video images show the encouraging performance of the proposed algorithm and classifying features.

Keywords

References

  1. Q.X. Ye, Q. M. Huang, W. Gao and D. B. Zhao, "Fast and robust text detection in images and video frames," Image and Vision Computing, vol.23, no.6, pp.565-576, Jun.2005. https://doi.org/10.1016/j.imavis.2005.01.004
  2. M. Wang, X. S. Hua, R. Hong, J. H. Tang, G. J. Qi and Y. Song, "Unified video annotation via multigraph learning," IEEE Transaction on Circuits and Systems for Video Technology, vol.19, no.5, pp.733-746, May.2009. https://doi.org/10.1109/TCSVT.2009.2017400
  3. M. Wang, X. S. Hua, J. H. Tang and R. Hong, "Beyond distance measurement: constructing neighborhood similarity for video annotation," IEEE Transaction on Multimedia, vol.11, no.3, pp.465-476, Apr.2009. https://doi.org/10.1109/TMM.2009.2012919
  4. M. R. Naphade and J. R. Smith, "On the detection of semantic concepts at TRECVID," in Proc. of. ACM Multimedia, pp.660-667, Oct.2004.
  5. H. S. Lee, S. J. Hong and E. Kim, "Probabilistic background subtraction in a video-based recognition system," KSII Transaction on Internet and Information Systems, vol.5, no.4, pp.782-804, Apr.2011.
  6. N. M. Oliver, B. Rosario and A. Pentland, "A Bayesian computer vision system for modeling human interactions," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, no.8, pp.831-843, Aug.2000. https://doi.org/10.1109/34.868684
  7. J. C. Nascimento and J. S. Marques, "Performance evaluation of object detection algorithms for video surveillance," IEEE Transactions on Multimedia, vol.8, no.4, pp.761-774, Aug.2006. https://doi.org/10.1109/TMM.2006.876287
  8. S.C. Pei and Y.T. Chuang, "Automatic text detection using multi-layer color quantization in complex color images," in Proc. of the IEEE International Conference on Multimedia and Expo, pp.619-622, Jun.2004.
  9. P. Shivakumara, T. Q. Phan and C. L. Tan, "Video text detection based on filters and edge features," in Proc. of the IEEE International Conference on Multimedia and Expo, pp.514-517, Jul.2009.
  10. C. M. Liu, C. H. Wang and R. W. Dai, "Text detection in images based on unsupervised classification of edge-based features," in Proc. of the 8th International Conference on Document Analysis and Recognition, pp.610-614, Sep.2005.
  11. P. Shivakumara, T. Q. Phan and C. L. Tan, "A robust wavelet transform based technique for video text detection," in Proc. of the 10th International Conference on Document Analysis and Recognition, pp.1285-1289, Jul.26-29, 2009.
  12. K. I. Kim, K. Jung and J. H. Kim, "Texture-based approach for text detection in images using support vector machines and continuous adaptive mean shift algorithm," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.25, no.12, pp.1631-1639, Dec.2003. https://doi.org/10.1109/TPAMI.2003.1251157
  13. Z. Ji, J. Wang, Y. T. Su, "Text detection in video frames using hybrid features," in Proc. of the 8th International Conference on Machine Learning and Cybernetics, pp.318-342, Jul.2009.
  14. Y. Song, A. N. Liu, L Pang, S. X. Lin, Y. D. Zhang and S. Tang, "A novel image text extraction method based on k-means clustering," in Proc. of the 7th IEEE/ACIS International Conference on Computer and Information Science, pp.318-342, Jul.2009.
  15. R. Guo, Y. H. Peng and H. L Wan, "Palmprint feature extraction and recognition based on stationary wavelet transform," Computer Engineering and Applications, vol.42, no.17, pp.62-65, Jul.2006.
  16. K. Ersahin, I. G. Cumming and R. K. Ward, "Segmentation and classification of polarimetric SAR data using spectral graph partitioning," IEEE Transaction on Geoscience and Remote Sensing, vol.48, no.1, pp.164-174, Jan.2010. https://doi.org/10.1109/TGRS.2009.2024303
  17. C. Alzate and J. A. K. Suykens, "Multiway spectral clustering with out-of-sample extensions through weighted kernel PCA," IEEE Transaction on Pattern Analysis and Machine Intelligence, vol.32, no.2, pp.335-347, Feb.2010. https://doi.org/10.1109/TPAMI.2008.292
  18. T. Sakai, A. Imiya, "Randomized algorithm of spectral clustering and image/video segmentation using a minority of pixels," in Proc. of the 12th IEEE International Conference of Computer Vision Workshops, pp.468-475, Sept.2009.
  19. N. Iam-on, T. Boongoen and S. Garrett, "Refining pairwise similarity matrix for cluster ensemble problem with cluster relations." in Proc. of the 7th IEEE International Conference on Discovery Science, vol.5255, pp.222-233, 2008.
  20. U. V. Luxburg, "A tutorial on spectral clustering," Journal of Statistics and Computing, vol.17, no.4, pp.395-416, Dec.2007. https://doi.org/10.1007/s11222-007-9033-z
  21. Q. X. Ye and Q. M. Huang, "A new text detection algorithm in images/video frames," Lecture Notes in Computer Science, vol.3332, pp.858-865, 2004.
  22. L. C. Jiao, X. R. Zhang, B. Hou, S. Wang and F. Liu, "Intelligent SAR image processing and interpretation," Science Press, 2008.
  23. J. B. Shi and J. Malik, "Normalized cuts and image segmentation," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, no.8, pp.888-905, 2000. https://doi.org/10.1109/34.868688
  24. C. Yang, X. R. Zhang and L. C Jiao, "Self-tuning semi-supervised spectral clustering," in Proc. of the IEEE International Conference on Computational Intelligence and Security, pp.1-5, Dec.2008.
  25. C. Fowlkes, S. Belongie, F. Chung and L. Malik, "Spectral grouping using the Nyström method," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.26, no.2, pp.214-225, Feb. 2004. https://doi.org/10.1109/TPAMI.2004.1262185
  26. X. R. Zhang, L. C. Jiao, F. Liu, L. F. Bo and M. G. Gong, "Spectral clustering ensemble applied to SAR image segmentation," IEEE Transaction on Geoscience and Remote Sensing, vol.46, no.7, pp.2126-2136, Jul.2008. https://doi.org/10.1109/TGRS.2008.918647
  27. V. Vladimir, "The nature of statistical learning theory," Springer-Verlag, 1995.
  28. X. Q. Liu and J. Samarabandu, "Multiscale edge-based text extraction from complex images," in Proc. of the IEEE International Conference on Multimedia and Expo, pp.1721-7124, Jul.2006.
  29. E. K. Wong and M. Y. Chen, "A new robust algorithm for video text extraction", Pattern Recognition, vol.36, pp.1397-1406, 2003. https://doi.org/10.1016/S0031-3203(02)00230-3
  30. T. X. Zhao, G. M. Sun, C. Zhang and D. M. Chen, "Study on video text processing," in Proc. of the IEEE International Symposium on Industrial Electronics, pp.1215-1218, Jun.2008.
  31. Z. Wang and Z. Q. Wei, "A comparative study of feature selection for SVM in video text detection," in Proc. of the 2th International Symposium on Intelligence and Design, pp.552-556, Dec.2009.