DOI QR코드

DOI QR Code

Correction of Signboard Distortion by Vertical Stroke Estimation

  • Lim, Jun Sik (School of Electronics & Computer Engineering, Chonnam National University) ;
  • Na, In Seop (School of Electronics & Computer Engineering, Chonnam National University) ;
  • Kim, Soo Hyung (School of Electronics & Computer Engineering, Chonnam National University)
  • Received : 2013.02.14
  • Accepted : 2013.05.20
  • Published : 2013.09.30

Abstract

In this paper, we propose a preprocessing method that it is to correct the distortion of text area in Korean signboard images as a preprocessing step to improve character recognition. Distorted perspective in recognizing of Korean signboard text may cause of the low recognition rate. The proposed method consists of four main steps and eight sub-steps: main step consists of potential vertical components detection, vertical components detection, text-boundary estimation and distortion correction. First, potential vertical line components detection consists of four steps, including edge detection for each connected component, pixel distance normalization in the edge, dominant-point detection in the edge and removal of horizontal components. Second, vertical line components detection is composed of removal of diagonal components and extraction of vertical line components. Third, the outline estimation step is composed of the left and right boundary line detection. Finally, distortion of the text image is corrected by bilinear transformation based on the estimated outline. We compared the changes in recognition rates of OCR before and after applying the proposed algorithm. The recognition rate of the distortion corrected signboard images is 29.63% and 21.9% higher at the character and the text unit than those of the original images.

Keywords

References

  1. O. Bretscher, "Linear Algebra With Applications," Prentice Hall, (1997)
  2. P. Clark and M. Mirmehdi, "Recognizing text in real scenes," International Journal on Document Analysis and Recognition (IJDAR), 4(4) (2004), 243-257. http://dx.doi.org/doi:10.1007/s10032-001-0072-2.
  3. P. Clark and M. Mirmehdi, "On the Recovery of Oriented Documents from Single Images," in Proc. of 4th IEEE Conf. on Advanced Concepts for Intelligent Vision Systems, (2001), 190-197.
  4. C. Li, X. Ding, and Y. Wu, "Automatic text location in natural scene images," in Proc. of International Conference on Document Analysis and Recognition, (2001), 1069-1073. http://dx.doi.org/doi:10.1109/ICDAR.2001.953950.
  5. J. H. Park, G. S. Lee, J. S. Lim, S. H. Kim, H. J. Yang, M. H. Lee and S. T. Hwang, "Automatic detection and recognition of Korean text in outdoor signboard images," Pattern Recognition Letters, 31 (2010), 1728-1739. http://dx.doi.org/doi:10.1016/j.patrec.2010.05.024.
  6. P. Pan, Y. Zhu, J. Sun, S. Naoi, Recognizing Characters with Severe Perspective Distortion Using Hash Tables and Perspective Invariants, in Proc. of 11th International Conference on Document Analysis and Recognition, 548-552, (2011). http://dx.doi.org/doi:10.1109/ICDAR.2011.116.
  7. L. Li, C. L. Tan, "Recognizing Planar Symbols with Severe Perspective Deformation," Pattern Analysis and Machine Intelligence, 32(4) (2010), 755-762. http://dx.doi.org/doi:10.1109/TPAMI.2009.196.
  8. G. K. Myers, R. C. Bolles, Q. T. Luong, J. A. Herson and H. B. Aradhye, "Rectification and recognition of text in 3-d scenes," International Journal on Document Analysis and Recognition (IJDAR), 7(2-3) (2004), 147-158. http://dx.doi.org/doi:10.1007/s10032-004-0133-4.
  9. R. Woods, R. Gonzalez and S. Eddins., "Digital image processing using MATLAB," Prentice Hall, (2004)
  10. X. C. Yin, H. W. Hao, J. Sun, S. Naoi, "Robust Vanishing Point Detection for Mobile Cam-Based Documents," in Proc. of 11th International Conference on Document Analysis and Recognition, (2011), 136-140. http://dx.doi.org/doi:10.1109/ICDAR.2011.36.
  11. Biyi Lin, Yide Wang, Bruno Feuvrie and Qingyun Dai, "A Novel Power Amplifier Linearization Technique based on Pre-distortion Principle and Wiener model," International Journal of Advanced Science and Technology, 22, 59 (2010).
  12. S. Lu and C. L. Tan, "Camera text recognition based on perspective invariants," in Proc. of 6th International Conference on Pattern Recognition, (2006) 2, 1042-1045; Hong Kong, China. http://dx.doi.org/doi:10.1109/ICPR.2006.351.
  13. G. K. Myers, R. C. Bolles, Q. T. Luong, J., "Rectification and recognition of text in 3-d scenes," Int. J. Doc. Anal. Recog.,7, 147-158, (2005). http://dx.doi.org/doi:10.1007/s10032-004-0133-4.
  14. C. M. Thillou, B. Gosselin, Natural Scene Text Understanding, "Vision Systems: Segmentation and Pattern Recognition," I-Tech, Vienna (2007), pp.307-333.
  15. Y. Watanabe, K. Sono, K. Yokomizo, and Y. Okada, Translation camera on mobile phone, in Proc. of 2003 International Conference on Multimedia and Expo, (2003) July 2, 177-180; Baltimore, USA. http://doi.ieeecomputersociety.org/10.1109/ICME.2003.1221582.
  16. Shijian Lu, Ben M. Chen, Chi Chung Ko," Perspective rectification of document images using fuzzy set and morphological operations", Image Vision Comput. 23(5) (2005),541-553. http://dx.doi.org/doi:10.1016/j.imavis.2005.01.003 .

Cited by

  1. Triangulation Based Skeletonization and Trajectory Recovery for Handwritten Character Patterns vol.9, pp.1, 2013, https://doi.org/10.3837/tiis.2015.01.022