DOI QR코드

DOI QR Code

Development an Android based OCR Application for Hangul Food Menu

한글 음식 메뉴 인식을 위한 OCR 기반 어플리케이션 개발

  • Lee, Gyu-Cheol (Department of Electronic Engineering, Kwangwoon University) ;
  • Yoo, Jisang (Department of Electronic Engineering, Kwangwoon University)
  • Received : 2016.12.27
  • Accepted : 2017.02.14
  • Published : 2017.05.31

Abstract

In this paper, we design and implement an Android-based Hangul food menu recognition application that recognizes characters from images captured by a smart phone. Optical Character Recognition (OCR) technology is divided into preprocessing, recognition and post-processing. In the preprocessing process, the characters are extracted using Maximally Stable Extremal Regions (MSER). In recognition process, Tesseract-OCR, a free OCR engine, is used to recognize characters. In the post-processing process, the wrong result is corrected by using the dictionary DB for the food menu. In order to evaluate the performance of the proposed method, experiments were conducted to compare the recognition performance using the actual menu plate as the DB. The recognition rate measurement experiment with OCR Instantly Free, Text Scanner and Text Fairy, which is a character recognizing application in Google Play Store, was conducted. The experimental results show that the proposed method shows an average recognition rate of 14.1% higher than other techniques.

본 논문에서는 스마트폰으로 음식 메뉴를 촬영한 영상으로부터 글자를 인식하는 안드로이드 기반의 한글 음식 메뉴 인식 어플리케이션을 설계하고 구현한다. Optical Character Recognition (OCR) 기술은 크게 전처리, 인식 그리고 후처리 과정으로 구분된다. 전처리 과정에서는 Maximally Stable Extremal Regions (MSER) 기법을 이용하여 글자를 추출한다. 인식 과정에서는 무료 OCR 엔진인 Tesseract-OCR을 이용하여 글자를 인식한다. 후처리 과정에서는 음식 메뉴에 대한 사전 DB를 이용하여 잘못된 결과를 수정한다. 제안하는 기법의 성능을 평가하기 위해 실제 메뉴판을 DB로 이용하여 인식 성능을 비교 측정하는 실험을 진행하였다. 구글 플레이스토어에 있는 글자 인식 어플리케이션인 OCR Instantly Free, Text Scanner 그리고 Text Fairy와 인식률 측정 실험을 진행하였으며 실험 결과 제안하는 기법이 다른 기법보다 평균적으로 14.1% 높은 인식률을 보여주는 것을 확인하였다.

Keywords

References

  1. H. J. Jeon and Y. M. Kim, "The achievement of 14 million people tourists in 2014," Korea Culture & Tourism Institute, Tourgo focus, 2015.
  2. M. S. Park, D. H. Lee, J. H. Park, and J. K. Koo, "Analysis of economic impacts and implications of the increase in Chinese tourists visit," Korea Institute for Industrial Economics & Trade, Issue paper, 2014.
  3. E. K. Choi, "Activation measures to attract Chinese tourists," Korea Culture & Tourism Institute, Basic research, 2011.
  4. E. B. Go, Y. J. Ha, S. R. Choi, K. H. Lee, and Y. H. Park, "An implementation of an android mobile system for extracting and retrieving texts from images," Journal of Digital Contents Society, vol. 12, no. 1, pp. 57-67, Mar. 2011. https://doi.org/10.9728/dcs.2011.12.1.057
  5. R. Mithe, S. Indalkar, and N. Divekar, "Optical character recognition," International Journal of Recent Technology and Engineering, vol. 2, no. 1, pp. 72-75, Mar. 2013. https://doi.org/10.15623/ijret.2013.0201014
  6. M. H. Cho, "A study on character recognition using wavelet transformation and moment," Journal of The Korea Society of Computer and Information, vol. 15, no. 10, pp. 49-57, Oct. 2010. https://doi.org/10.9708/jksci.2010.15.10.049
  7. J. H. Roh and D. S. Choi, "Character recognition using a target string," Proceedings of Symposium of the Korean Institute of communications and Information Sciences, Jun. 2014.
  8. S. G. Hong, S. S Hwang, and S. D. Kim, "A license plate recognition system robust to vehicle location and viewing angle," Journal of the Institute of Electronics and Information Engineers, vol. 49, no. 12, pp. 113-123, Dec. 2012. https://doi.org/10.5573/ieek.2012.49.12.113
  9. J. H. Ju and J. S. Oh, "An adaptive binarization algorithm for degraded document images," Journal of Korean Institute of Communications and Information Sciences, vol. 37, no. 7, pp 581-585, Jul. 2012. https://doi.org/10.7840/KICS.2012.37.7A.581
  10. C. H. Son and H. M. Park, "Fast multiple-image-based deblurring method," Journal of the Institute of Electronics Engineers of Korea SP, vol. 49, no. 4, pp. 49-57, Jul. 2012.
  11. J. W. Song, N. R. Jung, and H. S. Kang, "Container BIC-code region extraction and recognition method using multiple thresholding," Journal of the Korea Institute of Information and Communication Engineering, vol. 19, no. 6, pp. 1462-1470, Jun. 2015. https://doi.org/10.6109/jkiice.2015.19.6.1462
  12. DioLense. Available: http://www.diotek.com/?page_id=8532.
  13. ABBYY. Available: https://www.abbyy.com.
  14. Google Translate. Available: https://translate.google.com/about/intl/en_ALL/.
  15. M. Donoser and H. Bischof "Efficient maximally stable extremal region (MSER) tracking," IEEE Computer Society Conference on Computer Vision and Pattern Recognition 2006, Jun. 2006.
  16. Tesseract-OCR. Available: https://github.com/tesseract-ocr.
  17. S. U. Kim, "An image denoising algorithm for the mobile phone cameras," Journal of the Korea Institute of Electronic Communication Sciences, vol. 9, no. 5, pp. 601- 608, May 2014. https://doi.org/10.13067/JKIECS.201.9.5.601
  18. J. W. Jeong and K. K. Jun, "Noise robust document image binarization using text region detection and down sampling," Journal of Korea Multimedia Society, vol. 18, no. 7, pp. 843-852, Jul. 2015. https://doi.org/10.9717/kmms.2015.18.7.843
  19. N. Otsu, "A threshold selection method from gray-level histograms," IEEE Transactions on Systems, vol. 9, no. 1, pp. 62-66, Jan. 1979.
  20. TheSimplest.Net. Available: https://play.google.com/store/apps/details?id=com.thesimplest.ocr&hl=ko%20Renard.
  21. Generic Co. Available: https://play.google.com/store/apps/details?id=co.kr.generic.ocr&hl=ko%20Renard.
  22. Text Fairy. Available: https://github.com/renard314/textfairy.

Cited by

  1. 맞춤형 여행 콘텐츠 개발을 위한 OCR 기법을 활용한 영화 속 촬영지 정보 추출 방안 제시 vol.5, pp.1, 2017, https://doi.org/10.36498/kbigdt.2020.5.1.29
  2. Design of Image Generation System for DCGAN-Based Kids' Book Text vol.16, pp.6, 2017, https://doi.org/10.3745/jips.02.0149