딥러닝을 이용한 일반 영상에서의 문자 인식

  • Published : 2015.01.28

Abstract

Keywords

References

  1. K. Jung, K. I. Kim and A. K. Jain, "Text information extraction in images and video : a survey", Pattern Recognition, vol. 37, no. 5, pp. 977-997, 2004 https://doi.org/10.1016/j.patcog.2003.10.012
  2. S. Singh, "Optical character recognition techniques : a survey", Journal of Emerging Trends in Computing and Information Sciences, vol. 4, no. 6, pp. 545-550, 2013
  3. C. Patel, A. Patel and D. Patel, "Optical chracter recognition by open source OCR tool Tesseract : a case study", International Journal of Computer Applications, vol. 55, no. 10, pp. 50-56, 2012 https://doi.org/10.5120/8794-2784
  4. C. Yao, X. Bai and W. Liu, "A unified framework for Multioriented text detection and recognition", IEEE Transactions on Image Processing, vol. 23, no. 11, pp. 4737-4749, 2014 https://doi.org/10.1109/TIP.2014.2353813
  5. Y. Bengio and Y. LeCun, "Scaling learning algorithms towards AI", Large-scale Kernel Machines 34, pp. 1-41, 2007
  6. A. Krizhevsky, I. Sutskever and G. E. Hinton, "ImageNet classification with deep convolutional neural networks", Advances in Neural Information Processing Systems 25, 2012, pp. 1097-1105
  7. I. J. Goodfellow, Y. Bulatov, J. Ibraz, S. Arnoud and V. Shet, "Multi-digit number recognition from street view imagery using deep convolutional neural networks", arXiv:1312.6082, 2014
  8. R. Girshick, J. Donahue, T. Darrell and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation", arXiv:1311.2524, 2013.
  9. A. Bissacco, M. Cummins, Y. Netzer and H. Neven, "PhotoOCR : reading text in uncontrolled conditions", in Proceedings of the IEEE Conference on Computer Vision, 2013, pp. 785-792
  10. K. Koray, P. Sermanet, Y. Boureau, K. Gregor, M. Mathieu and Y. LeCun, "Learning convolutional feature hierarchies for visual recognition", Advances in Neural Information Processing Systems 23, 2010, pp. 1090-1098
  11. K. Wang, B. Babenko and S. Belongie, "End-to-end scene text recognition", in Proceedings of the IEEE International Conference on Computer Vision, 2011, pp. 1457-1464
  12. T. Wang, D. J. Wu, A. Coates and A. Y. Ng, "End-to-end text recognition with convolutional neural networks", in Proceedings of the International Conference on Pattern Recognition, 2012, pp. 3304-3308
  13. O. Alsharif and J. Pineau, "End-to-end text recognition with hybrid HMM maxout models", arXiv:1310.1811, 2013
  14. D. E. Rumelhart and J. L. McClelland, "Parallel distributed processing: explorations in the microstructure of cognition", Cambridge: MIT Press, 1986
  15. 15. G. E. Hinton and R. R. Salakhutdinov, "Reducing the dimensionality of data with neural networks", Science, vol. 313, no. 5786, pp. 504-507, 2006 https://doi.org/10.1126/science.1127647
  16. P. Baldi and P. J. Sadowski "Understanding dropout", Advances in Neural Information Processing Systems 26, 2013, pp. 2814-2822
  17. H. Lee, A. Battle, R. Raina and A. Y. Ng, "Efficient sparse coding algorithms", Advances in Neural Information Processing Systems 19, 2007, pp. 584-592
  18. C. Szegedy, W. Liu., Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke and A. Rabinovich, "Going deeper with convolutions", arXiv:1409.4842, 2014
  19. X. Glorot, A. Bordes and Y. Bengio, "Deep sparse rectifier neural networks", in Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011, pp. 315-323
  20. V. Nair and G. E. Hinton, "Rectified linear units improve restricted Boltzmann machines", in Proceedings of International Conference on Machine Learning, 2010, pp. 807-814
  21. G. E. Hinton, N. Srivastava. A. Krizhevsky, I. Sutskever and R. R. Salakhutdinov, "Improving neural networks by preventing co-adaptation of feature detectors", arXiv:1207.0580, 2012
  22. A. Coates, B. Carpenter, C. Case, S. Satheesh, B. Suresh, T. Wang, D. J. Wu and A. Y. Ng, "Text detection and character recognition in scene images with unsupervised feature learning", in Proceedings of the International Conference on Document Analysis and Recognition, 2011, pp. 440-445
  23. A. Coates, H. Lee and A. Y. Ng, "An analysis of single-layer networks in unsupervised feature learning", AISTATS, 2011
  24. A. Neubeck, L. V. Gool, "Efficient non-maximal suppression", in Proceedings of the International Conference on Pattern Recognition, 2006, pp. 850-855