Fig. 1. KCR-AlexNet architecture.
Fig. 2. Inception module for GoogLeNet and KCR-GoogLeNet.
Fig. 3. KCR-GoogLeNet architecture.
Fig. 4. Example of transformed Input data from PHD08.
Fig. 5. Comparison between KCR-AlexNet and KCR-GoogLeNet. (a) E1-Set_1, (b) E1-Set_2, (c) E1-Set_3, (d) E1-Set_4, and (e) E1-Set_5.
Fig. 6. Comparison between KCR-AlexNet and KCR-GoogLeNet.
Table 1. KCR-GoogLeNet incarnation of the inception architecture
Table 2. PHD08 composition
Table 3. Five experimental data sets
Table 4. Test accuracies for the last training iteration and average times for a single iteration
Table 5. Used fonts for PHD08 and new data set
Table 6. Classification success rate comparison between KCR-AlexNet, KCR-GoogLeNet and other programs
References
- D. Cireşan, U. Meier, J. Masci, and J. Schmidhuber, "Multi-column deep neural network for traffic sign classification," Neural Networks, vol. 32, pp. 333-338, 2012. https://doi.org/10.1016/j.neunet.2012.02.023
- N. Kalchbrenner, E. Grefenstette, and P. Blunsom, "A convolutional neural network for modelling sentences," 2014 [Online]. Available: https://arxiv.org/abs/1404.2188.
- P. L. Callet, C. Viard-Gaudin, and D. Barba, "A convolutional neural network approach for objective video quality assessment," IEEE Transactions on Neural Networks, vol. 17, no. 5, pp. 1316-1327, 2006. https://doi.org/10.1109/TNN.2006.879766
- D. C. Ciresan, U. Meier, L. M. Gambardella, and J. Schmidhuber, "Convolutional neural network committees for handwritten character classification," in Proceedings of the International Conference on Document Analysis and Recognition (ICDAR), Beijing, China, 2011, pp. 1135-1139.
- T. Wang, D. J. Wu, A. Coates, and A. Y. Ng, "End-to-end text recognition with convolutional neural networks," in Proceedings of the 21st International Conference on Pattern Recognition(ICPR 2012), Tsukuba, Japan, 2012, pp. 3304-3308.
- Y. Zhang, "Deep convolutional network for handwritten Chinese character recognition," [Online]. Available: http://yuhao.im/files/Zhang_CNNChar.pdf.
- Z. Zhong, L. Jin, and Z. Xie, "High performance offline handwritten Chinese character recognition using GoogLeNet and directional feature map," in Proceedings of the 13th International Conference on Document Analysis and Recognition (ICDAR), Nancy, France, 2015, pp. 846-850.
- W. Yang, L. Jin, Z. Xie, and Z. Feng, "Improved deep convolutional neural network for online handwritten Chinese character recognition using domain-specific knowledge," in Proceedings of the 13th International Conference on Document Analysis and Recognition (ICDAR), Nancy, France, 2015, pp. 551-555.
- D. C. Hwang and S. S. Kim, "Hangul recognition using path following algorithm," IE Interfaces, vol. 3, no. 2, pp. 53-62, 1990.
- B. K. Sin, and J. H. Kim, "On-line handwritten character recognition with hidden Markov models," in Proceedings of the 4th Annual Conference on Human and Cognitive Language Technology, Seoul, Korea, 1992, pp. 533-542.
- J. K. Chung, S. I. Kim, and J. C. Namgung, "A study on an on-line handwritten Hangul character recognition by identifying relative positions of strokes," Journal of Information Technology Applications and Management, vol. 4, no. 2, pp. 65-78, 1997.
- J. Y. Ha, and B. K. Shin, "Optimization of number of states in HMM for on-line Hangul recognition," Proceeding of the Korea Information Science Society, vol. 25, no. 2, pp. 372-374, 1998.
- J. H. Lee, J. H. Ahn, and I. B. Lee, "Elastic curvature matching for online handwritten Hangul recognition," Proceeding of the Korea Information Science Society, vol. 35, no. 2, pp. 238-239, 2008.
- H. S. Cho, "A new feature-extraction method using wavelet transformation and fuzzy data for handwritten Hangul recognition," Journal of Korean Institute of Information Technology, vol. 3, no. 4, pp. 11-17, 2005.
- I. J. Kim and X. Xie, "Handwritten Hangul recognition using deep convolutional neural network," International Journal of Document Analysis and Recognition, vol. 18, no. 1, pp. 1-13, 2011. https://doi.org/10.1007/s10032-014-0229-4
- D. S. Ham, D. Y. Lee, I. S. Jung, and I. S. Oh, "Construction of printed Hangul character database PHD08," Journal of the Korea Contents Association, vol. 8, no. 11, pp. 33-40, 2008. https://doi.org/10.5392/JKCA.2008.8.11.033
- Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proceeding of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998. https://doi.org/10.1109/5.726791
- A. Krizhevsky, I. Sutskever and G. E. Hinton, "ImageNet classification with deep convolutional neural network," in Proceedings of the 25th International Conference on Neural Information Processing Systems (NIPS'12), Lake Tahoe, NV, 2012, pp. 1097-1105.
- ImageNet Large Scale Visual Recognition Challenge [Online]. Available: http://www.image-net.org/challenges/LSVRC/.
- C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, "Going deeper with convolutions," in Proceeding of the IEEE Conference on Computer Vison and Patter Recognition, Boston, MA, 2015, pp. 1-9.
- S. Arora, A. Bhaskara, R. Ge, and T. Ma, "Provable bounds for learning some deep representations," 2013 [Online]. Available: https://arxiv.org/abs/1310.6343.
- Linear interpolation [Online]. Available: https://en.wikipedia.org/wiki/Linear_interpolation.
- Caffe (convolutional architecture for fast feature embedding) [Online]. Available: http://caffe.berkeleyvision.org/.
- ABBYY FineReader 12 [Online]. Available: http://www.retia.co.kr/cnt/products/products.html?category=1&uid=24&name=finereader-12&tab=1.
- ABC OCR scanner app [Online]. Available: https://itunes.apple.com/us/app/scanner-ocr-optical-character/id777913435.
- Office Lens app [Online]. Available: https://itunes.apple.com/kr/app/office-lens/id975925059?mt=8.