DOI QR코드

DOI QR Code

Low Resolution Rate Face Recognition Based on Multi-scale CNN

  • Wang, Ji-Yuan (Dept. of Information and Communication Engineering, Tongmyong University) ;
  • Lee, Eung-Joo (Dept. of Information and Communication Engineering, Tongmyong University)
  • Received : 2018.10.08
  • Accepted : 2018.10.31
  • Published : 2018.12.31

Abstract

For the problem that the face image of surveillance video cannot be accurately identified due to the low resolution, this paper proposes a low resolution face recognition solution based on convolutional neural network model. Convolutional Neural Networks (CNN) model for multi-scale input The CNN model for multi-scale input is an improvement over the existing "two-step method" in which low-resolution images are up-sampled using a simple bi-cubic interpolation method. Then, the up sampled image and the high-resolution image are mixed as a model training sample. The CNN model learns the common feature space of the high- and low-resolution images, and then measures the feature similarity through the cosine distance. Finally, the recognition result is given. The experiments on the CMU PIE and Extended Yale B datasets show that the accuracy of the model is better than other comparison methods. Compared with the CMDA_BGE algorithm with the highest recognition rate, the accuracy rate is 2.5%~9.9%.

Keywords

MTMDCW_2018_v21n12_1467_f0001.png 이미지

Fig. 1. Multi-scale Convolutional Neural Network (MSCNN) algorithm training and testing process.

MTMDCW_2018_v21n12_1467_f0002.png 이미지

Fig. 2. Multi-scale convolutional neural network structure.

MTMDCW_2018_v21n12_1467_f0003.png 이미지

Fig. 3. Different resolution face recognition results on CMU PIE database.

Table 1. Layer parameter list

MTMDCW_2018_v21n12_1467_t0001.png 이미지

References

  1. W.K. Xu and E.J. Lee, "Human-computer Catural User Interface Based on Hand Motion Detection and Tracking," The Journal of Multimedia Information System, Vol. 15, No. 4, pp. 501-507, 2012.
  2. R.T. Collins, A.J. Lipton, and T. Kanade, A System for Video Surveillance and Monitoring, Vsam Final Report Carnegie Mellon University Technical Report, 2000.
  3. Siebel, T. Nils, and S.J. Maybank, "The Advisor Visual Surveillance System," Proceeding of European Conference on Computer Vision 2004 Workshop Applications of Computer Vision, pp. 103-111, 2004.
  4. C.F. Shu, A. Hampapur, M. Lu, L. Brown, J. Cannell, A. Senior, Y.L. Tian, et al., "IBM Smart Surveillance System (S3): a Open and Extensible Framework for Event based Surveillance," Proceeding of IEEE Conference on Advanced Video and Signal Based Surveillance, pp. 318-323, 2005.
  5. K. Simonyan and A. Zisserman, "Very Deep Convolutional Networks for Large-Scale Image Recognition," Computer Science, 2014.
  6. E. Ahmed, M. Jones, and T.K. Marks, "An Improved Deep Learning Architecture for Person Re-identification," Computer Vision and Pattern Recognition, pp. 3908-3916, 2015.
  7. K.M. He, X.Y. Zhang, S.Q. Ren, and J. Sun, "Deep Residual Learning for Image Recognition," arXiv, arXiv:1512.03385, 2015.
  8. K.T. Lim, H.W. Kang and J.K. lee,"Moving Shadow Detection using Deep Learning and Markov Random Field," journal of multimedia information system, Vol. 18, No. 12, pp. 1432- 1438, 2015
  9. G.E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R.R. Salakhutdinov, "Improving Neural Networks by Preventing Co-adaptation of Feature Detectors," Computer Science, Vol. 3, No. 4, pp. 212-223, 2012.
  10. L. Wan, M.D. Zeiler, S.X. Zhang, Y. Lecun, and R. Fergus, "Regularization of Neural Networks using DropConnect," Proceeding of International Conference on Machine Learning, pp. 1058-1066, 2013.
  11. R. Gross, I. Matthews, J. Cohn, T. Kanade, and S. Baker, "Multi-PIE," Image and Vision Computing, Vol. 28, No. 5, pp. 807-813, 2010. https://doi.org/10.1016/j.imavis.2009.08.002
  12. A.S. Georghiades, P.N. Belhumeur, and D.J. Kriegman, "From Few to Many: Illumination Cone Models for Face Recognition under Variable Lighting and Pose," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 12, No. 6, pp. 643-660, 2002.
  13. F. Rosenblatt, "The Perception: a Probabilistic Model for Information Storage and Organization in the Brain," American Psychological Association, Vol. 65, No. 6, pp. 386-408, 1958.

Cited by

  1. 딥러닝 기반 표고버섯 병해충 이미지 분석에 관한 연구 vol.23, pp.1, 2020, https://doi.org/10.9717/kmms.2020.23.1.050