A Multi-Scale Parallel Convolutional Neural Network Based Intelligent Human Identification Using Face Information

Li, Chen;Liang, Mengti;Song, Wei;Xiao, Ke;

doi:10.3745/JIPS.02.0103

Journal of Information Processing Systems

Volume 14 Issue 6
/
Pages.1494-1507
/
2018
/
1976-913X(pISSN)
/
2092-805X(eISSN)

Korea Information Processing Society (한국정보처리학회)

DOI QR Code

A Multi-Scale Parallel Convolutional Neural Network Based Intelligent Human Identification Using Face Information

Li, Chen (School of Computer Science, North China University of Technology) ;
Liang, Mengti (School of Computer Science, North China University of Technology) ;
Song, Wei (School of Computer Science, North China University of Technology) ;
Xiao, Ke (School of Computer Science, North China University of Technology)

Received : 2018.09.07
Accepted : 2018.10.22
Published : 2018.12.31

https://doi.org/10.3745/JIPS.02.0103 Citation PDF KSCI HTML

Download PDF

⟨ Previous Next ⟩

Abstract

Intelligent human identification using face information has been the research hotspot ranging from Internet of Things (IoT) application, intelligent self-service bank, intelligent surveillance to public safety and intelligent access control. Since 2D face images are usually captured from a long distance in an unconstrained environment, to fully exploit this advantage and make human recognition appropriate for wider intelligent applications with higher security and convenience, the key difficulties here include gray scale change caused by illumination variance, occlusion caused by glasses, hair or scarf, self-occlusion and deformation caused by pose or expression variation. To conquer these, many solutions have been proposed. However, most of them only improve recognition performance under one influence factor, which still cannot meet the real face recognition scenario. In this paper we propose a multi-scale parallel convolutional neural network architecture to extract deep robust facial features with high discriminative ability. Abundant experiments are conducted on CMU-PIE, extended FERET and AR database. And the experiment results show that the proposed algorithm exhibits excellent discriminative ability compared with other existing algorithms.

Keywords

E1JBB0_2018_v14n6_1494_f0001.png 이미지

Fig. 1. The structure of MP-CNN.

E1JBB0_2018_v14n6_1494_f0002.png 이미지

Fig. 2. Average pooling (a) and max pooling (b).

E1JBB0_2018_v14n6_1494_f0003.png 이미지

Fig. 3. 1-CNN structure.

E1JBB0_2018_v14n6_1494_f0004.png 이미지

Fig. 4. 4-CNN structure.

E1JBB0_2018_v14n6_1494_f0005.png 이미지

Fig. 5. Image examples of CMU-PIE face database.

E1JBB0_2018_v14n6_1494_f0006.png 이미지

Fig. 6. Experiment results comparison on CMU-PIE database.

E1JBB0_2018_v14n6_1494_f0007.png 이미지

Fig. 7. Image examples of the extended FERET face database.

E1JBB0_2018_v14n6_1494_f0008.png 이미지

Fig. 8. Experiment results comparison on the extended FERET database.

E1JBB0_2018_v14n6_1494_f0009.png 이미지

Fig. 9.The recognition performance of the five methods on expanded AR database are shown in Fig. 10.

E1JBB0_2018_v14n6_1494_f0010.png 이미지

Fig. 9. Image examples of the extended AR face database.

E1JBB0_2018_v14n6_1494_f0011.png 이미지

Fig. 10. Experiment results comparison on the extended AR database.

Table 1. The RANK1 recognition rates on CMU-PIE face database

E1JBB0_2018_v14n6_1494_t0001.png 이미지

Table 2. The RANK1 recognition rates on extended FERET face database

E1JBB0_2018_v14n6_1494_t0002.png 이미지

Table 3. The RANK1 recognition rates on the enhanced AR database

E1JBB0_2018_v14n6_1494_t0003.png 이미지

References

W. Song, G. Sun, S. Fong, and K. E. Cho, "A real-time infrared LED detection method for input signal positioning of interactive media," Journal of Convergence, vol. 7, article ID. 16071002, 2016.
P. J. Phillips, W. T. Scruggs, A. J. O'Toole, P. J. Flynn, K. W. Bowyer, C. L. Schott, and M. Sharpe, "FRVT 2006 and ICE 2006 large-scale experimental results," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 32, no. 5, pp. 831-846, 2010. https://doi.org/10.1109/TPAMI.2009.59
P. J. Grother, G. W. Quinn, and P. J. Phillips, "Report on the evaluation of 2D still-image face recognition algorithms," National Institute of Standards and Technology, NIST Interagency Report No. 7709, 2010.
A. Moeini, K. Faez, and H. Moeini, "Unconstrained pose-invariant face recognition by a triplet collaborative dictionary matrix," Pattern Recognition Letters, vol. 68, pp. 83-89, 2015. https://doi.org/10.1016/j.patrec.2015.08.012
K. Ramirez-Gutierrez, D. Cruz-Perez, J. Olivares-Mercado, M. Nakano-Miyatake, and H. Perez-Meana, "A face recognition algorithm using eigenphases and histogram equalization," International Journal of Computers, vol. 5, no. 1, pp. 34-41, 2011.
S. U. Khan, W. Y. Chai, C. S. See, and A. Khan, "X-ray image enhancement using a boundary division wiener filter and wavelet-based image fusion approach," Journal of Information Processing Systems, vol. 12, no. 1, pp. 35-45, 2016. https://doi.org/10.3745/JIPS.02.0029
Z. Nenadic, "Information discriminant analysis: Feature extraction with an information-theoretic objective," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 29, no. 8, pp. 1394-1407, 2007. https://doi.org/10.1109/TPAMI.2007.1156
L. Lei, D. H. Kim, W. J. Park, and S. J. Ko, "Face recognition using LBP eigenfaces," IEICE Transactions on Information and Systems, vol. 97, no. 7, pp. 1930-1932, 2014. https://doi.org/10.1587/transinf.e97.d.1930
J. Li, Y. Zhao, and D. Quan, "The combination of CSLBP and LBP feature for pedestrian detection," in Proceedings of 2013 3rd International Conference on Computer Science and Network Technology (ICCSNT), Dalian, China, 2013, pp. 543-546.
A. R. Rivera, J. R. Castillo, and O. O. Chae, "Local directional number pattern for face analysis: face and expression recognition," IEEE Transactions on Image Processing, vol. 22, no. 5, pp. 1740-1752, 2013. https://doi.org/10.1109/TIP.2012.2235848
A. R. Rivera and O. Chae, "Spatiotemporal directional number transitional graph for dynamic texture recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 37, no. 10, pp. 2146- 2152, 2015. https://doi.org/10.1109/TPAMI.2015.2392774
J. Ylioinas, A., Hadid, Y. Guo, and M. Pietikainen, "Efficient image appearance description using dense sampling based local binary patterns," in Computer Vision-ACCV 2012. Heidelberg: Springer, 2012, pp. 375-388.
C. Shan, S. Gong, and P. W. McOwan, "Facial expression recognition based on local binary patterns: a comprehensive study," Image and Vision Computing, vol. 27, no. 6, pp. 803-816, 2009. https://doi.org/10.1016/j.imavis.2008.08.005
X. Z. Liu and H. W. Ye, "Dual-kernel based 2D linear discriminant analysis for face recognition," Journal of Ambient Intelligence and Humanized Computing, vol. 6, no. 5, pp. 557-562, 2015. https://doi.org/10.1007/s12652-014-0230-2
X. Z. Liu, P. S. Wang, and G. C. Feng, "Kernel-based 2D fisher discriminant analysis with parameter optimization for face recognition," International Journal of Pattern Recognition and Artificial Intelligence, vol. 27, no. 8, article no. 1356010, 2013.
B. Zhang, Z. Mu, C. Li, and H. Zeng, "Robust classification for occluded ear via Gabor scale feature-based non-negative sparse representation," Optical Engineering, vol. 53, no. 6, article no. 061702, 2013.
L. Yuan, W. Liu, and Y. Li, "Non-negative dictionary based sparse representation classification for ear recognition with occlusion," Neurocomputing, vol. 171, pp. 540-550, 2016. https://doi.org/10.1016/j.neucom.2015.06.074
D. Ciresan, U. Meier, and J. Schmidhuber, "Multi-column deep neural networks for image classification," in Proceedings of 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, 2012, pp. 3642-3649.
P. Sermanet, K. Kavukcuoglu, S. Chintala, and Y. LeCun, "Pedestrian detection with unsupervised multistage feature learning," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, 2013, pp. 3626-3633.
T. Wang, D. J. Wu, A. Coates, and A. Y. Ng, "End-to-end text recognition with convolutional neural networks," in Proceedings of 2012 21st International Conference on Pattern Recognition (ICPR), Tsukuba, Japan, 2012, pp. 3304-3308.
P. Luo, X. Wang, and X. Tang, "Hierarchical face parsing via deep learning," in Proceedings of 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, 2012, pp. 2480-2487.
Y. Taigman, M. Yang, M. A. Ranzato, and L. Wolf, "DeepFace: closing the gap to human-level performance in face verification," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, 2014, pp. 1701-1708.
O. M. Parkhi, A. Vedaldi, and A. Zisserman, "Deep face recognition," in Proceedings of the British Machine Vision Conference (BMVC), Swansea, UK, 2015.
Y. Sun, D. Liang, X. Wang, and X. Tang, "DeepID3: face recognition with very deep neural networks," 2015 [Online]. Available: https://arxiv.org/abs/1502.00873.
F. Schroff, D. Kalenichenko, and J. Philbin, "FaceNet: a unified embedding for face recognition and clustering," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, 2015, pp. 815-823.
C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, "Going deeper with convolutions," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, 2015, pp. 1-9.

Journal of Information Processing Systems

A Multi-Scale Parallel Convolutional Neural Network Based Intelligent Human Identification Using Face Information

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)