DOI QR코드

DOI QR Code

Voting and Ensemble Schemes Based on CNN Models for Photo-Based Gender Prediction

  • Jhang, Kyoungson (Dept. of Computer Science and Engineering, Chungnam National University)
  • Received : 2019.08.27
  • Accepted : 2020.04.14
  • Published : 2020.08.31

Abstract

Gender prediction accuracy increases as convolutional neural network (CNN) architecture evolves. This paper compares voting and ensemble schemes to utilize the already trained five CNN models to further improve gender prediction accuracy. The majority voting usually requires odd-numbered models while the proposed softmax-based voting can utilize any number of models to improve accuracy. The ensemble of CNN models combined with one more fully-connected layer requires further tuning or training of the models combined. With experiments, it is observed that the voting or ensemble of CNN models leads to further improvement of gender prediction accuracy and that especially softmax-based voters always show better gender prediction accuracy than majority voters. Also, compared with softmax-based voters, ensemble models show a slightly better or similar accuracy with added training of the combined CNN models. Softmax-based voting can be a fast and efficient way to get better accuracy without further training since the selection of the top accuracy models among available CNN pre-trained models usually leads to similar accuracy to that of the corresponding ensemble models.

Keywords

References

  1. E. Eidinger, R. Enbar, and T. Hassner, "Age and gender estimation of unfiltered faces," IEEE Transactions on Information Forensics and Security, vol. 9, no. 12, pp. 2170-2179, 2014. https://doi.org/10.1109/TIFS.2014.2359646
  2. G. Levi and T. Hassner, "Age and gender classification using convolutional neural networks," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Boston, MA, 2015, pp. 34-42.
  3. A. Dehghan, E. G. Ortiz, G. Shu, and S. Z. Masood, "DAGER: deep age, gender and emotion recognition using convolutional neural network," 2017 [Online]. https://arxiv.org/abs/1702.04280.
  4. S. Lapuschkin, A. Binder, K. R. Muller, and W. Samek, "Understanding and comparing deep neural networks for age and gender classification," in Proceedings of the IEEE International Conference on Computer Vision Workshops, Venice, Italy, 2017, pp. 1629-1638.
  5. K. Zhang, C. Gao, L. Guo, M. Sun, X. Yuan, T. X. Han, Z. Zhao, and B. Li, "Age group and gender estimation in the wild with deep RoR architecture," IEEE Access, vol. 5, pp. 22492-22503, 2017. https://doi.org/10.1109/ACCESS.2017.2761849
  6. A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," Advances in Neural Information Processing Systems, vol. 25, pp. 1097-1105, 2012.
  7. Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and T. Darrel, "Caffe: convolutional architecture for fast feature embedding," in Proceedings of the 22nd ACM International Conference on Multimedia, Orlando, FL, 2014, pp. 675-678.
  8. C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, "Going deeper with convolutions," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, 2015, pp. 1-9.
  9. K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition," 2015 [Online]. Available: https://arxiv.org/abs/1409.1556.
  10. O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, et al., "ImageNet large scale visual recognition challenge," International Journal of Computer Vision, vol. 115, pp. 211-252, 2015. https://doi.org/10.1007/s11263-015-0816-y
  11. R. Rothe, R. Timofte, and L. Van Gool, "Deep expectation of real and apparent age from a single image without facial landmarks," International Journal of Computer Vision, vol. 126, pp. 144-157, 2018. https://doi.org/10.1007/s11263-016-0940-3
  12. Wikipedia, "softmas function," 2020 [Online]. Available: https://en.wikipedia.org/wiki/Softmax_function.
  13. Wikipedia, "Convolutional neural network,", 2020 [Online]. Available: https://en.wikipedia.org/wiki/Convolutional_neural_network.
  14. K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," 2015 [Online]. Available: https://arxiv.org/abs/1512.03385.
  15. A. Thomas, "An introduction to Global Average Pooling in convolutional neural networks," 2019 [Online]. Available: https://adventuresinmachinelearning.com/global-average-pooling-convolutional-neural-networks/.
  16. G. Huang, Z. Liu, L. Van Der Maaten, and K. Q. Weinberger, "Densely connected convolutional networks," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, 2017, pp. 4700-4708.
  17. Wikipedia, "Arg max," 2020 [Online]. Available: https://en.wikipedia.org/wiki/Arg_max.
  18. A. Singh, "A comprehensive guide to ensemble learning (with Python codes)," 2018 [Online]. Available: https://www.analyticsvidhya.com/blog/2018/06/comprehensive-guide-for-ensemble-models/.
  19. R. R. Picard and R. D. Cook, "Cross-validation of regression models," Journal of the American Statistical Association, vol. 79, no. 387, pp. 575-583, 1984. https://doi.org/10.1080/01621459.1984.10478083
  20. S. Arlot and A. Celisse, "A survey of cross-validation procedures for model selection," Statistics Surveys, vol. 4, pp. 40-79, 2010. https://doi.org/10.1214/09-SS054