DOI QR코드

DOI QR Code

Development of a Real-Time Automatic Passenger Counting System using Head Detection Based on Deep Learning

  • Kim, Hyunduk (Division of Automotive Technology, Daegu Gyeongbuk Institute of Science & Technology) ;
  • Sohn, Myoung-Kyu (Division of Automotive Technology, Daegu Gyeongbuk Institute of Science & Technology) ;
  • Lee, Sang-Heon (Division of Automotive Technology, Daegu Gyeongbuk Institute of Science & Technology)
  • Received : 2021.02.03
  • Accepted : 2022.01.08
  • Published : 2022.06.30

Abstract

A reliable automatic passenger counting (APC) system is a key point in transportation related to the efficient scheduling and management of transport routes. In this study, we introduce a lightweight head detection network using deep learning applicable to an embedded system. Currently, object detection algorithms using deep learning have been found to be successful. However, these algorithms essentially need a graphics processing unit (GPU) to make them performable in real-time. So, we modify a Tiny-YOLOv3 network using certain techniques to speed up the proposed network and to make it more accurate in a non-GPU environment. Finally, we introduce an APC system, which is performable in real-time on embedded systems, using the proposed head detection algorithm. We implement and test the proposed APC system on a Samsung ARTIK 710 board. The experimental results on three public head datasets reflect the detection accuracy and efficiency of the proposed head detection network against Tiny-YOLOv3. Moreover, to test the proposed APC system, we measured the accuracy and recognition speed by repeating 50 instances of entering and 50 instances of exiting. These experimental results showed 99% accuracy and a 0.041-second recognition speed despite the fact that only the CPU was used.

Keywords

References

  1. F. Zhang, T. Y. Wu, J. S. Pan, G. Ding, and Z. Li, "Human motion recognition based on SVM in VR art media interaction environment," Human-centric Computing and Information Sciences, vol. 9, article no. 40, 2019. https://doi.org/10.1186/s13673-019-0203-8
  2. H. F. Nweke, Y. W. The, G. Mujtaba, U. R. Alo, and M. A. Al-garadi, "Multi-sensor fusion based on multiple classifier systems for human activity identification," Human-centric Computing and Information Sciences, vol. 9, article no. 34, 2019. https://doi.org/10.1186/s13673-019-0194-5
  3. X. X. Wang an Y. Shen, "A video traffic flow detection system based on machine vision," Journal of Information Processing Systems, vol. 15, no. 5, pp. 1218-1230, 2019. https://doi.org/10.3745/jips.04.0140
  4. P. Lengvenis, R. Simutis, V. Vaitkus, and R. Maskeliunas, "Application of computer vision systems for passenger counting in public transport," Elektronika ir Elektrotechnika, vol. 19, no. 3, pp. 69-72, 2013.
  5. A. S. A. Nasir, N. K. A. Gharib, and H. Jaafar, "Automatic passenger counting system using image processing based on skin colour detection approach," in Proceedings of 2018 International Conference on Computational Approach in Smart Systems Design and Applications (ICASSDA), Kuching, Malaysia, 2018, pp. 1-8.
  6. N. Bernini, L. Bombini, M. Buzzoni, P. Cerri, and P. Grisleri, "An embedded system for counting passengers in public transportation vehicles," in Proceedings of 2014 IEEE/ASME 10th International Conference on Mechatronic and Embedded Systems and Applications (MESA), Senigallia, Italy, 2014, pp. 1-6.
  7. D. Lefloch, F. A. Cheikh, J. Y. Hardeberg, P. Gouton, and R. Picot-Clemente, "Real-time people counting system using a single video camera," in Proceedings of SPIE 6811: Real-Time Image Processing 2008. Bellingham, WA: International Society for Optics and Photonics, 2008, pp. 71-82.
  8. C. H. Chen, T. Y. Chen, D. J. Wang, and T. J. Chen, "A cost-effective people-counter for a crowd of moving people based on two-stage segmentation," Journal of Information Hiding and Multimedia Signal Processing, vol. 3, no. 1, pp. 12-23, 2012.
  9. M. Saqib, S. D. Khan, N. Sharma, and M. Blumenstein, "Person head detection in multiple scales using deep convolutional neural networks," in Proceedings of 2018 International Joint Conference on Neural Networks (IJCNN), Rio de Janeiro, Brazil, 2018, pp. 1-7.
  10. D. Cao, Z. Chen, and L. Gao, "An improved object detection algorithm based on multi-scaled and deformable convolutional neural networks," Human-centric Computing and Information Sciences, vol. 10, article no. 14, 2020. https://doi.org/10.1186/s13673-020-00219-9
  11. M. T. N. Truong and S. Kim, "A tracking-by-detection system for pedestrian tracking using deep learning technique and color information," Journal of Information Processing Systems, vol. 15, no. 4, pp. 1017-1028, 2019. https://doi.org/10.3745/JIPS.04.0132
  12. G. Chen, X. Cai, H. Han, S. Shan, and X. Chen, "HeadNet: pedestrian head detection utilizing body in context," in Proceedings of 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG), Xi'an, China, 2018, pp. 556-563.
  13. R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, 2014, pp. 580-587.
  14. R. Girshick, "Fast R-CNN," in Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 2015, pp. 1440-1448.
  15. S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: towards real-time object detection with region proposal networks," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 6, pp. 1137-1149, 2017. https://doi.org/10.1109/TPAMI.2016.2577031
  16. W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C. Y. Fu, and A. C. Berg, "SSD: single shot multibox detector," in Computer Vision - ECCV 2016. Cham, Switzerland: Springer, 2016, pp. 21-37.
  17. J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once: unified, real-time object detection," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, 2016, pp. 779-788.
  18. J. Redmon and A. Farhadi, "YOLO9000: better, faster, stronger," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, 2017, pp. 6517-6525.
  19. J. Redmon and A. Farhadi, "Yolov3: an incremental improvement," 2018 [Online]. Available: https://arxiv.org/abs/1804.02767.
  20. A. G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, M. Andreetto, and H. Adam, "MobileNets: efficient convolutional neural networks for mobile vision applications," 2017 [Online], Available: https://arxiv.org/abs/1704.04861.
  21. R. Huang, J. Pedoeem, and C. Chen, "YOLO-LITE: a real-time object detection algorithm optimized for nonGPU computers," in Proceedings of 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, 2018, pp. 2503-2510.
  22. J. F. Henriques, R. Caseiro, P. Martins, and J. Batista, "High-speed tracking with kernelized correlation filters," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 37, no. 3, pp. 583-596, 2014. https://doi.org/10.1109/TPAMI.2014.2345390
  23. T. H. Vu, A. Osokin, and I. Laptev, "Context-aware CNNs for person head detection," in Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 2015, pp. 2893-2901.
  24. D. Peng, Z. Sun, Z. Chen, Z. Cai, L. Xie, and L. Jin, "Detecting heads using feature refine net and cascaded multi-scale architecture," in Proceedings of 2018 24th International Conference on Pattern Recognition (ICPR), Beijing, China, 2018, pp. 2528-2533.
  25. S. Shao, Z. Zhao, B. Li, T. Xiao, G. Yu, X. Zhang, and J. Sun, "Crowdhuman: a benchmark for detecting human in a crowd," 2018 [Online]. Available: https://arxiv.org/abs/1805.00123.
  26. J. Redmon, "DarkNet: open source neural networks in C," 2013 [Online]. Available: https://pjreddie.com/darknet/.