DOI QR코드

DOI QR Code

Equipment and Worker Recognition of Construction Site with Vision Feature Detection

  • Qi, Shaowen (Department of Civil Engineering, Tongji University) ;
  • Shan, Jiazeng (Department of Civil Engineering, Tongji University) ;
  • Xu, Lei (Shanghai Construction No.1 (Group) Co., Ltd.)
  • Published : 2020.12.01

Abstract

This article comes up with a new method which is based on the visual characteristic of the objects and machine learning technology to achieve semi-automated recognition of the personnel, machine & materials of the construction sites. Balancing the real-time performance and accuracy, using Faster RCNN (Faster Region-based Convolutional Neural Networks) with transfer learning method appears to be a rational choice. After fine-tuning an ImageNet pre-trained Faster RCNN and testing with it, the result shows that the precision ratio (mAP) has so far reached 67.62%, while the recall ratio (AR) has reached 56.23%. In other word, this recognizing method has achieved rational performance. Further inference with the video of the construction of Huoshenshan Hospital also indicates preliminary success.

Keywords

References

  1. Bang, S., & Kim, H. (2020). Context-based information generation for managing UAV-acquired data using image captioning. Automation in Construction, 112, 103116. https://doi.org/10.1016/j.autcon.2020.103116
  2. Fan, L., Zhao, H., Zhao, H., Hu, H., & Wang, Z. (2020). Survey of target detection based on deep convolutional neural networks. Optics and Precision Engineering, 28(05), 1152-1164.
  3. He, K., Zhang, X., Ren, S., & Sun, J. (2015). Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE transactions on pattern analysis and machine intelligence, 37(9), 1904-1916. https://doi.org/10.1109/TPAMI.2015.2389824
  4. Lim, R. S., La, H. M., & Sheng, W. (2014). A robotic crack inspection and mapping system for bridge deck maintenance. IEEE Transactions on Automation Science and Engineering, 11(2), 367-378. https://doi.org/10.1109/TASE.2013.2294687
  5. Lv, W., Zhong, Z., & Zhang, W. (2018). Overview of AI Technology. Journal of Shanghai Electric Technology, 11(01), 62-64.
  6. Rumelhart, D. E., Hinton, G. E., & Williams, R. J. (1986). Learning representations by back-propagating errors. nature, 323(6088), 533-536. https://doi.org/10.1038/323533a0
  7. Samuel, A. L. (1959). Some studies in machine learning using the game of checkers. IBM Journal of research and development, 3(3), 210-229. https://doi.org/10.1147/rd.33.0210
  8. Sigmund, O., & Maute, K. (2013). Topology optimization approaches. Structural and Multidisciplinary Optimization, 48(6), 1031-1055. https://doi.org/10.1007/s00158-013-0978-6
  9. Wang, Q., & Chen, Q. (2009). Method of Real Time Supervisory Control Filling and Rolling Quality Construction in the Dam by GPS. Journal of Wuhan University of Technology, 31(08), 79-82+108.
  10. Chakraborty, D., Kovvali, N., Chakraborty, B., Papandreou-Suppappola, A., & Chattopadhyay, A. (2011). Structural damage detection with insufficient data using transfer learning techniques. Paper presented at the Sensors and Smart Structures Technologies for Civil, Mechanical, and Aerospace Systems 2011.
  11. Dutta, A., & Zisserman, A. (2019). The VIA annotation software for images, audio and video. Paper presented at the Proceedings of the 27th ACM International Conference on Multimedia.
  12. Girshick, R. (2015). Fast r-cnn. Paper presented at the Proceedings of the IEEE international conference on computer vision.
  13. Girshick, R., Donahue, J., Darrell, T., & Malik, J. (2014). Rich feature hierarchies for accurate object detection and semantic segmentation. Paper presented at the Proceedings of the IEEE conference on computer vision and pattern recognition.
  14. He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. Paper presented at the Proceedings of the IEEE conference on computer vision and pattern recognition.
  15. Ren, S., He, K., Girshick, R., & Sun, J. (2015). Faster r-cnn: Towards real-time object detection with region proposal networks. Paper presented at the Advances in neural information processing systems.