DOI QR코드

DOI QR Code

A Comparative Study on Performance of Deep Learning Models for Vision-based Concrete Crack Detection according to Model Types

영상기반 콘크리트 균열 탐지 딥러닝 모델의 유형별 성능 비교

  • Kim, Byunghyun (Department of Civil Engineering, University of Seoul)) ;
  • Kim, Geonsoon (Institute for Industrial Technology, University of Seoul) ;
  • Jin, Soomin (Department of Civil Engineering, University of Seoul)) ;
  • Cho, Soojin (Department of Civil Engineering, University of Seoul))
  • 김병현 (서울시립대학교 토목공학과) ;
  • 김건순 (서울시립대학교 산업기술연구소) ;
  • 진수민 (서울시립대학교 토목공학과) ;
  • 조수진 (서울시립대학교 토목공학과)
  • Received : 2019.11.27
  • Accepted : 2019.12.12
  • Published : 2019.12.31

Abstract

In this study, various types of deep learning models that have been proposed recently are classified according to data input / output types and analyzed to find the deep learning model suitable for constructing a crack detection model. First the deep learning models are classified into image classification model, object segmentation model, object detection model, and instance segmentation model. ResNet-101, DeepLab V2, Faster R-CNN, and Mask R-CNN were selected as representative deep learning model of each type. For the comparison, ResNet-101 was implemented for all the types of deep learning model as a backbone network which serves as a main feature extractor. The four types of deep learning models were trained with 500 crack images taken from real concrete structures and collected from the Internet. The four types of deep learning models showed high accuracy above 94% during the training. Comparative evaluation was conducted using 40 images taken from real concrete structures. The performance of each type of deep learning model was measured using precision and recall. In the experimental result, Mask R-CNN, an instance segmentation deep learning model showed the highest precision and recall on crack detection. Qualitative analysis also shows that Mask R-CNN could detect crack shapes most similarly to the real crack shapes.

Keywords

References

  1. K. Jang et al., "Deep Learning-Based Autonomous Concrete Crack Evaluation through Hybrid Image Scanning", Structural Health Monitoring, Vol. 18, No. 5-6, pp. 1722-1737, 2019. https://doi.org/10.1177/1475921718821719
  2. C. V. Dung and D. A. Le, "Autonomous Concrete Crack Detection Using Deep Fully Convolutional Neural Network", Automation in Construction, Vol. 99, pp. 52-58, 2019. https://doi.org/10.1016/j.autcon.2018.11.028
  3. J. Schmidhuber, "Deep Learning in Neural Networks: An Overview", Neural Networks, Vol. 61, Elsevier Ltd, pp. 85-117, 2015. https://doi.org/10.1016/j.neunet.2014.09.003
  4. Y. LeCun et al., "Gradient-Based Learning Applied to Document Recognition", Proceedings of the IEEE, Vol. 86, No. 11, pp. 2278-2323, 1998. https://doi.org/10.1109/5.726791
  5. K. Fukushima, "Neocognitron: A Self-Organizing Neural Network Model for a Mechanism of Pattern Recognition Unaffected by Shift in Position", Biological Cybernetics, Vol. 36, No. 4, Springer-Verlag, pp. 193-202, 1980. https://doi.org/10.1007/BF00344251
  6. A. Krizhevsky et al., "ImageNet Classification with Deep Convolutional Neural Networks", Advances In Neural Information Processing Systems, pp. 1-9, 2012.
  7. K. Simonyan and A. Zisserman, "Very Deep Convolutional Networks for Large-Scale Image Recognition", International Conference on Learning Representations (ICRL), pp. 1-14, 2015.
  8. K. He et al., "Deep Residual Learning for Image Recognition", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, 2016.
  9. S. Ren et al., "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 39, No. 6, pp. 1137-1149, 2017. https://doi.org/10.1109/TPAMI.2016.2577031
  10. W. Liu et al., "SSD: Single Shot Multibox Detector", European Conference on Computer Vision, Vol. 9905 LNCS, pp. 21-37, 2016.
  11. J. Redmon et al., "You Only Look Once: Unified, Real-Time Object Detection", Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 779-788, 2016.
  12. V. Badrinarayanan et al., "SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 39, No. 12, pp. 2481-2495, 2017. https://doi.org/10.1109/TPAMI.2016.2644615
  13. L. C. Chen et al., "DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 40, No. 4, pp. 834-848, 2018. https://doi.org/10.1109/TPAMI.2017.2699184
  14. O. Ronneberger et al., "U-Net: Convolutional Networks for Biomedical Image Segmentation", International Conference on Medical Image Computing and Computer-Assisted Intervention, Vol. 9351, pp. 234-241, 2015.
  15. K. He et al., "Mask R-CNN", Proceedings of the IEEE International Conference on Computer Vision, pp. 2961-2969, 2017.
  16. T. Y. Lin et al., Microsoft COCO: Common Objects in Context, 2014.
  17. R. Girshick et al., "Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation", The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 580-587, 2014.
  18. J. R. R. Uijlings et al., "Selective Search for Object Recognition", International Journal of Computer Vision, Vol. 104, No. 2, Springer US, pp. 154-1571, 2013, doi: 10.1007/s11263-013-0620-5.
  19. R. Girshick, "Fast R-CNN", Proceedings of the IEEE International Conference on Computer Vision, pp. 1440-1448, 2015.
  20. J. Huang et al., "Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors", The IEEE Conference on Computer Vision and Pattern Recognition(CVPR), pp. 7310-7311, 2017.
  21. GitHub - Waspinator/Pycococreator: Helper Functions to Create COCO Datasets. https://github.com/waspinator/pycococreator. Accessed 8 Aug. 2019.