Acknowledgement
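This paper is the result of research carried out under the SW-Centered University Support Program of the Ministry of Science and ICT and the Institute for Information & Communications Technology Promotion (IITP).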
References
- S. Ren, K. He, R. Girshick, J. Sun, "Faster R-CNN: Towards Real-time Object Detection with Region Proposal Networks," In Advances in Neural Information Processing Systems, Vol. 28, pp. 91-99, 2015.
- J. Redmon, S. Divvala, R. Girshick, A. Farhadi, "You Only Look Once: Unified, Real-time Object Detection," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, 2016.
- W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.Y. Fu, A.C. Berg, "SSD: Single Shot MultiBox Detector," In European Conference on Computer Vision, Springer, Cham, pp. 21-37, 2016.
- J. Dai, Y. Li, K. He, J. Sun, "R-FCN: Object Detection via Region-based Fully Convolutional Networks," In Advances in Neural Information Processing Systems, pp. 379-387, 2016.
- J. Redmon, A. Farhadi, "YOLO9000: Better, Faster, Stronger," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7263-7271, 2017.
- T.Y. Lin, P. Dollar, R. Girshick, K. He, B. Hariharan, S. Belongie, "Feature Pyramid Networks for Object Detection," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117-2125, 2017.
- T.Y. Lin, P. Goyal, R. Girshick, K. He, P. Dollar, "Focal Loss for Dense Object Detection," In Proceedings of the IEEE International Conference on Computer Vision, pp. 2980-2988, 2017.
- K. He, G. Gkioxari, P. Dollar, R. Girshick, "Mask R-CNN," In Proceedings of the IEEE International Conference on Computer Vision, pp. 2961-2969, 2017.
- J. Redmon, A. Farhadi, "YOLOv3: An Incremental Improvement," arXiv preprint, arXiv:1804.02767, 2018.
- S. Zhang, L. Wen, X. Bian, Z. Lei, S.Z. Li, "Single-shot Refinement Neural Network for Object Detection," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4203-4212, 2018.
- Q. Zhao, T. Sheng, Y. Wang, Z. Tang, Y. Chen, L. Cai, H. Ling, "M2Det: A Single-shot Object Detector Based on Multi-level Feature Pyramid Network," In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33, No. 01, pp. 9259-9266, 2019.
- S. Liu, L. Qi, H. Qin, J. Shi, J. Jia, "Path Aggregation Network for Instance Segmentation," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8759-8768, 2018.
- D. Bolya, C. Zhou, F. Xiao, Y.J. Lee, "YOLACT: Real-time Instance Segmentation," In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9157-9166, 2019.
- B. Cheng, B. Xiao, J. Wang, H. Shi, T.S. Huang, L. Zhang, "HigherHRNet: Scale-aware Representation Learning for Bottom-up Human Pose Estimation," In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5386-5395, 2020.
- S. Jin, L. Xu, J. Xu, C. Wang, W. Liu, C. Qian, W. Ouyang, P. Luo, "Whole-body Human Pose Estimation in the Wild," In European Conference on Computer Vision, Springer, Cham, pp. 196-21, 2020.
- Z. Cao, G. Hidalgo, T. Simon, S.E. Wei, Y. Sheikh, "OpenPose: Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 43, No. 1, pp. 172-186, 2019.
- J. Wang, K. Sun, T. Cheng, B. Jiang, C. Deng, Y. Zhao, B. Xiao, "Deep High-resolution Representation Learning for Visual Recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020.
- L. Pishchulin, E. Insafutdinov, S. Tang, B. Andres, M. Andriluka, P.V. Gehler, B. Schiele, "DeepCut: Joint Subset Partition and Labeling for Multi Person Pose Estimation," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4929-4937, 2016.
- Z. Cao, T. Simon, S.E. Wei, Y. Sheikh, "Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7291-7299, 2017.
- G. Papandreou, T. Zhu, L.C. Chen, S. Gidaris, J. Tompson, K. Murphy, "PersonLab: Person Pose Estimation and Instance Segmentation with a Bottom-up, Part-based, Geometric Embedding Model," In Proceedings of the European Conference on Computer Vision (ECCV), pp. 269-286, 2018.
- R. Girshick, "Fast R-CNN," In Proceedings of the IEEE International Conference on Computer Vision, pp. 1440-1448, 2015.
- T.Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollar, C.L. Zitnick, "Microsoft COCO: Common Objects in Context," In European Conference on Computer Vision, pp. 740-755, 2014.
- A. Newell, Z. Huang, J. Deng, "Associative Embedding: End-to-end Learning for Joint Detection and Grouping," arXiv preprint, arXiv:1611.05424, 2016.
- https://github.com/open-mmlab/mmpose