References
- A.Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," in Proc. NIPS, 2012.
- S. Ren, K. He, R. Girshick, and J. Sun, "Faster r-cnn: Towards real-time object detection with region proposal networks," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 39, pp. 1137-1149, June 2017. https://doi.org/10.1109/TPAMI.2016.2577031
- J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once: Unified, real-time object detection," in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779-788, June 2016.
- W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, and A. C. Berg, "Ssd: Single shot multibox detector," in Computer Vision , pp. 21-37, 2016.
- Uijlings, J. RR et al.,"Selective search for object recognition," Int. journal of computer vision, pp.154-171., 2013
- R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation," in IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, June 2014.
- R. Girshick, "Fast r-cnn," in IEEE International Conference on Computer Vision (ICCV), pp. 1440-1448, Dec 2015.
- K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition," arXiv preprint arXiv:1409.1556, 2014.
- K. He, G. Gkioxari, P. Dollr, and R. Girshick, "Mask r-cnn," in IEEE International Conference on Computer Vision (ICCV), pp. 2980-2988, Oct 2017.
- T. Lin, P. Dollr, R. Girshick, K. He, B. Hariharan, and S. Belongie, "Feature pyramid networks for object detection," in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 936-944, July 2017.
- J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once: Unified, real-time object detection," in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779-788, June 2016.
- J. Redmon and A. Farhadi, "Yolo9000: Better, faster, stronger," in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6517-6525, July 2017.
- S. Ioffe and C. Szegedy, "Batch normalization: Accelerating deep network training by reducing internal covariate shift," arXiv preprint arXiv:1502.03167, 2015.
- J. Redmon et al, "YOLOv3: An Incremental Improvement," arXiv 1804.02767
- K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770-778, June 2016.
- A. Bochkovskiy, W. Chien-Yao, M. L. Hong-Yuan, "YOLOv4: Optimal Speed and Accuracy of Object Detection," arXiv:2004.10934v1, 2020.
- W. Chien-Yao Wang, M. L. Hong-Yuan, W. Yueh-Hua, C. Ping-Yang, H. Jun-Wei, and Y. I-Hau, "CSPNet:A new backbone that can enhance learning capability of cnn," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshop, 2020.
- S. Liu, L. Qi, Q. Haifang, S. Jianping, and J. Jiaya, "Path aggregation network for instance segmentation," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 8759-8768, 2018.
- S.Zhang, L.Wen, X.Bian, Z.Lei, and S.Z.Li, "Single-shot refinement neural network for object detection," in Proceedings of the IEEE Conference on Computer Vision, 2018
- H. Hu, J. Gu, Z. Zhang, J. Dai, and Y. Wei, "Relation networks for object detection," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3588-3597, 2018.
- X. Zhou, D. Wang, P. Krahenb, "Objects as Points," arXiv:1904.07850, 2019.
- M. Tan, R. Pang, Q. V. Le, "EfficientDet: Scalable and Efficient Object Detection," arXiv preprint arXiv:1911.09070v4, 2020.
- M. Tan, Q.V. Le, "EfficientNet : Rethinking Model Scaling for Convolutional Neural Networks, arXiv:1905.11946v5, 2019.
- H. Bilen, A. Vedaldi, "Universal representations : The missing link between faces, text, planktons, and cat breeds," arXiv:1701.07275 , 2017.
- X. Wang, Z. Cai, D. Gao, and N. Vasconcelos, "Towards universal object detection by domain attention," arXiv:1904.04402, 2019.