Acknowledgement
이 논문은 정부(과학기술정보통신부)의 재원으로 한국연구재단의 지원을 받아 수행된 연구임(No.2022-0-00817).
References
- I. Kostrikov, D. Yarats, and R. Fergus. "Image augmentation is all you need: Regularizing deep reinforcement learning from pixels," arXiv preprint arXiv:2004.13649, 2020.
- Z. Li, C. Zou, Y. Zhao, B. Li, and S. Zhong, "Improving human-object interaction detection via phrase learning and label composition," arXiv preprint arXiv:2112.07383, 2021.
- H. S. Fang, Y. Xie, D. Shao, Y. L. Li, and C. Lu, "DecAug: Augmenting HOI detection via decomposition," Proceedings of the AAAI Conference on Artificial Intelligence, Vol.35, No.2, pp.1300-1308, 2021.
- Y. W. Chao, Y. Liu, X. Liu, H. Zeng, and J. Deng, "Learning to detect human-object interactions," 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), IEEE, 2018.
- S. Gupta and J. Malik, "Visual semantic role labeling," arXiv preprint arXiv:1505.04474, 2015.
- B. Zhuang, Q. Wu, C. Shen, I. Reid, and A. Hengel, "HCVRD: A benchmark for large-scale human-centered visual relationship detection," Proceedings of the AAAI Conference on Artificial Intelligence, Vol.32, No.1, 2018.
- H. S. Fang, J. Sun, R. Wang, M. Gou, Y. L. Li, and C. Lu, "Instaboost: Boosting instance segmentation via probability map guided copy- pasting," Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019.
- T. Y. Lin et al., "Microsoft coco: Common objects in context," European Conference on Computer Vision, Springer, Cham, 2014.
- F. Z. Zhang, D. Campbell, and S. Gould, "Efficient two-stage detection of human-object interactions with a novel unary-pairwise transformer," Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022.
- G. Ghiasi et al., "Simple copy-paste is a strong data augmentation method for instance segmentation," Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021.
- S. Li, K. Gong, C. H. Liu, Y. Wang, F. Qiao, and X. Cheng, "Metasaug: Meta semantic augmentation for long-tailed visual recognition," Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021.
- F. Z. Zhang, D. Campbell, and S. Gould. "Spatially conditioned graphs for detecting human-object interactions," Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021.
- C. Shorten and T. M. Khoshgoftaar, "A survey on image data augmentation for deep learning," Journal of Big Data, Vol.6, No.1, pp.1-48, 2019. https://doi.org/10.1186/s40537-018-0162-3
- X. Zhu, W. Su, L. Lu, B. Li, X. Wang, and J. Dai, "Deformable detr: Deformable transformers for end-to-end object detection," arXiv preprint arXiv:2010.04159, 2020.
- Z. Li, C. Zou, Y. Zhao, B. Li, and S. Zhong, "Improving human-object interaction detection via phrase learning and label composition," Proceedings of the AAAI Conference on Artificial Intelligence, Vol.36, No.2, 2022.
- A. Zhang et al., "Mining the benefits of two-stage and one-stage hoi detection," Advances in Neural Information Processing Systems, Vol.34, pp.17209-17220, 2021.
- Y. W. Chao, Z. Wang, Y. He, J. Wang, and J. Deng, "Hico: A benchmark for recognizing human-object interactions in images," Proceedings of the IEEE International Conference on Computer Vision, 2015.