References
- R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation," in Proc. of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2014), pp. 580-587, 2014.
- R. Girshick, "Fast R-CNN," in Proc. of the International Conference on Computer Vision (ICCV 2015), pp. 1440-1448, 2015.
- S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: towards real-time object detection with region proposal networks," IEEE Trans. Pattern Anal. Mach. Intell., vol. 39, no. 6, pp. 1137-1149, 2017. DOI:10.1109/TPAMI.2016.2577031
- J. Johnson, R. Krishna, M. Stark, L.-J. Li, D. A. Shamma, M. S. Bernstein, and L. Fei-Fei, "Image retrieval using scene graphs," in Proc. of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2015), pp. 3668-3678, 2015.
- C. Galleguillos, A. Rabinovich, and S. Belongie, "Object categorization using co-occurrence, location and appearance," in Proc. of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2008), pp. 1-8, 2008. DOI:10.1109/CVPR.2008.4587799
- W. Choi, Y.-W. Chao, C. Pantofaru, and S. Savarese, "Understanding indoor scenes using 3D geometric phrases," in Proc. of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2013), pp. 33-40, 2013.
- C. Lu, R. Krishna, M. Bernstein, and L. Fei-Fei, "Visual relationship detection with language priors," in Proc. of the European Conference on Computer Vision (ECCV 2016), pp. 852-869, 2016.
- G. Gkioxari, R. Girshick, and J. Malik, "Contextual action recognition with R*CNN," in Proc. of the International Conference on Computer Vision (ICCV 2015), pp. 1080-1088, 2015.
- V. Ramanathan, C. Li, J. Deng, W. Han, Z. Li, K. Gu, Y. Song, S. Bengio, C. Rosenberg, and L. Fei-Fei, "Learning semantic relationships for better action retrieval in images," in Proc. of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2015), pp. 1100-1109, 2015.
- B. Dai, Y. Zhang, and D. Lin, "Detecting visual relationships with deep relational networks," in Proc. of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), pp. 3076-3086, 2017.
- L.-C. Chen, A. G. Schwing, A. L. Yuille, and R. Urtasun, "Learning deep structured models," in Proc. of the International Conference on Machine Learning (ICML 2015), pp. 1785-1794, 2015.
- H. Wang, X. Shi, and D.-Y. Yeung, "Relational deep learning: a deep latent variable model for link prediction," in Proc. of the Thirty-First AAAI Conference on Artificial Intelligence (AAAI-17), pp. 2688-2694, 2017.
- M. Long, Z. Cao, J. Wang, and P. S. Yu, "Learning multiple tasks with multilinear relationship networks," in Proc. of the Thirty-first Annual Conference on Neural Information Processing Systems (NIPS 2017), pp. 1593-1602, 2017.
- J. R. R. Uijlings, K. E. A. van de Sande, T. Gevers, and A. W. M. Smeulders, "Selective search for object recognition," Int. J. Comput. Vis., vol. 104, no. 2, pp. 154-171, 2013. DOI:10.1007/s11263-013-0620-5
- K. Mao, M. Harman, and Y. Jia, "Sapienz: multi-objective automated testing for Android applications," in Proc. of the International Symposium on Software Testing and Analysis (ISSTA 2016), pp. 94-105, 2016.
- A. Bordes, N. Usunier, A. Garcia-Duran, J. Weston, and O. Yakhnenko, "Translating embeddings for modeling multi-relational data," in Proc. of the Twenty-seventh Conference on Neural Information Processing Systems (NIPS 2013), pp. 2787-2795, 2013.
- H. Zhang, Z. Kyaw, S.-F. Chang, and T.-S. Chua, "Visual translation embedding network for visual relation detection," in Proc. of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), pp. 5532-5540, 2017.