Fig. 1. An Example of a Scene Graph
Fig. 2. 3D Scene Graph Generation
Fig. 3. 3D Scene Graph Generation Model
Fig. 4. Attribute Prediction Network (AttNet)
Fig. 5. Transfer Network (TransNet)
Fig. 6. Storing Object Information in Object Memory
Fig. 7. 3D Intersection over Union (3D IoU)
Fig. 8. Relationship Recognition Network (RelNet)
Fig. 9. 3D Scene Graphs Generated by the Proposed Model
Table 1. Performance Analysis of AttNet
Table 2. Performance Analysis of TransNet
Table 3. Performance Analysis of RelNet
Table 4. Performance Analysis of the Overall Model