References
- A. Torralba and A. Oliva, "Depth estimation from image structure," IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 9, pp. 1226-1238, Sep. 2002. https://doi.org/10.1109/TPAMI.2002.1033214
- D. Hoiem, A. A. Efros, and M. Hebert, "Recovering surface layout from an image," Int. J. Comput. Vis., vol. 75, no. 1, pp. 151-172, Oct. 2007. https://doi.org/10.1007/s11263-006-0031-y
- K. Karsch, C. Liu, and S. B. Kang, "Depth transfer: depth extraction from video using non-parametric sampling," IEEE Trans. Pattern Anal. Mach. Intell., vol. 36, no. 11, pp. 2144-2158, Nov. 2014. https://doi.org/10.1109/TPAMI.2014.2316835
- D. Eigen, C. Puhrsch, and R. Fergus, "Depth map prediction from a single image using a multi-scale deep network," in Proc. Adv. Neural Inf. Process. Syst., Dec. 2014, pp. 2366-2374.
- F. Liu, C. Shen, and G. Lin, "Deep convolution neural fields for depth estimation from a single image," in Proc. IEEE Int. Conf. Comput. Vis. Pattern Recognit., Jun. 2015, pp. 5162-5170.
- D. Xu, E. Ricci, W. Ouyang, X. Wang, and N. Sebe, "Multi-scale continuous CRFs as sequential deep networks for monocular depth estimation," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., Jul. 2017, pp. 161-169.
- C. Godard, O. M. Aodha, and G. J. Brostow, "Unsupervised monocular depth estimation with left-right consistency," in Proc. IEEE Int. Conf. Comput. Vis. Pattern Recognit., Jun. 2017, pp. 6602-6611.
- C. Godard, O. M. Aodha, M. Firman, and G. Brostow, "Digging into self-supervised monocular depth estimation," in Proc. IEEE Int. Conf. Comput. Vis., Nov. 2019. pp. 3827-3837.
- H. Fu, M. Gong, C. Wang, K. Batmanghelich, and D. Tao, "Deep ordinal regression network for monocular depth estimation," in Proc. IEEE Int. Conf. Comput. Vis. Pattern Recognit., Jun. 2018, pp. 2002-2011.
- Y. Cao, T. Zhao, K. Xian, C. Shen, Z. Cao, and S. Xu, "Monocular depth estimation with augmented ordinal depth relationships," IEEE Trans. Circuits Syst. Video Technol., vol. 30, no. 8, pp. 2674-2682, Aug. 2020. https://doi.org/10.1109/tcsvt.2019.2929202
- Y. Gan, X. Xu, W. Sun, and L. Lin, "Monocular depth estimation with affinity, vertical pooling, and label enhancement," in Proc. Eur. Conf. Comput. Vis., Sep. 2018, pp. 232-247.
- J. H. Lee, M.-K. Han, D. W. Ko, and I. H. Suh, "From big to small: multi-scale local planar guidance for monocular depth estimation," 2019, arXiv:1907.10326. [Online]. Available: http://arxiv.org/abs/1907.10326.
- M. Song, S. Lim and W. Kim, "Monocular depth estimation using Laplacian pyramid-based depth residuals," IEEE Trans. Circuits Syst. Video Technol., vol. 31, no. 11, pp. 4381-4393, Nov. 2021. https://doi.org/10.1109/TCSVT.2021.3049869
- K. Xian, J. Zhang, O. Wang, L. Mai, Z. Lin, and Z. Cao, "Structure-guided ranking loss for single depth image prediction," in Proc. IEEE Int. Conf. Comput. Vis. Pattern Recognit, Jun. 2020, pp. 611-620.
- A. Dosovitskiy et al., "An image is worth 16x16 words: transformers for image recognition at scale," in Proc. Int. Conf. Learn. Represet., May 2021, pp. 1-12.
- S. F. Bhat, I. Alhashim, and P. Wonka, "AdaBins: depth estimation using adaptive bins," in Proc. IEEE Int. Conf. Comput. Vis. Pattern Recognit, Jun. 2021, pp. 4009-4018.
- A. Geiger, P. Lenz, C. Stiller, and R. Urtasun, "Vision meets robotics: The KITTI dataset," Int. J. Robot. Res., vol. 32, no. 11, pp. 1231-1237, Aug. 2013. https://doi.org/10.1177/0278364913491297
- M. Cordts, M. Omran, S. Ramos, T. Rehfeld, M. Enzweiler, R. Benenson, U. Franke, S. Roth, and B. Schiele, "The cityscapes dataset for semantic urban scene understanding," in Proc. IEEE Int. Conf. Comput. Vis. Pattern Recognit., Jun. 2016, pp. 3213-3223.
- N. Silberman, D. Hoiem, P. Kohli, and R. Fergus, "Indoor segmentation and support inference from RGBD images," in Proc. Eur. Conf. Comput. Vis. Oct. 2012, pp. 746-760.