Study on the estimation and representation of disparity map for stereo-based video compression/transmission systems

A Study on Disparity Map Estimation and Representation for Stereo-Based Video Compression/Transmission Systems

  • Bak Sungchul (Dept. of Computer Engineering, Kwangwoon University)
  • Namkung Jae-Chan (Dept. of Computer Engineering, Kwangwoon University)
  • Published: 2005.12.01

Abstract

This paper presents a new method for estimating and representing a disparity map for stereo-based video communication systems. Several pixel-based and block-based algorithms have been proposed to estimate the disparity map. While pixel-based algorithms can compute the disparity map with high accuracy, they require a large number of bits to represent the disparity information. Block-based algorithms reduce the bit rate, but at the cost of representation accuracy. In the proposed method, a block enclosing a distinct edge is divided into two regions, and the disparity of each region is set to that of a neighboring block. The algorithm uses accumulated histograms and a neural network to classify the type of each block. Experiments show that the proposed algorithm estimates and represents disparity maps more effectively than the conventional algorithms.

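This paper studies a method for estimating and representing disparity maps for stereo-based video compression and transmission systems. Conventionally, disparity maps for stereo image transmission have been computed either per pixel or per block. Pixel-based disparity estimation is accurate but generates many bits for transmission, whereas block-based disparity estimation reduces the amount of information at the cost of accuracy. This paper proposes a method that divides a block at an image edge into two regions and replaces the disparity of each region with that of a neighboring block, so that disparity at edges is represented more accurately while the amount of information stays nearly the same as in the block-based method. To classify the block type, the method uses a neural network that takes accumulated histograms as its features. Analysis of real images shows that the proposed algorithm is more effective than block-based disparity representation for images that contain many edge blocks.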
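As a rough illustration of the conventional block-based estimation that the abstract compares against, the sketch below assigns one disparity to each block by minimizing the sum of absolute differences (SAD) over horizontal shifts. It is a generic block-matching example, not the algorithm proposed in the paper: the function names `sad` and `block_disparity`, the block size, the search range, and the SAD cost are all assumptions made for illustration. The edge-block splitting and the neural-network classifier are not reproduced here.

```python
def sad(left, right, y0, x0, size, d):
    """Sum of absolute differences between a left-image block and the
    right-image block shifted d pixels to the left (assumed cost measure)."""
    total = 0
    for y in range(y0, y0 + size):
        for x in range(x0, x0 + size):
            total += abs(left[y][x] - right[y][x - d])
    return total


def block_disparity(left, right, size=4, max_disp=4):
    """Return one disparity per size-by-size block (hypothetical helper).

    Images are lists of rows of grey levels with identical dimensions.
    """
    height, width = len(left), len(left[0])
    disparities = []
    for y0 in range(0, height - size + 1, size):
        row = []
        for x0 in range(0, width - size + 1, size):
            # Only test shifts that keep the block inside the right image.
            candidates = range(0, min(max_disp, x0) + 1)
            best = min(candidates,
                       key=lambda d: sad(left, right, y0, x0, size, d))
            row.append(best)
        disparities.append(row)
    return disparities
```

A block-based map of this kind stores a single value per block, which is why it costs far fewer bits than a per-pixel map, and also why a block that straddles an object edge cannot represent both sides of the edge correctly.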
