Flexible GGOP prediction structure for multi-view video coding

  • Yoon, Jae-Won (Dept. of Electrical and Electronic Engineering, Yonsei University)
  • Seo, Jung-Dong (Dept. of Electrical and Electronic Engineering, Yonsei University)
  • Kim, Yong-Tae (Dept. of Electrical and Electronic Engineering, Yonsei University)
  • Park, Chang-Seob (KBS Broadcast Technical Research Institute)
  • Sohn, Kwang-Hoon (Dept. of Electrical and Electronic Engineering, Yonsei University)
  • Published: 2006.12.29

Abstract

This paper proposes a flexible GGOP (multi-view GOP) prediction structure that improves the coding efficiency of the reference software for multi-view video coding (MVC). The MVC reference software encodes multi-view video with a fixed spatio-temporal GGOP prediction structure. However, MVC performance depends on the position of the base view and on the number of B-pictures between an I-picture (or P-picture) and the following P-picture, so coding performance is affected by whether the prediction structure is adapted to the characteristics of the sequence. In the proposed method, the base view is chosen from the global disparities between adjacent views, and the number of B-pictures is set according to the camera arrangement, such as the baseline distance between cameras. The GGOP, which is the coding unit of multi-view video, is thus applied flexibly according to the sequence characteristics. Experimental results show that the proposed flexible GGOP prediction structure outperforms the reference software, reducing the number of coded bits by 7.1% compared with the existing prediction structure.
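The abstract gives the two decisions only in outline, so the following Python sketch is illustrative rather than the authors' procedure. It assumes that the base view is the view with the smallest total accumulated global disparity to all other views, and that the number of B-pictures falls as the camera baseline grows. The function names, the selection criterion, and the threshold values are all hypothetical.

```python
from itertools import accumulate


def choose_base_view(adjacent_disparities):
    """Return the index of the view with the smallest total disparity to all others.

    adjacent_disparities[i] is the global disparity (in pixels) between
    view i and view i + 1. The "smallest total disparity" criterion is an
    assumption; the abstract only says the base view depends on these values.
    """
    # Place the views on a line: view 0 at 0, view k at the sum of the first k disparities.
    positions = [0.0] + list(accumulate(abs(d) for d in adjacent_disparities))
    # Total disparity from each candidate view to every other view.
    totals = [sum(abs(p - q) for q in positions) for p in positions]
    return min(range(len(positions)), key=totals.__getitem__)


def num_b_pictures(baseline_cm, thresholds_cm=(10.0, 20.0)):
    """Hypothetical rule: closer cameras allow more B-pictures between anchor pictures.

    The thresholds and the returned counts are placeholders, not values from the paper.
    """
    near, far = thresholds_cm
    if baseline_cm <= near:
        return 2
    if baseline_cm <= far:
        return 1
    return 0
```

With five views and adjacent disparities `[10, 12, 30, 11]`, `choose_base_view` returns view 2, which sits closest to the other views in accumulated disparity.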
