DOI QR코드

DOI QR Code

Fast Macroblock Mode Selection Algorithm for B Frames in Multiview Video Coding

  • Yu, Mei (Faculty of Information Science and Engineering, Ningbo University) ;
  • He, Ping (Faculty of Information Science and Engineering, Ningbo University) ;
  • Peng, Zongju (Faculty of Information Science and Engineering, Ningbo University) ;
  • Zhang, Yun (Faculty of Information Science and Engineering, Ningbo University) ;
  • Si, Yuehou (Faculty of Information Science and Engineering, Ningbo University) ;
  • Jiang, Gangyi (Faculty of Information Science and Engineering, Ningbo University)
  • 투고 : 2010.11.29
  • 심사 : 2011.01.23
  • 발행 : 2011.02.28

초록

Intensive computational complexity is an obstacle of enabling multiview video coding for real-time applications. In this paper, we present a fast macroblock (MB) mode selection algorithm for B frames which are based on the computational complexity analyses between the MB mode selection and reference frame selection. Three strategies are proposed to reduce the coding complexity jointly. First, the temporal correlation of MB modes between current MB and its temporal corresponding MBs is utilized to reduce computational complexity in determining the optimal MB mode. Secondly, Lagrangian cost of SKIP mode is compared with that of Inter $16{\times}16$ modes to early terminate the mode selection process. Thirdly, reference frame correlation among different Inter modes is exploited to reduce the number of reference frames. Experimental results show that the proposed algorithm can promote the encoding speed by 3.71~7.22 times with 0.08dB PSNR degradation and 2.03% bitrate increase on average compared with the joint multiview video model.

키워드

참고문헌

  1. Pascal Frossard, Juan Carlos de Martin and Reha Civanlar, "Media Streaming with Network Diversity," in Proc. of the IEEE, vol. 96, no. 1, pp. 39-53, January 2008.
  2. Hsien-Po Shiang and Mihaela van der Schaar, "Distributed Resource Management in Multi-hop Cognitive Radio Networks for Delay Sensitive Transmission," IEEE Transactions on Vehicular Technology, vol. 58, no. 2, pp. 941-953, February 2009. https://doi.org/10.1109/TVT.2008.925308
  3. L. Zhou, X. Wang, W. Tu, G. Mutean and B. Geller, "Distributed Scheduling Scheme for Video Streaming over Multi-Channel Multi-Radio Multi-Hop Wireless Networks," IEEE Journal on Selected Areas in Communications, vol. 28, no. 3, pp. 409-419, April 2010. https://doi.org/10.1109/JSAC.2010.100412
  4. L. Zhou, B. Geller, B. Zheng, A. Wei and J. Cui, "System Scheduling for Multi-Description Video Streaming Over Wireless Multi-Hop Networks," IEEE Transactions on Broadcasting, vol. 55, no. 4, pp. 731-741, December 2009. https://doi.org/10.1109/TBC.2009.2032795
  5. Y.M. Feng, D.X. Li, K. Luo and M. Zhang, "Asymmetric bidirectional view synthesis for free viewpoint and three-dimensional video," IEEE Transactions on Consumer Electronics, vol. 55, no. 4, pp. 2349-2355, November 2009. https://doi.org/10.1109/TCE.2009.5373809
  6. K.-J. Oh et al., "Multi-view video and multi-channel audio broadcasting system," in Proc. of 3DTV-CON, no. 4379437, May 2007.
  7. P. Merkle, K Muller and T. Wiegand, "3D Video: Acquisition, Coding, and Display," IEEE Transactions on Consumer Electronics, vol. 56, no. 2, pp. 946-950, July 2010. https://doi.org/10.1109/TCE.2010.5506024
  8. P. Merkle et al., "Coding efficiency and complexity analysis of MVC prediction structures," in Proc. of 15th European Signal Conference, pp. 5-9, September 3-7, 2007.
  9. P. Merkle et al., "Efficient prediction structures for multi-view video coding," IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 11, pp. 1461-1473, November 2007. https://doi.org/10.1109/TCSVT.2007.903665
  10. Tae-Young Chunga, II-Lyong Junga, Kwanwoong Songa and Chang-Su Kim, "Multi-view video coding with view interpolation prediction for 2D camera arrays," Journal of Visual Communication and Image Representation, vol. 21, no. 5-6, pp. 474-486, July-August 2010. https://doi.org/10.1016/j.jvcir.2009.10.001
  11. Y. Zhang, M. Yu and G. Jiang, "New approach to multi-modal multi-view video coding," Chinese Journal of Electronics, vol. 18, no. 2, pp. 338-342, April 2009.
  12. P.K. Park and Y. S. Ho, "Prediction structure and quantization parameter selection for efficient multiview video coding," Optical Engineering, vol. 47, no. 4, 047401, April 2008. https://doi.org/10.1117/1.2909666
  13. Hung-Chih Lin, Wen-Hsiao Peng and Hsueh-Ming Hang, "Fast context-adaptive mode decision algorithm for scalable video coding with combined coarse-grain quality scalability (CGS) and temporal scalability," IEEE Transactions on Circuits and Systems for Video Technology, vol. 20, no. 5, pp. 732-748, May 2010. https://doi.org/10.1109/TCSVT.2010.2045832
  14. Chia-Hung Yeh et al., "Fast mode decision algorithm for scalable video coding using Bayesian theorem detection and Markov process," IEEE Transactions on Circuits and Systems for Video Technology, vol. 20, no. 4, pp. 563-574, April, 2010. https://doi.org/10.1109/TCSVT.2010.2041825
  15. L. Shen, Z. Liu, Z. Zhang and G. Wang, "An adaptive and fast multi-frame selection algorithm for H.264 video coding," IEEE Signal Processing Letters, vol. 14, no. 11, pp. 836-839, November 2007. https://doi.org/10.1109/LSP.2007.898343
  16. H. Nisar and T. S. Choi, "Multiple initial point prediction based search pattern selection for fast motion estimation," Pattern Recognition, vol. 42, no. 3, pp. 475-486, March 2009. https://doi.org/10.1016/j.patcog.2008.08.010
  17. Grecos and M. Yang, "A framework for fast mode decision in the H.264 video coding standard," Digital Signal Processing: A Review Journal, vol. 17, no. 3, pp. 652-664, May 2007. https://doi.org/10.1016/j.dsp.2005.11.005
  18. H. Wang, S. Kwong and C. W. Kok, "An efficient mode decision algorithm for H.264/AVC encoding optimization," IEEE Transactions on Multimedia, vol. 9, no. 4, pp. 882-888, May 2007. https://doi.org/10.1109/TMM.2007.893345
  19. Vetro, P. Pandit, H. Kimata and A. Smolic, "Joint multiview video model (JMVM) 7.0," Joint Video Team (JVT) of ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, JVT-Z207, January 2008.
  20. Liquan Shen et.al., "Selective disparity estimation and variable size motion estimation based on motion homogeneity for multi-view coding," IEEE Transactions on Broadcasting, vol. 55, no. 4, pp. 761-766, December 2009. https://doi.org/10.1109/TBC.2009.2030453
  21. X. Li, D. Zhao, S. Ma and W. Gao, "Fast disparity and motion estimation based on correlations for multiview video coding," IEEE Transactions on Consumer Electronics, vol. 54, no. 4, pp. 2037-2044, November 2008. https://doi.org/10.1109/TCE.2008.4711270
  22. Z. Peng, G. Jiang, M. Yu and Q. Dai, "Fast macroblock mode selection algorithm for multiview video coding," EURASIP Journal on Image and Video Processing, Article ID 393727, 2008.
  23. G. Cernigliaro et al., "Fast mode decision for multiview video coding based on depth maps," in Proc. of SPIE, vol. 7257, no. 72570N, January 2009.
  24. L. F. Ding et al., "Content-aware prediction algorithm with inter-view mode decision for multiview video coding," IEEE Transactions on Multimedia, vol. 10, no. 8, pp. 1553-1564, December 2008. https://doi.org/10.1109/TMM.2008.2007314
  25. Wei Zhu, Wei Jiang and Yaowu Chen, "A Fast Inter Mode Decision for Multiview Video Coding," in Proc. of International Conference on Information Engineering and Computer Science, pp. 1-4, December 2009.
  26. M. Yu, Z. Peng and G. Jiang. "Statistical analysis of macroblock mode selection in JMVM," JVT of ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, JVT-Y026, October 2007.
  27. Feldmann, M. Mueller, F. Zilly, R. Tanger, K. Mueller, A. Smolic, P. Kauff and T. Wiegand, "HHI test material for 3D video," ISO/IEC JTC1/SC29/WG11, M15413, April, 2008.
  28. L. Zitnick, S. B. Kang and M. Uyttendaele, "High-quality video view interpolation using a layered representation," ACM SIGGRAPH and ACM Transactions on Graphics, Los Angeles, CA, vol. 4, pp. 600-608, August 2004.
  29. M. Tanimoto, T. Fujii and N. Fukushima, "1D parallel test sequences for MPEG-FTV," ISO/IEC JTC1/SC29/WG11, M15378, April, 2008.