Fast Intra-mode Selection Algorithm for H.264/AVC Video Coding with Low Complexity

Fast Intra Mode Selection Method for Low-Complexity H.264/AVC

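  • 김종호 (Image Communication and Signal Processing Laboratory, Department of Electronics, Communications and Radio Engineering, Hanyang University);
  • 정제창 (Division of Electrical and Computer Engineering, Hanyang University)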
  • Published : 2005.11.01

Abstract

The emerging H.264/AVC video coding standard improves coding performance significantly by adopting many advanced techniques, but at the expense of greatly increased encoder complexity. In particular, intra prediction with rate-distortion optimization (RDO) examines all possible combinations of coding modes, which depend on the spatial directional correlation with adjacent blocks. There are 9 modes for $4\times4$ luma blocks and 4 modes each for $16\times16$ luma and $8\times8$ chroma blocks. The number of mode combinations for each macroblock is therefore 592. This paper presents a method that reduces the RDO complexity by using simple directional masks and the modes of neighboring blocks. The proposed method reduces the number of mode combinations to at most 132. Experimental results show that it reduces the encoding time by up to $70\%$ compared with the H.264/AVC exhaustive search, with negligible PSNR loss and bitrate increase.

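H.264/AVC adopts many state-of-the-art techniques to improve coding performance, but as a result the encoder complexity has increased greatly. In intra prediction coding in particular, depending on the directional correlation with neighboring blocks, there are 9 coding modes for $4\times4$ luma blocks and 4 each for $16\times16$ luma blocks and $8\times8$ chroma blocks, and the optimal mode among them is selected by rate-distortion optimization (RDO). Encoding a single macroblock therefore requires 592 RDO computations. This paper proposes an algorithm that lowers encoding complexity by reducing the number of modes subject to RDO computation, using simply structured directional masks and the mode information of neighboring blocks. Experiments show that the proposed method reduces the number of RDO computations to at most 132 and the encoding time by up to $70\%$ compared with exhaustive search, with almost no change in PSNR or bitrate.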
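
The count of 592 quoted above follows from the usual way of counting intra mode combinations for one macroblock: each chroma mode is paired with every luma candidate, and the luma candidates are the 9 modes for each of the sixteen $4\times4$ blocks plus the 4 modes for the $16\times16$ block. The sketch below reproduces that arithmetic. The function name and parameters are hypothetical and are not taken from the paper or from any reference encoder.

```python
def rdo_mode_combinations(n_luma4x4=9, n_luma16x16=4, n_chroma8x8=4,
                          luma4x4_blocks=16):
    """Count RDO evaluations for one macroblock (illustrative helper).

    Each chroma 8x8 mode is combined with every luma candidate:
    all modes of every 4x4 luma block plus all 16x16 luma modes.
    """
    luma_candidates = luma4x4_blocks * n_luma4x4 + n_luma16x16
    return n_chroma8x8 * luma_candidates


# Exhaustive search: 4 * (16 * 9 + 4)
print(rdo_mode_combinations())  # 592

# Hypothetical reduced candidate counts, for illustration only.
# The abstract does not state how the bound of 132 is broken down.
print(rdo_mode_combinations(n_luma4x4=4, n_luma16x16=2, n_chroma8x8=2))  # 132
```

Because the chroma count multiplies the whole luma term, pruning even one or two candidates per block type reduces the total sharply, which is why restricting the candidate set is an effective way to cut RDO cost.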
