Search | Korea Science

Largest Coding Unit Level Rate Control Algorithm for Hierarchical Video Coding in HEVC

Yoon, Yeo-Jin;Kim, Hoon;Baek, Seung-Jin;Ko, Sung-Jea
- IEIE Transactions on Smart Processing and Computing
- /
- v.1 no.3
- /
- pp.171-181
- /
- 2012
In the new video coding standard, called high efficiency video coding (HEVC), the coding unit (CU) is adopted as a basic unit of a coded block structure. Therefore, the rate control (RC) methods of H.264/AVC, whose basic unit is a macroblock, cannot be applied directly to HEVC. This paper proposes the largest CU (LCU) level RC method for hierarchical video coding in a HEVC. In the proposed method, the effective bit allocation is performed first based on the hierarchical structure, and the quantization parameters (QP) are then determined using the Cauchy density based rate-quantization (RQ) model. A novel method based on the linear rate model is introduced to estimate the parameters of the Cauchy density based RQ model precisely. The experimental results show that the proposed RC method not only controls the bitrate accurately, but also generates a constant number of bits per second with less degradation of the decoded picture quality than with the fixed QP coding and latest RC method for HEVC.
PDF

LCU-Level Rate Control for HEVC Considering Hierarchical Coding Structure (HEVC의 계층적 부호화 구조를 고려한 LCU 단위의 비트율 제어 기법)

Park, Dong-Il;Kim, Jae-Gon;Lim, Sung-Chang;Kim, Jong-Ho;Kim, Hui-Yong
- Journal of Broadcast Engineering
- /
- v.16 no.5
- /
- pp.762-772
- /
- 2011
In this paper, a method of rate control for constant bitrate (CBR) coding of High Efficiency Video Coding (HEVC) is addressed. The existing rate control of H.264/AVC may not provide exact rate control in the case of hierarchical coding structure since it doesn't consider the characteristics of the hierarchical coding structure. It is expected that a rate control is added to the reference software called HM for CBR encoding in the near future. More accurate rate control may be required in a hierarchical structure of random access (RA) mode defined in the common test condition of HM. In this paper, we propose a method of rate control based on quadratic Rate-Distortion (R-D) model considering temporal layers and frame types in hierarchical coding structure for efficient rate control. In the consideration of the trade-off relationship between the bit fluctuation and the average PSNR, both of frame and coding unit (CU) are set as the basic unit of rate control. The performance of the proposed rate control method is verified by simulations along with the trade-off relationships for the both cases of basic unit.
https://doi.org/10.5909/JEB.2011.16.5.762 인용 PDF KSCI

A Study on New Hierarchical Motion Compensation Pyramid Coding (새로운 계층적 이동 보상 피라미드 부호화 방식 연구)

전준현
- Journal of Broadcast Engineering
- /
- v.8 no.2
- /
- pp.181-197
- /
- 2003
Notion Compensation(MC) technique using Sub-Band Coding with the hierarchical structure is efficient to estimate real motion. In the hierarchical pyramid method, low-band MC pyramid method is popular, where the upper layer estimate the glover motion and next lower layer estimate the local motion. The low-band MC pyramid scheme has two problems. First, because the quantization errors at lower layer are accumulated when using coding and quantizing, it is impossible to search the exact Motion Vector(MV) Second, because of the top-down search problem in the hierarchical structure, MV mismatch in upper layer causes serious MV in lower layer So. we propose new hierarchical MC pyramid method based on edge classification. In this Paper, we show that the performance of proposed Pass-band motion compensation pyramid technique is better than low-band motion compensation pyramid. Also, in the pyramid motion estimation, we propose initial MV estimation scheme based on the edge-pattern classification. As a result, we find that PSNR was increased.
PDF KSCI

Rate control to reduce bitrate fluctuation on HEVC

Yoo, Jonghun;Nam, Junghak;Ryu, Jiwoo;Sim, Donggyu
- IEIE Transactions on Smart Processing and Computing
- /
- v.1 no.3
- /
- pp.152-160
- /
- 2012
This paper proposes a frame-level rate control algorithm for low delay video applications to reduce the fluctuations in the bitrate. The proposed algorithm minimizes the bitrate fluctuations in two ways with minimal coding loss. First, the proposed rate control applies R-Q model to all frames including the first frame of every group of pictures (GOP) except for the first one of a sequence. Conventional rate control algorithms do not use any R-Q models for the first frame of each GOP and do not estimate the generated-bit. An unexpected output rate result from the first frame affects the remainder of the pictures in the rate control. Second, a rate-distortion (R-D) cost is calculated regardless of the hierarchical coding structure for low bitrate fluctuations because the hierarchical coding structure controls the output bitrate in rate distortion optimization (RDO) process. The experimental results show that the average variance of per-frame bits with the proposed algorithm can reduce by approximately 33.8% with a delta peak signal-to-noise ratio (PSNR) degradation of 1.4dB for a "low-delay B" coding structure and by approximately 35.7% with a delta-PSNR degradation of 1.3dB for a "low-delay P" coding structure, compared to HM 8.0 rate control.
PDF

A Tree Regularized Classifier-Exploiting Hierarchical Structure Information in Feature Vector for Human Action Recognition

Luo, Huiwu;Zhao, Fei;Chen, Shangfeng;Lu, Huanzhang
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.11 no.3
- /
- pp.1614-1632
- /
- 2017
Bag of visual words is a popular model in human action recognition, but usually suffers from loss of spatial and temporal configuration information of local features, and large quantization error in its feature coding procedure. In this paper, to overcome the two deficiencies, we combine sparse coding with spatio-temporal pyramid for human action recognition, and regard this method as the baseline. More importantly, which is also the focus of this paper, we find that there is a hierarchical structure in feature vector constructed by the baseline method. To exploit the hierarchical structure information for better recognition accuracy, we propose a tree regularized classifier to convey the hierarchical structure information. The main contributions of this paper can be summarized as: first, we introduce a tree regularized classifier to encode the hierarchical structure information in feature vector for human action recognition. Second, we present an optimization algorithm to learn the parameters of the proposed classifier. Third, the performance of the proposed classifier is evaluated on YouTube, Hollywood2, and UCF50 datasets, the experimental results show that the proposed tree regularized classifier obtains better performance than SVM and other popular classifiers, and achieves promising results on the three datasets.
https://doi.org/10.3837/tiis.2017.03.020 인용 PDF KSCI

Motion detection and compensation in object-oriented coding based on combined mapping parameter estimation using hierarchical structure (물체지향 부화화에서 계층적 구조를 이용한 결합형 변환 파라미터 추정 기법에 의한 움직임 검출 및 보상)

이창범;김준식;박래홍
- Journal of the Korean Institute of Telematics and Electronics A
- /
- v.33A no.3
- /
- pp.163-175
- /
- 1996
This paper invetigates estimation methods of mapping parameters in object-oriented coding. In this paper, we propose a fast parameter estimation method with its performance similar to that of the conventional methods. We employ hierarchical structure in difference images to redcue the computational complexity and also combine conventional six- and eight-mapping parameter estimation methods to compensate for the performance degradation caused by employment of hierarchical structure. Computer simulation shows that the proposed mehtod gives results similar to conventional methods with greatly reduced computational complexity.
PDF

Temporal Prediction Structure for Multi-view Video Coding (다시점 비디오 부호화를 위한 시간적 예측 구조)

Yoon, Hyo-Sun;Kim, Mi-Young
- Journal of Korea Multimedia Society
- /
- v.15 no.9
- /
- pp.1093-1101
- /
- 2012
Multi-view video is obtained by capturing one three-dimensional scene with many cameras at different positions. Multi-view video coding exploits inter-view correlations among pictures of neighboring views and temporal correlations among pictures of the same view. Multi-view video coding which uses many cameras requires a method to reduce the computational complexity. In this paper, we proposed an efficient prediction structure to improve performance of multi-view video coding. The proposed prediction structure exploits an average distance between the current picture and its reference pictures. The proposed prediction structure divides every GOP into several small groups to decide the maximum index of hierarchical B layer and the number of pictures of each B layer. Experimental results show that the proposed prediction structure shows good performance in image quality and bit-rates. When compared to the performance of hierarchical B pictures of Fraunhofer-HHI, the proposed prediction structure achieved 0.07~0.13 (dB) of PSNR gain and was down by 6.5(Kbps) in bitrate.
https://doi.org/10.9717/kmms.2012.15.9.1093 인용 PDF KSCI

Adaptive Multiview Video Coding Scheme Based on Spatiotemporal Correlation Analyses

Zhang, Yun;Jiang, Gang-Yi;Yu, Mei;Ho, Yo-Sung
- ETRI Journal
- /
- v.31 no.2
- /
- pp.151-161
- /
- 2009
In this paper, we propose an adaptive multiview video coding scheme based on spatiotemporal correlation analyses using hierarchical B picture (AMVC-HBP) for the integrative encoding performances, including high compression efficiency, low complexity, fast random access, and view scalability, by integrating multiple prediction structures. We also propose an in-coding mode-switching algorithm that enables AMVC-HBP to adaptively select a better prediction structure in the encoding process without any additional complexity. Experimental results show that AMVC-HBP outperforms the previous multiview video coding scheme based on H.264/MPEG-4 AVC using the hierarchical B picture (MVC-HBP) on low complexity for 21.5%, on fast random access for about 20%, and on view scalability for 11% to 15% on average. In addition, distinct coding gain can be achieved by AMVC-HBP for dense and fast-moving sequences compared with MVC-HBP.
PDF

Improved Prediction Structure and Motion Estimation Method for Multi-view Video Coding (다시점 비디오 부호화를 위한 개선된 예측 구조와 움직임 추정 기법)

Yoon, Hyo Sun;Kim, Mi Young
- Journal of KIISE
- /
- v.41 no.11
- /
- pp.900-910
- /
- 2014
Multi-view video is obtained by capturing one three-dimensional scene with many cameras at different positions. The computational complexity of multi view video coding increases in proportion to the number of cameras. To reduce computational complexity and maintain the image quality, improved prediction structure and motion estimation method is proposed in this paper. The proposed prediction structure exploits an average distance between the current picture and its reference pictures. The proposed prediction structure divides every GOP into several groups to decide the maximum index of hierarchical B layer and the number of pictures of each B layer. And the proposed motion estimation method uses a hierarchical search strategy. This strategy method consists of modified diamond search pattern, progressive diamond search pattern and modified raster search pattern. Experiment results show that the complexity reduction of the proposed prediction structure and motion estimation method over JMVC (Joint Multiview Video Coding) reference model using hierarchical B pictures of Fraunhofer-HHI and TZ search method can be up to 40~70% while maintaining similar video quality and bit rates.
https://doi.org/10.5626/JOK.2014.41.11.900 인용

Progressive Image Transmission Using Hierarchical Pyramid Structure and Classified Vector Quantizer in DCT Domain (계층적 피라미드 구조와 DCT 영역에서의 분류 벡터 양지기를 이용한 점진적 영상전송)

박섭형;이상욱
- Journal of the Korean Institute of Telematics and Electronics
- /
- v.26 no.8
- /
- pp.1227-1237
- /
- 1989
In this paper, we propose a lossless progressive image transmission scheme using hierarchical pyramid structure and classified vector quantizer in DCT domain. By adopting DCT to the hierarchical pyramid signals, we can reduce the spatial redundance. Moreover, the DCT coefficients can be encoded efficiently by using classified vector quantizer in DCT domain. The classifier is simply based on the variance of a subblock. Also, the mirror set of training set of images can improve the robustness of codebooks. Progressive image transmission can be achieved through following processes: from top to bottom level of planes in a pyramid, and from high to low AC variance class in a plane. Some simulation results with real images show that the proposed coding scheme yields a good performance at below 0.3 bpp and an excellent result at 0.409 bpp. The proposed coding scheme is well suited for lossless progressive image transmission as well as image data compression.
PDF

Search Result 50, Processing Time 0.018 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)