• Title/Summary/Keyword: MPEG-4 Video


Rate-Distortion Model for HEVC Quadtree Coding (HEVC 쿼드트리 부호화를 위한 율-왜곡 모델)

  • Lee, Bumshik;Kim, Munchurl
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2011.07a
    • /
    • pp.169-172
    • /
    • 2011
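  • ISO/IEC MPEG and ITU-T VCEG recently formed the JCT-VC (Joint Collaborative Team on Video Coding) and are working to establish HEVC (High Efficiency Video Coding) as the next-generation video compression standard. The goal is to improve coding efficiency by up to 50% over H.264/AVC, previously regarded as the codec with the best compression performance. HEVC adopts a coding structure different from that of H.264/AVC and is being designed to encode large pictures as efficiently as small ones. Prediction and transform coding follow a hierarchical quadtree structure; in particular, transform coding is extended from small transform blocks up to $32{\times}32$ transform blocks, forming a hierarchical transform structure. This paper proposes a rate-distortion (R-D) model for the quadtree-based HEVC codec standard, whose coding structure differs from that of existing codecs. In existing codecs, the basic coding unit is fixed at $16{\times}16$, and the transform and quantization blocks are $4{\times}4$ or $8{\times}8$, which are both small and fixed in size. Rate-distortion models were therefore built on a single probability model, and they achieved relatively accurate results. HEVC, however, uses basic coding, prediction, and transform/quantization techniques with hierarchical variable block sizes, so the existing single-model approach cannot easily yield an accurate rate-distortion model. The proposed method builds the rate-distortion model from an independent probability model for each CU (Coding Unit), the basic unit of HEVC. Because the CU size is variable and the texture inside a CU differs markedly with its size, experiments confirm that this approach is considerably more effective than using a single model.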


Performance Analysis of Coding According to the Interpolation filter in Inter layer Intra Prediction of H.264/SVC (H.264/SVC의 계층간 화면내 예측에서 보간법에 따른 부호화 성능 분석)

  • Gil, Dae-Nam;Cheong, Cha-Keon
    • Proceedings of the IEEK Conference
    • /
    • 2009.05a
    • /
    • pp.225-227
    • /
    • 2009
  • H.264/SVC, an international standard extended from H.264/AVC, was established to promote free use of large volumes of multimedia data across various channel environments. H.264/AVC is an international standard specification for video compression, developed by the JVT of ISO/IEC MPEG and ITU-T VCEG and adopted and commercialized as the standard for DMB broadcasting. To improve coding efficiency, the SVC standard uses the intra/inter prediction of AVC together with inter-layer intra prediction, inter-layer motion prediction, and inter-layer residual prediction. Among these, inter-layer intra prediction uses the co-located block of the upsampled lower layer as the prediction signal. The interpolation applied at this step is one of the most important factors determining coding efficiency. SVC currently applies 4-tap and 2-tap poly-phase FIR filters to the luma and chroma components, respectively. This paper analyzes coding performance as a function of the interpolation method: we applied 2-tap, 4-tap, and 6-tap poly-phase FIR filters to the luma components and measured the bit rate, PSNR, and running time of the interpolation filter. We expect the results to be useful for applying interpolation filters effectively.


Low-complexity patch projection method for efficient and lightweight point-cloud compression

  • Sungryeul Rhyu;Junsik Kim;Gwang Hoon Park;Kyuheon Kim
    • ETRI Journal
    • /
    • v.46 no.4
    • /
    • pp.683-696
    • /
    • 2024
  • The point cloud provides viewers with intuitive geometric understanding but requires a huge amount of data. Moving Picture Experts Group (MPEG) has developed video-based point-cloud compression with compression rates in the range of 300-700. As the compression rate increases, the complexity increases to the extent that it takes 101.36 s to compress one frame in an experimental environment using a personal computer. To realize real-time point-cloud compression processing, the direct patch projection (DPP) method proposed herein simplifies the complex patch segmentation process by classifying and projecting points according to their geometric positions. The DPP method decreases the complexity of the patch segmentation from 25.75 s to 0.10 s per frame, and the entire process becomes 8.76 times faster than the conventional one. Consequently, the proposed DPP method yields peak signal-to-noise ratio (PSNR) outcomes similar to those of the conventional method at reduced times (4.7-5.5 times) at the cost of bitrate overhead. The objective and subjective results show that the proposed DPP method can be considered when low complexity is required in lightweight device environments.

Design and Implementation of 8K UHD Encapsulation Method for Efficient Transmission and Reception based on MMT

  • Song, Seulki;Ryu, Youngsu;Wee, Jungwook;Park, Kyungwon;Kwon, Kiwon
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.2
    • /
    • pp.860-872
    • /
    • 2018
  • In this paper, we propose an 8K UHD (Ultra High Definition) encapsulation method for efficient transmission and reception based on MMT (MPEG Media Transport). Broadcasting services for 8K UHD give users a maximized sense of reality. However, present technology has difficulty providing 8K UHD over broadcasting networks, because the 8K UHD bitrate is too high to be transmitted over current broadcasting networks. Research on transmitting 8K UHD is underway. In some studies, a receiver is implemented with four 4K UHD displays instead of an 8K UHD display. To transmit 8K UHD within the limited transmission bitrate of a broadcasting network, 8K UHD content is encoded by SHVC (Scalable High Efficiency Video Coding) and then transmitted over a heterogeneous network. The MMT standard is used to combine the broadcasting and communication networks. MMT is an IP-based, next-generation transmission protocol. According to the MMT standard, the video stream is encapsulated and transmitted in MMTP (MMT Protocol) packets. IP-based broadcasting and communication networks can be used to transmit simultaneously, and the receiver can synchronize and play the streams. We propose an encapsulation method that can efficiently transmit and receive 8K UHD. The proposed method increases the payload rate and decreases the initial delay at the receiver. Experimental tests verify the efficiency of the proposed method.

VLSI architecture design of CAVLC entropy encoder/decoder for H.264/AVC (H.264/AVC를 위한 CAVLC 엔트로피 부/복호화기의 VLSI 설계)

  • Lee Dae-joon;Jeong Yong-jin
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.30 no.5C
    • /
    • pp.371-381
    • /
    • 2005
  • In this paper, we propose an advanced hardware architecture for a CAVLC entropy encoder/decoder engine for real-time video compression. CAVLC (Context-based Adaptive Variable Length Coding) is a lossless compression method in H.264/AVC; it has high compression efficiency but also high computational complexity. The reference memory size is optimized using a partitioned storing method and a memory reuse method, both based on the partiality of memory referencing. From several encoder/decoder architectures, we choose the one most suitable for mobile devices and improve its performance using parallel processing. The proposed architecture has been verified on an ARM-interfaced emulation board using Altera Excalibur and synthesized in Samsung 0.18 µm CMOS technology. The synthesis results show that the encoder can process about 300 CIF frames/s at 150 MHz and the decoder about 250 CIF frames/s at 140 MHz. The hardware architectures are being used as core modules in implementing a complete H.264/AVC video encoder/decoder chip for real-time multimedia applications.

An Efficient H.264/AVC Decoding Technique Using Prefetching Mechanism (선인출 메커니즘을 이용한 효율적인 H.264/AVC 복호화 기법)

  • Ji, Shin-Haeng;Park, Jung-Wook;Kim, Shin-Dug
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.11a
    • /
    • pp.946-948
    • /
    • 2005
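  • H.264/AVC is the most recent video coding standard, published jointly by ISO/IEC MPEG and the ITU-T Video Coding Experts Group. It provides a higher compression ratio and better picture quality with fewer bits than earlier standards. However, its complexity is considerably higher because of the compression techniques the standard adopts, including quarter-pixel motion estimation and compensation, motion estimation over seven variable block sizes, RD (Rate-Distortion) evaluation for each block mode, and CAVLC. This paper proposes a method to efficiently perform and optimize the motion compensation module, which accounts for more than about $40\%$ of the complexity of an H.264/AVC decoder. Motion compensation builds macroblocks from the reference frame according to the predicted motion vector, and the external memory references involved have a large access latency. We apply a prefetching mechanism that predicts and issues these references in advance, reducing total execution time. To this end, the variable-length decoding module and the motion compensation module were modified to reduce the number of repeated read and write operations. Experimental results show that decoding with the proposed method incurs no loss in PSNR (Peak Signal to Noise Ratio) while improving the total execution time of the decoder by about $5\%$ and the execution time of motion compensation, the core module, by about $20\%$.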


Design of Service Signaling Structure based on MMT for Terrestrial UHD Broadcasting Systems (MMT 기반 지상파 UHD 방송을 위한 서비스 시그널링 구조 설계)

  • Seo, Min-jae;Yu, Kyung-A;Paik, Jong-Ho
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2014.06a
    • /
    • pp.126-128
    • /
    • 2014
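  • Rapid advances in digital broadcasting technology have opened a new era of UHD (Ultra High Definition) broadcasting services that support 4 to 16 times the picture quality of conventional HD (High Definition). UHD broadcasting is a next-generation realistic broadcasting technology capable of ultra-high-definition video and 22.2-channel audio services. Providing such UHD services requires HEVC (High Efficiency Video Coding), a high-compression video codec technology; high-capacity transmission technology based on OFDM (Orthogonal Frequency Division Multiplexing); and a transport protocol that supports a variety of supplementary multimedia services. MMT (MPEG Media Transport), currently being standardized as a UHD broadcast transport protocol, can be applied over heterogeneous networks and supports bidirectional switching, which lets it reflect viewer requirements in real time. However, the NIT (Network Information Table), RRT (Rating Region Table), and SDT (Service Description Table), which carry the information needed to receive terrestrial UHD broadcasting services quickly and effectively, are not included in the MMT signaling protocol. This paper therefore proposes a service signaling structure that includes NIT, RRT, and SDT information for MMT-based terrestrial UHD broadcasting, enabling effective reception of terrestrial UHD broadcasting services.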


A Study Vector Image Transformation of Personal Feature And Image Interpolation (2차원 얼굴외곽 정보의 VECTOR IMAGE 변환과 효과적인 영상복원에 관한 연구)

  • Jo, Nam-Chul
    • Journal of the Korea society of information convergence
    • /
    • v.1 no.1
    • /
    • pp.17-24
    • /
    • 2008
  • Video cameras play a very important role in preventing many kinds of crime and in resolving criminal cases. However, when a specific person is recorded far from the CCTV camera, the original image must be enlarged and restored in order to identify the person more clearly. Interpolation is usually used for this enlargement and restoration, but it has a limitation: as the magnification increases, the quality of the image degrades. This paper uses the FDP (Facial Definition Parameter) proposed by the MPEG-4 SNHC FBA group and introduces a new algorithm that uses face outline information from the original image based on the FDP, which makes better restoration possible than with previously known methods.


Motion estimation method using multiple linear regression model (다중선형회귀모델을 이용한 움직임 추정방법)

  • 김학수;임원택;이재철;이규원;박규택
    • Journal of the Korean Institute of Telematics and Electronics S
    • /
    • v.34S no.10
    • /
    • pp.98-103
    • /
    • 1997
  • Given the small bit allocation for motion information in very low bit-rate coding, motion estimation using the block matching algorithm (BMA) fails to maintain an acceptable level of prediction error. The reason is that the motion model, or spatial transformation, assumed in block matching cannot approximate real-world motion precisely with a small number of parameters. To overcome this drawback of the conventional block matching algorithm, several triangle-based methods, which use triangular patches instead of blocks, have been proposed. To estimate the motion of image sequences, these methods have usually been based on a combination of the optical flow equation, affine transforms, and iteration, but their computational cost is high. This paper presents a fast motion estimation algorithm using a multiple linear regression model to address the shortcomings of the BMA and the triangle-based methods. After describing the basic 2-D triangle-based method, the details of the proposed multiple linear regression model are presented along with motion estimation results from one standard video sequence, representative of MPEG-4 class A data. The simulation results show that the proposed method improves the average PSNR by about 1.24 dB compared with the BMA method and reduces the computational cost by about 25% compared with the 2-D triangle-based method.


Fast Hybrid Transform: DCT-II/DFT/HWT

  • Xu, Dan-Ping;Shin, Dae-Chol;Duan, Wei;Lee, Moon-Ho
    • Journal of Broadcast Engineering
    • /
    • v.16 no.5
    • /
    • pp.782-792
    • /
    • 2011
  • In this paper, we present a new fast DCT-II/DFT/HWT hybrid transform architecture for digital video and fusion mobile handsets based on Jacket-like sparse matrix decomposition. This fast hybrid architecture covers source coding standards such as MPEG-4 and JPEG 2000 as well as the discrete Fourier transform used in digital filtering, and it has two operations: one is the block-wise inverse Jacket matrix (BIJM) for DCT-II, and the other is the element-wise inverse Jacket matrix (EIJM) for DFT/HWT. They share a similar recursive computational form, which means that all of them can be decomposed into Kronecker products of an identity Hadamard matrix and a successively lower-order sparse matrix. Based on this property, a single chip implementing the fast hybrid algorithm architecture can be developed for intelligent mobile handsets.