• Title/Summary/Keyword: compression coding

Search Result 828, Processing Time 0.027 seconds

Design and Development of T-DMB Multichannel Audio Service System Based on Spatial Audio Coding

  • Lee, Yong-Ju;Seo, Jeong-Il;Beack, Seung-Kwon;Jang, Dae-Young;Kang, Kyeong-Ok;Kim, Jin-Woong;Hong, Jin-Woo
    • ETRI Journal
    • /
    • v.31 no.4
    • /
    • pp.365-375
    • /
    • 2009
  • In this paper, a terrestrial digital multimedia broadcasting (T-DMB) multichannel audio broadcasting system based on spatial audio coding is presented. The proposed system provides realistic multichannel audio service via T-DMB with a small increase of data rate as well as backward compatibility with the conventional stereo-based T-DMB player. To reduce the data rate for additional multichannel audio signals, we compress the multichannel audio signals using the sound source location cue coding algorithm, which is an efficient parametric multichannel audio compression technique. For compatibility, we use the dependent property of an elementary stream descriptor, and this property should be ignored in a conventional T-DMB player. To verify the feasibility of the proposed system, we implement the T-DMB multichannel audio encoder and a prototype player. We perform a compatibility test using the T-DMB multichannel audio encoder and conventional T-DMB players. The test demonstrates that the proposed system is compatible with a conventional T-DMB player and that it can provide a promisingly rich audio service.

On a Multiband Nonuniform Samping Technique with a Gaussian Noise Codebook for Speech Coding (가우시안 코드북을 갖는 다중대역 비균일 음성 표본화법)

  • Chung, Hyung-Goue;Bae, Myung-Jin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.6
    • /
    • pp.110-114
    • /
    • 1997
  • When applying the nonuniform sampling to noisy speech signal, the required data rate increases to be comparable to or more than that by uniform sampling such as PCM. To solve this problem, we have proposed the waveform coding method, multiband nonuniform waveform coding(MNWC), applying the nonuniform sampling to band-separated speech signal[7]. However, the speech quality is deteriorated when it is compared to the uniform sampling method, since the high band is simply modeled as a Gaussian noise with average level. In this paper, as a good method to overcome this drawback, the high band is modeled as one of 16 codewords having different center frequencies. By doing this, with maintaining high speech quality as MOS score of average 3.16, the proposed method achieves 1.5 times higher compression ratio than that of the conventional nonuniform sampling method(CNSM).

  • PDF

S-CODE: A Subdivision Based Coding System for CAD Models

  • Takarada, Yosuke;Takeuchi, Shingo;Kawano, Isao;Hotta, Jun;Suzuki, Hiromasa
    • International Journal of CAD/CAM
    • /
    • v.3 no.1_2
    • /
    • pp.97-109
    • /
    • 2003
  • A large scale polygon models are often used to approximately represent 3D CAD models in Digital Engineering environment such as DMU (Digital Mockups) and network based collaborative design. However, they are not suitable for distribution on the network and for interactive rendering. We introduce a new coding system based on subdivision schemes called S-CODE system. In this system, it is possible to highly compress the model with sufficient accuracy and to view the model efficiently in a level of detail (LOD) fashion. The method is based on subdivision surface fitting by which a subdivision surface and curves which approximate a face of a CAD model are generated. We also apply a subdivision method to analytic surfaces such as conical and cylindrical surfaces. A prototype system is developed and used for evaluation with reasonably complicated data. The results show that the method is useful as a CAD data coding system.

Image Coding by Block Based Fractal Approximation (블록단위의 프래탈 근사화를 이용한 영상코딩)

  • 정현민;김영규;윤택현;강현철;이병래;박규태
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.31B no.2
    • /
    • pp.45-55
    • /
    • 1994
  • In this paper, a block based image approximation technique using the Self Affine System(SAS) from the fractal theory is suggested. Each block of an image is divided into 4 tiles and 4 affine mapping coefficients are found for each tile. To find the affine mapping cefficients that minimize the error between the affine transformed image block and the reconstructed image block, the matrix euation is solved by setting each partial differential coefficients to aero. And to ensure the convergence of coding block. 4 uniformly partitioned affine transformation is applied. Variable block size technique is employed in order to applynatural image reconstruction property of fractal image coding. Large blocks are used for encoding smooth backgrounds to yield high compression efficiency and texture and edge blocks are divided into smaller blocks to preserve the block detail. Affine mapping coefficinets are found for each block having 16$\times$16, 8$\times$8 or 4$\times$4 size. Each block is classified as shade, texture or edge. Average gray level is transmitted for shade bolcks, and coefficients are found for texture and edge blocks. Coefficients are quantized and only 16 bytes per block are transmitted. Using the proposed algorithm, the computational load increases linearly in proportion to image size. PSNR of 31.58dB is obtained as the result using 512$\times$512, 8 bits per pixel Lena image.

  • PDF

Fast Encoder Design for Multi-view Video

  • Zhao, Fan;Liao, Kaiyang;Zhang, Erhu;Qu, Fangying
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.8 no.7
    • /
    • pp.2464-2479
    • /
    • 2014
  • Multi-view video coding is an international encoding standard that attains good performance by fully utilizing temporal and inter-view correlations. However, it suffers from high computational complexity. This paper presents a fast encoder design to reduce the level of complexity. First, when the temporal correlation of a group of pictures is sufficiently strong, macroblock-based inter-view prediction is not employed for the non-anchor pictures of B-views. Second, when the disparity between two adjacent views is above some threshold, frame-based inter-view prediction is disabled. Third, inter-view prediction is not performed on boundary macroblocks in the auxiliary views, because the references for these blocks may not exist in neighboring views. Fourth, finer partitions of inter-view prediction are cancelled for macroblocks in static image areas. Finally, when estimating the disparity of a macroblock, the search range is adjusted according to the mode size distribution of the neighboring view. Compared with reference software, these techniques produce an average time reduction of 83.65%, while the bit-rate increase and peak signal-to-noise ratio loss are less than 0.54% and 0.05dB, respectively.

A design of CAVLC(Context-Adaptive Variable Length Coding) for H.264 (H.264 CAVLC(Context-Adaptive Variable Length Coding)설계)

  • Lee, Yong-Ju;Suh, Ki-Bum
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2008.10a
    • /
    • pp.108-111
    • /
    • 2008
  • In this paper, we propose an advanced hardware architecture for the CAVLC entropy encoder engine for real time Full HD video compression. Since there are 384 data coefficients which are sum of 376 AC coefficient and 8 DC coefficient per one macroblock, 384 coefficient have to be processed per one macroblock in worst case for real time processing. We propose an novel architecture which includes parallel architecture and pipeline processing, and reduction "0" in AC/DC coefficient table. To verify the proposed architecture, we develop the reference C for CAVLC and verified the designed circuit with the test vector from reference C code.

  • PDF

A Fast Block Mode Decision Scheme for P- Slices of High profile in H.264/AVC

  • Kim, Jong-Ho;Pahk, Un-Kyung;Kim, Mun-Churl;Choi, Jin-Soo
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2009.01a
    • /
    • pp.142-147
    • /
    • 2009
  • The recent H.264/AVC video coding standard provides a higher coding efficiency than previous standards. H.264/AVC achieves a bit rate saving of more than 50 % with many new technologies, but it is computationally complex. Most of fast mode decision algorithms have focused on Baseline profile of H.264/AVC. In this paper, a fast block mode decision scheme for P- slices in High profile is proposed to reduce the computational complexity for H.264/AVC because the High profile is useful for broadcasting and storage applications. To reduce the block mode decision complexity in P- pictures of High profile, we use the SAD value after $16{\times}16$ block motion estimation. This SAD value is used for the classification feature to divide all block modes into some proper candidate block modes. The proposed algorithm shows average speed-up factors of 47.42 ${\sim}$ 67.04% for IPPP sequences.

  • PDF

Adaptive Interleaved Motion Vector Coding using Motion Characteristics (움직임 특성을 이용한 적응적 교차 움직임 벡터 부-복호화)

  • Won, Kwang-Hyun;Yang, Jung-Youp;Park, Dae-Yun;Jeon, Byeung-Woo
    • Journal of Broadcast Engineering
    • /
    • v.16 no.2
    • /
    • pp.372-383
    • /
    • 2011
  • This paper proposes an improved design of an interleaved motion vector coding scheme with flexibility in predictive motion vector component by exploiting motion characteristics. It can use component-wise adaptive motion vector predictor based on the utility of spatial and temporal motion field without any signaling bit for indicating decoder of the selected predictive motion vector component. Experiment with test video data shows about 1.99% (max 8.71%) bit rate reduction compared to the conventional H.264/AVC method.

Evaluation of Various Tone Mapping Operators for Backward Compatible JPEG Image Coding

  • Choi, Seungcheol;Kwon, Oh-Jin;Jang, Dukhyun;Choi, Seokrim
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.9 no.9
    • /
    • pp.3672-3684
    • /
    • 2015
  • Recently, the standardization of backward compatible JPEG image coding for high dynamic range (HDR) image has been undertaken to establish an international standard called "JPEG XT." The JPEG XT consists of two layers: the base layer and the residual layer. The base layer contains tone mapped low dynamic range (LDR) image data and the residual layer contains the error signal used to reconstruct the HDR image. This paper gives the result of a study to evaluate the overall performance of tone mapping operators (TMOs) for this standard. The evaluation is performed using five HDR image datasets and six TMOs for profiles A, B, and C of the proposed JPEG XT standard. The Tone Mapped image Quality Index (TMQI) and no reference image quality assessment (NR IQA) are used for measuring the LDR image quality. The peak signal to noise ratio (PSNR) is used to evaluate the overall compression performance of JPEG XT profiles A, B, and C. In TMQI and NR IQA measurements, TMOs using display adaptive tone mapping and adaptive logarithmic mapping each gave good results. A TMO using adaptive logarithmic mapping gave good PSNRs.

Adaptive Coding Mode Decision Algorithm using Motion Vector Map in H.264/AVC Video Coding (H.264/AVC 부호기에서 움직임 벡터 맵을 이용한 적응적인 부호화 모드 결정 방법)

  • Kim, Tae-Jung;Ko, Man-Geun;Suh, Jae-Won
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.46 no.2
    • /
    • pp.48-56
    • /
    • 2009
  • We propose a fast intra mode skip decision algorithm for H.264/AVC video encoding. Although newly added MB encoding algorithms based on various prediction methods increase compression ratio, they require a significant increase in the computational complexity because we calculate rate-distortion(RD) cost for all possible MB coding modes and then choose the best one. In this paper, we propose a fast mode decision algorithm based on an adaptive motion vector map(AMVM) method for H.264/AVC video encoding to reduce the processing time for the inter frame. We verify that the proposed algorithm generates generally good performances in PSNR, bit rates, and processing time.