• Title/Summary/Keyword: Rate distortion

Search Result 820, Processing Time 0.021 seconds

A Temporal Decomposition Method Based on a Rate-distortion Criterion (비트율-왜곡 기반 음성 신호 시간축 분할)

  • 이기승
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.3
    • /
    • pp.315-322
    • /
    • 2002
  • In this paper, a new temporal decomposition method is proposed. which takes into consideration not only spectral distortion but also bit rates. The interpolation functions, which are one of necessary parameters for temporal decomposition, are obtained from the training speech corpus. Since the interval between the two targets uniquely defines the interpolation function, the interpolation can be represented without additional information. The locations of the targets are determined by minimizing the bit rates while the maximum spectral distortion maintains below a given threshold. The proposed method has been applied to compressing the LSP coefficients which are widely used as a spectral parameter. The results of the simulation show that an average spectral distortion of about 1.4 dB can be achieved at an average bit rate of about 8 bits/Frame.

An Adaptive Control for the Propagation Errors Incurred by DCT Coefficient-Dropping Transcoder

  • Kim, Jin-Soo;Kim, Jae-Gon;Seo, Kwang-Deok;Yun, Mong-Han
    • ETRI Journal
    • /
    • v.29 no.5
    • /
    • pp.559-568
    • /
    • 2007
  • This paper presents a new distortion control scheme with a simple estimation model for the propagation errors incurred by dropping some parts of the bitstream in a frame dropping-coefficient dropping (FD-CD) transcoder. The primary goal of this paper is to facilitate bit-rate conversions and rate-distortion controls in the compressed domain without introducing a full decoding and reencoding system in the pixel domain. First, the error propagation behavior over several frame sequences due to coefficient dropping is investigated on the basis of statistical and empirical properties. Then, such properties are used to develop a simple estimation model for the CD distortion accounting for the characteristics of the underlying coded-frame. Finally, the proposed estimation model allows us to determine the amount of coefficient dropping and to effectively allocate rate-distortions into coded-frames. Experimental results show that the proposed estimation model accurately describes the characteristics of propagation errors adaptively in the compressed domain and can be easily applied to distortion control over different kinds of video sequences.

  • PDF

Quantization of LPC Coefficients Using a Multi-frame AR-model (Multi-frame AR model을 이용한 LPC 계수 양자화)

  • Jung, Won-Jin;Kim, Moo-Young
    • The Journal of the Acoustical Society of Korea
    • /
    • v.31 no.2
    • /
    • pp.93-99
    • /
    • 2012
  • For speech coding, a vocal tract is modeled using Linear Predictive Coding (LPC) coefficients. The LPC coefficients are typically transformed to Line Spectral Frequency (LSF) parameters which are advantageous for linear interpolation and quantization. If multidimensional LSF data are quantized directly using Vector-Quantization (VQ), high rate-distortion performance can be obtained by fully utilizing intra-frame correlation. In practice, since this direct VQ system cannot be used due to high computational complexity and memory requirement, Split VQ (SVQ) is used where a multidimensional vector is split into multilple sub-vectors for quantization. The LSF parameters also have high inter-frame correlation, and thus Predictive SVQ (PSVQ) is utilized. PSVQ provides better rate-distortion performance than SVQ. In this paper, to implement the optimal predictors in PSVQ for voice storage devices, we propose Multi-Frame AR-model based SVQ (MF-AR-SVQ) that considers the inter-frame correlations with multiple previous frames. Compared with conventional PSVQ, the proposed MF-AR-SVQ provides 1 bit gain in terms of spectral distortion without significant increase in complexity and memory requirement.

Implementation of an Efficient Rate-Distortion Optimization Algorithm for JPEG2000 (JPEG2000 영상 압축을 위한 효율적인 비율-왜곡 최적화 알고리즘 구현)

  • Moon Hyoung-Jin;Jung Gab-Cheon;Park Seong-Mo
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.43 no.3 s.309
    • /
    • pp.50-58
    • /
    • 2006
  • This paper describes the implementation of an efficient Rate-Distortion Optimization algerian to speed up rate control in JPEG2000. While the conventional algorithm determines the rate constant by averaging maximum R-D slope and minimum R-D slope for entire image, the proposed algorithm determines it by using R-D slopes of coding passes located near truncation point. Moreover, the rate allocation in proposed algorithm is conducted about only coding passes excluded from the previous rate allocation. As a result, it can reduce the number of operations required for rate-distortion optimization. The proposed algorithm was implemented in C programing language and was executed on the Altera Excalibur(EPXA4) development board.

An Accurate Estimation of Channel Loss Threshold Set for Optimal FEC Code Rate Decision (최적의 FEC 부호율 결정을 위한 정확한 채널손실 한계집합 추정기법)

  • Jung, Tae-Jun;Jeong, Yo-Won;Seo, Kwang-Deok
    • Journal of Broadcast Engineering
    • /
    • v.19 no.2
    • /
    • pp.268-271
    • /
    • 2014
  • Conventional forward error correction (FEC) code rate decision schemes using analytical source coding distortion model and channel-induced distortion model are usually complex, and require the typical process of model parameter training which involves potentially high computational complexity and implementation cost. To avoid the complex modeling procedure, we propose a simple but accurate joint source-channel distortion model to estimate channel loss threshold set for optimal FEC code rate decision.

THE STUDY OF APICAL CHANGES ON THE ORTHOPANTOMOGRAPH (Orthopantomograph에 있어서 치근부 상의 변화에 관한 연구)

  • Ahn Hyung Kyu
    • Journal of Korean Academy of Oral and Maxillofacial Radiology
    • /
    • v.9 no.1
    • /
    • pp.19-23
    • /
    • 1979
  • A study was made primarily to investigate vertical and horizontal distortion of the image at the apical region of the dental roots in orthopantomographs. The subjects consisted of two dry skulls with radiopaque materials attached to root surface. Measuring of the width and length of each predetermined point at 23 teeth was performed in dry skulls and radiographic films. The results obtained were as follows; 1. There was overall magnification of image in the vertical dimension. And anterior portion had greater magnification rate than posterior portion, while lower anterior portion had less magnification rate than upper anterior portion. 2. There was reduction of the image in the horizontal dimension of the teeth, because of the position relation between dry skull and image layer of the orthopantomograph. 3. There was a significant difference in distortion rate between the oposite teeth. 4. Cervical portion of the tooth had more decreased rate of horizontal distortion than apical portion.

  • PDF

Optimal Packet Scheduling Algorithms for Token-Bucket Based Rate Control

  • Mehta Neerav Bipin;Karandikar Abhay
    • Journal of Communications and Networks
    • /
    • v.7 no.1
    • /
    • pp.65-75
    • /
    • 2005
  • In this paper, we consider a scenario in which the source has been offered QoS guarantees subject to token-bucket regulation. The rate of the source should be controlled such that it conforms to the token-bucket regulation, and also the distortion obtained is the minimum. We have developed an optimal scheduling algorithm for offline (like pre-recorded video) sources with convex distortion function and which can not tolerate any delay. This optimal offline algorithm has been extended for the real-time online source by predicting the number of packets that the source may send in future. The performance of the online scheduler is not substantially degraded as compared to that of the optimal offline scheduler. A sub-optimal offline algorithm has also been developed to reduce the computational complexity and it is shown to perform very well. We later consider the case where the source can tolerate a fixed amount of delay and derive optimal offline algorithm for such traffic source.

A Fast Intra-Prediction Method in HEVC Using Rate-Distortion Estimation Based on Hadamard Transform

  • Kim, Younhee;Jun, DongSan;Jung, Soon-Heung;Choi, Jin Soo;Kim, Jinwoong
    • ETRI Journal
    • /
    • v.35 no.2
    • /
    • pp.270-280
    • /
    • 2013
  • A fast intra-prediction method is proposed for High Efficiency Video Coding (HEVC) using a fast intra-mode decision and fast coding unit (CU) size decision. HEVC supports very sophisticated intra modes and a recursive quadtree-based CU structure. To provide a high coding efficiency, the mode and CU size are selected in a rate-distortion optimized manner. This causes a high computational complexity in the encoder, and, for practical applications, the complexity should be significantly reduced. In this paper, among the many predefined modes, the intra-prediction mode is chosen without rate-distortion optimization processes, instead using the difference between the minimum and second minimum of the rate-distortion cost estimation based on the Hadamard transform. The experiment results show that the proposed method achieves a 49.04% reduction in the intra-prediction time and a 32.74% reduction in the total encoding time with a nearly similar coding performance to that of HEVC test model 2.1.

Approximation Vertex Search of Polygon-based Shape Coding by the Type of Distortion Patterns (왜곡 패턴 유형에 의한 다각형 기반 형상 부호화의 근사 정점 탐색)

  • Seo Jeong-Gu;Kwak No-Yoon;Seo Beom-Seok;Hwang Byong-Won
    • Journal of Digital Contents Society
    • /
    • v.3 no.2
    • /
    • pp.197-209
    • /
    • 2002
  • If we reduce the number of vertexes to decrease bit rate in polygon-based shape coding, the distortion of approximated contour increases rapidly. On the other hand, if we reduce the distortion, the number of vertexes increases rapidly and many bits are required to encode the vertexes. To improve this problem, in this paper we propose the approximation vertex search method. The encoder in the proposed method searches the type of distortion patterns that is the most similar to the shape which polygon edge and contour segment form and then encodes it. And then, the decoder mathematically finds the approximated vertexes from decoded distortion pattern information. Therefore, the proposed algorithm results in encoding many vertexes at a low bit rate and having the smoother shape than conventional method. As shown in computer simulation, the proposed method has less distortion than conventional method. It costs less bit rate by $10{\sim}20%$ than conventional algorithm in same distortion.

  • PDF

Correction of Signboard Distortion by Vertical Stroke Estimation

  • Lim, Jun Sik;Na, In Seop;Kim, Soo Hyung
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.7 no.9
    • /
    • pp.2312-2325
    • /
    • 2013
  • In this paper, we propose a preprocessing method that it is to correct the distortion of text area in Korean signboard images as a preprocessing step to improve character recognition. Distorted perspective in recognizing of Korean signboard text may cause of the low recognition rate. The proposed method consists of four main steps and eight sub-steps: main step consists of potential vertical components detection, vertical components detection, text-boundary estimation and distortion correction. First, potential vertical line components detection consists of four steps, including edge detection for each connected component, pixel distance normalization in the edge, dominant-point detection in the edge and removal of horizontal components. Second, vertical line components detection is composed of removal of diagonal components and extraction of vertical line components. Third, the outline estimation step is composed of the left and right boundary line detection. Finally, distortion of the text image is corrected by bilinear transformation based on the estimated outline. We compared the changes in recognition rates of OCR before and after applying the proposed algorithm. The recognition rate of the distortion corrected signboard images is 29.63% and 21.9% higher at the character and the text unit than those of the original images.