• Title/Summary/Keyword: Residual vector quantization

Search Result 14, Processing Time 0.022 seconds

Multispectral image data compression using classified vector quantization (영역분류 벡터 양자화를 이용한 다중분광 화상데이타 압축)

  • 김영춘;반성원;김중곤;서용수;이건일
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.33B no.8
    • /
    • pp.42-49
    • /
    • 1996
  • In this paper, we propose a satellite multispectral image data compression method using classified vector quantization. This method classifies each pixel vector considering band characteristics of multispectral images. For each class, we perform both intraband and interband vector quantization to romove spatial and spectral redundancy, respectively. And residual vector quantization for error images is performed to reduce error of interband vector quantization. Thus, this method improves compression efficiency because of removing both intraband(spatial) and interband (spectral) redundancy in multispectral images, effectively. Experiments on landsat TM multispectral image show that compression efficiency of proposed method is better than that of conventional method.

  • PDF

Encoding of Speech Spectral Parameters Using Adaptive Vector-Scalar Quantization Methods for Mobile Communication Systems

  • Lee, In-Sung;Kim, Jong-Hark
    • The Journal of the Acoustical Society of Korea
    • /
    • v.17 no.4E
    • /
    • pp.35-40
    • /
    • 1998
  • In this paper, an efficient quantization method of line spectrum pairs(LSP) with cascaded structure of vector quantizer and scalar quantizer is proposed. First, input LSP parameters is vector-quantized using a codebook a with a moderate number of entries. In the second stage of quantization, the components of residual vector are individually quantized by the scalar quantizer. The utilization of ordering property of LSP parameters and the inclusion of interframe prediction improve the quantizer performance and remove the stability check routine after quantization procedure. The new vector-scalar hybrid quantizer using 26 bits/frame shows a transparent quality of speech that an average spectral distortion is 1 dB and the frame proportion with above 2 dB spectral distortion is less than 2%. The performances of proposed quantization method is evaluated in the transmission errors.

  • PDF

A study on the application of residual vector quantization for vector quantized-variational autoencoder-based foley sound generation model (벡터 양자화 변분 오토인코더 기반의 폴리 음향 생성 모델을 위한 잔여 벡터 양자화 적용 연구)

  • Seokjin Lee
    • The Journal of the Acoustical Society of Korea
    • /
    • v.43 no.2
    • /
    • pp.243-252
    • /
    • 2024
  • Among the Foley sound generation models that have recently begun to be studied, a sound generation technique using the Vector Quantized-Variational AutoEncoder (VQ-VAE) structure and generation model such as Pixelsnail are one of the important research subjects. On the other hand, in the field of deep learning-based acoustic signal compression, residual vector quantization technology is reported to be more suitable than the conventional VQ-VAE structure. Therefore, in this paper, we aim to study whether residual vector quantization technology can be effectively applied to the Foley sound generation. In order to tackle the problem, this paper applies the residual vector quantization technique to the conventional VQ-VAE-based Foley sound generation model, and in particular, derives a model that is compatible with the existing models such as Pixelsnail and does not increase computational resource consumption. In order to evaluate the model, an experiment was conducted using DCASE2023 Task7 data. The results show that the proposed model enhances about 0.3 of the Fréchet audio distance. Unfortunately, the performance enhancement was limited, which is believed to be due to the decrease in the resolution of time-frequency domains in order to do not increase consumption of the computational resources.

Design of the Vector-Scalar Quantizer of LSP Parameters for Wideband Speech Coder (광대역 음성부호화기를 위한 백터-스칼라 LSP 파라미터 양자화기 설계)

  • 신재현;이인성;지덕구;윤병식;최송인
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.40 no.4
    • /
    • pp.286-291
    • /
    • 2003
  • In this Paper, we designed an LSP(Line Spectral Pairs) parameter quantizer with cascaded structure of vector quantizer and scalar quantizer for the wideband speech coder. We have chosen the 16th-order of the LP coefficients. These coefficients are then transformed into the LSP parameters which have the excellent properties for quantization and easy stability checking condition of synthesis filter. In the first stage of quantization, input LSP parameters are split-vector-quantized using two 8-th order codebooks. In the second stage, the components of residual vector are individually quantized by the scalar quantizer utilizing the ordering property of LSP parameters. The designed adaptive VQ-SQ quantizer using 35 bits/frame shows the wideband transparency that the average spectral distortion should be less than 1.6 ㏈ and less than 4% of the frames should have SD above 3 ㏈. The simulation results show that the designed quantizer provides a 2-3 bits/frame saving over the typical vector-scalar quantizer.

Efficient vector-scalar quantization of line spectrum parirs (LSP) (효율적인 벡터-스칼라 Line spectrum pairs(LSP) 양자화 방법)

  • 이인성;남승현
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.21 no.2
    • /
    • pp.333-339
    • /
    • 1996
  • In this paper, an effiicent quatization method of line spectrum pairs(LSP) with cascaded structure of vector quantizer and scalar quantizer is proposed. First, input LSP parameters is vector-quantized using a codebook with a moderate number of entries. In the second stage of quantization, the components of residual vector are individution improve the quantizer by the scalar quantizer. The utilization of ordering property and the inclusion of interframe prediction improve the quantizer performance and remove the stability check routine. The new vector-scalar cascaded quantizer using 27 bits/frame shows a transparent quality that an average specytural distortion is 1 dB and the frame proportion with above 2 dB spectral distion is less than 2%.

  • PDF

Finite-state projection vector quantization applied to mean-residual compression of images (평균-잔류신호 영상압축에 적용된 유한 상태 투영벡터양자화)

  • 김철우;이충웅
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.21 no.9
    • /
    • pp.2341-2348
    • /
    • 1996
  • This paper proposes an image compression algorithm that adopts projection scheme on mean-residual metod. Sub-blocks of an image are encoded using mean-residual method where mean value is predicted according to that of neighboring blocks. Projection scheme with 8 directions is applied to the compression of residual signals of blocks. Projection vectors are finite-state vector quantized according to the projection angle of nighboring blocks in order to exploit the correlation among them. Side information to represent the repetition of projection is run-length coded while the information for projection direction is compressed using entropy encoding. The proposed scheme apears to be better in PSNR performance when compared with conventional projection scheme as well as in subjective quality preserving the edges of images better than most tranform methods which usually require heavy computation load.

  • PDF

3-dimensional Mesh Model Coding Using Predictive Residual Vector Quantization (예측 잉여신호 벡터 양자화를 이용한 3차원 메시 모델 부호화)

  • 최진수;이명호;안치득
    • Journal of Broadcast Engineering
    • /
    • v.2 no.2
    • /
    • pp.136-145
    • /
    • 1997
  • As a 3D mesh model consists of a lot of vertices and polygons and each vertex position is represented by three 32 bit floating-point numbers in a 3D coordinate, the amount of data needed for representing the model is very excessive. Thus, in order to store and/or transmit the 3D model efficiently, a 3D model compression is necessarily required. In this paper, a 3D model compression method using PRVQ (predictive residual vector quantization) is proposed. Its underlying idea is based on the characteristics such as high correlation between the neighboring vertex positions and the vectorial property inherent to a vertex position. Experimental results show that the proposed method obtains higher compression ratio than that of the existing methods and has the advantage of being capable of transmitting the vertex position data progressively.

  • PDF

Bitrate Reduction in Vector Quantization System Using a Dynamic Index Mapping (동적 인텍스 매핑을 이용한 벡터 양자화 시스템에서의 비트율 감축)

  • 이승준;양경호;김철우;이충웅
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.32B no.8
    • /
    • pp.1091-1098
    • /
    • 1995
  • This paper proposes an efficient noiseless encoding method of vector quantization(VQ) index using a dynamic index mapping. Using high interblock correlation, the proposed index mapper transforms an index into a new one with lower entropy. In order to achieve good performance with low computational complexity, we adopt 'the sum of differences in pixel values on the block boundaries' as the cost function for index mapping. Simulation results show that the proposed scheme reduces the average bitrate by 40 - 50 % in ordinary VQ system for image compression. In addition, it is shown that the proposed index mapping method can be also applied to mean-residual VQ system, which allows the reduction of bitrate for VQ index by 20 - 30 %(10 - 20 % reduction in total bitrate). Since the proposed scheme is one for noiseless encoding of VQ index, it provides the same quality of the reconstructed image as the conventional VQ system.

  • PDF

Shuffled Discrete Sine Transform in Inter-Prediction Coding

  • Choi, Jun-woo;Kim, Nam-Uk;Lim, Sung-Chang;Kang, Jungwon;Kim, Hui Yong;Lee, Yung-Lyul
    • ETRI Journal
    • /
    • v.39 no.5
    • /
    • pp.672-682
    • /
    • 2017
  • Video compression exploits statistical, spatial, and temporal redundancy, as well as transform and quantization. In particular, the transform in a frequency domain plays a major role in energy compaction of spatial domain data into frequency domain data. The high efficient video coding standard uses the type-II discrete cosine transform (DCT-II) and type-VII discrete sine transform (DST-VII) to improve the coding efficiency of residual data. However, the DST-VII is applied only to the Intra $4{\times}4$ residual block because it yields relatively small gains in the larger block than in the $4{\times}4$ block. In this study, after rearranging the data of the residual block, we apply the DST-VII to the inter-residual block to achieve coding gain. The rearrangement of the residual block data is similar to the arrangement of the basis vector with a the lowest frequency component of the DST-VII. Experimental results show that the proposed method reduces the luma-chroma (Cb+Cr) BD rates by approximately 0.23% to 0.22%, 0.44% to 0.58%, and 0.46% to 0.65% for the random access, low delay B, and low delay P configurations, respectively.

Efficient Multispectral Image Compression Using Variable Block Size Vector Quantization (가변 블럭 벡터 양자화를 이용한 효율적인 다분광 화상 데이터 압축)

  • Ban, Seong-Won;Kim, Byeong-Ju;Seok, Jeong-Yeop;Gwon, Seong-Geun;Gwon, Gi-Gu;Kim, Yeong-Chun;Lee, Geon-Il
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.38 no.6
    • /
    • pp.703-711
    • /
    • 2001
  • In this paper, we propose efficient multispectral image compression using variable block size vector quantization (VQ). In wavelet domain, we perform the variable block size VQ to remove intraband redundancy for a reference band image that has the lowest spatial variance and the best correlation with other band. And in wavelet domain, we perform the classified interband prediction to remove interband redundancy for the remaining bands. Then error wavelet coefficients between original image and predicted image are residual variable block size vector quantized to reduce prediction error. Experiments on remotely sensed satellite image show that coding efficiency of the proposed method is better than that of the conventional method.

  • PDF