• Title/Summary/Keyword: LSF parameters

Search Result 16, Processing Time 0.026 seconds

The Revised Transform Algorithm from LSF to LPC (LSF에서 LPC 계수를 구하는 개선된 알고리즘)

  • Kim, Hyang-Jin;Lee, Ki-Tae;Ham, Young-Hee;Kim, Hyoung-Jun;Lim, Jae-Yun
    • Proceedings of the IEEK Conference
    • /
    • 1999.06a
    • /
    • pp.679-682
    • /
    • 1999
  • This paper proposes the LSF or LSP that is the method of using to transfer the speech parameters after processed the speech to LPC, which is digital coding transferring efficiently, for the best quality and the lowest bit rate of parameters. The new revised transform algorithm between LSF and LPC coefficients is proposed. The proposed algorithm eliminates all multiplications, computes fewer operations, and reduces memory buffer sizes.

  • PDF

Quantization of LPC Coefficients Using a Multi-frame AR-model (Multi-frame AR model을 이용한 LPC 계수 양자화)

  • Jung, Won-Jin;Kim, Moo-Young
    • The Journal of the Acoustical Society of Korea
    • /
    • v.31 no.2
    • /
    • pp.93-99
    • /
    • 2012
  • For speech coding, a vocal tract is modeled using Linear Predictive Coding (LPC) coefficients. The LPC coefficients are typically transformed to Line Spectral Frequency (LSF) parameters which are advantageous for linear interpolation and quantization. If multidimensional LSF data are quantized directly using Vector-Quantization (VQ), high rate-distortion performance can be obtained by fully utilizing intra-frame correlation. In practice, since this direct VQ system cannot be used due to high computational complexity and memory requirement, Split VQ (SVQ) is used where a multidimensional vector is split into multilple sub-vectors for quantization. The LSF parameters also have high inter-frame correlation, and thus Predictive SVQ (PSVQ) is utilized. PSVQ provides better rate-distortion performance than SVQ. In this paper, to implement the optimal predictors in PSVQ for voice storage devices, we propose Multi-Frame AR-model based SVQ (MF-AR-SVQ) that considers the inter-frame correlations with multiple previous frames. Compared with conventional PSVQ, the proposed MF-AR-SVQ provides 1 bit gain in terms of spectral distortion without significant increase in complexity and memory requirement.

Block Constrained Trellis Coded Vector Quantization of LSF Parameters for Wideband Speech Codecs

  • Park, Jung-Eun;Kang, Sang-Won
    • ETRI Journal
    • /
    • v.30 no.5
    • /
    • pp.738-740
    • /
    • 2008
  • In this paper, block constrained trellis coded vector quantization (BC-TCVQ) is presented for quantizing the line spectrum frequency parameters of the wideband speech codec. Both a predictive structure and a safety-net concept are combined into BC-TCVQ to develop the predictive BC-TCVQ. The performance of this quantization is compared with that of the linear predictive coding vector quantizer used in the AMRWB codec, demonstrating reductions in spectral distortion.

  • PDF

A Study on the Relation Between the LSF's and Spectral Distribution of Speech Signals (Line Spectral Frequency와 음성신호의 주파수 분포에 관한 연구)

  • 이동수;김영화
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.25 no.4
    • /
    • pp.430-436
    • /
    • 1988
  • LSF(Line Spectral Frequency) derived from LPC has known as a very useful transmission parameter of speech signals, for it has a good linear interpolation characteristics and a low spectrum distortion at low bit rates coding. This paper presents that it is possible to extract directly the formant frequencies of speech signals from LSF parameter without application of FFT algorithm by comparing the distribution of LSF parameter with the frequency distribution of analysis filter. This paper suggests the advanced algorithm that results in improving the speed of convergence at analytic solution method. Also, for the flexibility of parameters, the process that transforms from LSF to LPC is presented.

  • PDF

Perceptual and Adaptive Quantization of Line Spectral Frequency Parameters (선 스펙트럼 주파수의 청각 적응 부호화)

  • 한우진;김은경;오영환
    • The Journal of the Acoustical Society of Korea
    • /
    • v.19 no.8
    • /
    • pp.68-77
    • /
    • 2000
  • Line special frequency (LSF) parameters have been widely used in low bit-rate speech coding due to their efficiency for representing the short-time speech spectrum. In this paper, a new distance measure based on the masking properties of human ear is proposed for quantizing LSF parameters whereas most conventional quantization methods are based on the weighted Euclidean distance measure. The proposed method derives the perceptual distance measure from the definition of noise-to-mask ratio (NMR) which has high correspondence with the actual distortion received in the human ear and uses it for quantizing LSF parameters. In addition, we propose an adaptive bit allocation scheme, which allocates minimal bits to LSF parameters maintaining the perceptual transparency of given speech frame for reducing the average bit-rates. For the performance evaluation, we has shown the ratio of perceptually transparent frames and the corresponding average bit-rates for the conventional and proposed methods. By jointly combining the proposed distance measure and adaptive bit allocation scheme, the proposed system requires only 770 bps for obtaining 95.5% perceptually transparent frames, while the conventional systems produce 89.9% at even 1800 bps.

  • PDF

Low-Delay LSF FEC Technique Robust in Lossy VoIP Environment (VoIP 손실 환경에 강인한 저지연 LSF FEC 기법)

  • Yang, Hae-Yong;Lee, Kyung-Hoon;Hwang, In-Ho
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.39 no.6
    • /
    • pp.687-695
    • /
    • 2002
  • Media-specific FEC techniques, suggested to confront with VoIP speech packet loss, improve speech quality at the expense of generating additional one-frame delay. In this paper, we suggest new media-specific FEC, i.e, LSF FEC technique which is able to improve speech quality with much shortened additional delay. In the proposed technique, the LSF parameters of the future frame are utilized to recover a lost packet. To evaluate performance of the proposed technique, we use ITU-T G.723.1 and G.729 Codec and apply Gilbert packet loss model and estimate MOS per every packet loss rate using PESQ speech quality estimation algorithm. The proposed technique has effect of shortening delay over from 6.5ms to 27ms compared with existing media-specific FEC techniques. Simulation results for comparison of reconstructed speech quality show this novel technique improves the MOS over 0.1 in practical lossy environment of 5 % packet loss rate.

Performance Improvement of the QCELP using an Efficient LSF Coding (효율적인 LSF 양자화기를 이용한 QCELP 성능개선)

  • Kim, Hae-Jin;Kang, Sang-Won
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.1
    • /
    • pp.10-15
    • /
    • 1997
  • In this paper, an efficient LSF quantizer, named improved PSVQ(IPSVQ), is proposed to apply in the 8 kbps QCELP speech coder. By using 27 bits IPSVQ instead of 40 bits DPCM quantizer per frame, we can save 13 bits/frame and allocate those bits to the codebook gain and the pitch gain parameters. Hence we improve the overall performance of the QCELP codec. The enhanced QCELP shows the performance improvement of 0.9 dB SNR and 0.4 dB SEGSNR. Informal listening tests also confirm the improvement in the speech quality.

  • PDF

Changes in Breast-tumor Blood Flow in Response to Hypercapnia during Chemotherapy with Laser Speckle Flowmetry

  • Kim, Hoonsup;Lee, Youngjoo;Lee, Songhyun;Kim, Jae Gwan
    • Current Optics and Photonics
    • /
    • v.3 no.6
    • /
    • pp.555-565
    • /
    • 2019
  • Development of a biomarker for predicting tumor-treatment efficacy is a matter of great concern, to reduce time, medical expense, and effort in oncology therapy. In a preclinical study, we hypothesized that the blood-flow parameter based on laser speckle flowmetry (LSF) could be a potential indicator to estimate the efficacy of breast-cancer treatment. To verify this hypothesis, a 13762-MAT-B-III rat breast tumor was grown in a dorsal skinfold window chamber applied to a nude mouse, and the change in blood flow rate (BFR) - or the speckle flow index (SFI) is used together as the same meaning in this manuscript - was longitudinally monitored during tumor growth and metronomic cyclophosphamide treatment. Based on the daily LSF angiogram, several BFR parameters (baseline SFI, normalized SFI, and △rBFR) were compared to tumor size in the normal, treated, and untreated tumor groups. Despite the incomplete tumor treatment, we found that the daily changes in all BFR parameters tended to have partially positive correlation with tumor size. Moreover, we observed that the changes in baseline SFI and normalized SFI responded one day earlier than the tumor shrinkage during chemotherapy. However, daily variations in the hypercapnia-induced △rBFR lagged tumor shrinkage by one day. This study would contribute not only to evaluating tumor vascular response to treatment, but also to monitoring blood-flow-mediated diseases (in brain, skin, and retina) by using LSF in preclinical settings.

Designing a Quantizer of LPC Parameters for the Narrowband Speech Coder using Block-Constrained Trellis Coded Quantization (블록 제한 트렐리스 부호화 양자화 기법을 이용한 협대역 음성 부호화기용 LPC 계수 양자화기 설계)

  • Jun, Ja-Kyoung;Park, Sang-Kuk;Kang, Sang-Won
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.32 no.3C
    • /
    • pp.234-240
    • /
    • 2007
  • In this paper, low complexity block constrained trellis coded quantization (BC-TCQ) structures are introduced, and a predictive BC TCQ encoding method is developed for quantization of line spectrum frequencies (LSF) parameters for narrowband speech coding applications. Trellis-coded quantization(TCQ) is a form of VQ that builds the VQ codebook from interleaved constituent scalar quantization codebooks. The performance is compared to the other VQ, demonstrating reduction in spectral distortion and significant reduction in encoding complexity. The predictive BC-TCQ is about 0.47107 dB superior to the IS-641 split-VQ, 26bits/frame, in spectral distortion sense. The BC-TCQ is 64.54%, 76.93%, 2.35% of the IS-641 split-VQ, respectively, in the complexity of the additions, multiplies, comparisons.

A Line Spectrum Frequency Pairs Representation for Spectral Envelop Quantization

  • Park, Youngho;Lee, Won-Cheol;Bae, Myung-Jin
    • Proceedings of the IEEK Conference
    • /
    • 2000.09a
    • /
    • pp.787-790
    • /
    • 2000
  • This paper introduces a new type of representation of the LSPs as a promising alternative used for transmitting the LPC parameters. Major contribution in this paper is that the vocal track information embedded on the spectral envelope can be represented in terms of the reduced number of LSF compared tn the conventional. Hence, it provides a possibility that LPC parameters could be quantized at a reduced bit rate without causing any major spectral distortion. The simulation result illustrates the capability of the proposed LSPs representation as an efficient quantization method via a proper rejection of the redundant pairs of pole and zero along the unit circle.

  • PDF