• Title/Summary/Keyword: spectral distortion

Search Result 236, Processing Time 0.022 seconds

Design of the Vector-Scalar Quantizer of LSP Parameters for Wideband Speech Coder (광대역 음성부호화기를 위한 백터-스칼라 LSP 파라미터 양자화기 설계)

  • 신재현;이인성;지덕구;윤병식;최송인
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.40 no.4
    • /
    • pp.286-291
    • /
    • 2003
  • In this Paper, we designed an LSP(Line Spectral Pairs) parameter quantizer with cascaded structure of vector quantizer and scalar quantizer for the wideband speech coder. We have chosen the 16th-order of the LP coefficients. These coefficients are then transformed into the LSP parameters which have the excellent properties for quantization and easy stability checking condition of synthesis filter. In the first stage of quantization, input LSP parameters are split-vector-quantized using two 8-th order codebooks. In the second stage, the components of residual vector are individually quantized by the scalar quantizer utilizing the ordering property of LSP parameters. The designed adaptive VQ-SQ quantizer using 35 bits/frame shows the wideband transparency that the average spectral distortion should be less than 1.6 ㏈ and less than 4% of the frames should have SD above 3 ㏈. The simulation results show that the designed quantizer provides a 2-3 bits/frame saving over the typical vector-scalar quantizer.

Method of Harmonic Magnitude Quantization for Harmonic Coder Using the Straight Line and DCT (Discrete Cosine Transform) (하모닉 코더를 위한 직선과 이산코사인변환 (DCT)을 이용한 하모닉 크기값 (Magnitude) 양자화 기법)

  • Choi, Ji-Wook;Jeong, Gyu-Hyeok;Lee, In-Sung
    • The Journal of the Acoustical Society of Korea
    • /
    • v.27 no.4
    • /
    • pp.200-206
    • /
    • 2008
  • This paper presents a method of quantization to extract quantization parameters using the straight-line and DCT (Discrete Cosine Transform) for two splited frequency bands. As the number of harmonic is variable frame to frame, harmonics in low frequency band is oversampled to fix the dimension and straight-lines present a spectral envelope, then the discontinuous points of straight-lines in low frequency is sent to quantizer. Thus, extraction of quantization parameters using the straight-line provides a fixed dimension. Harmonics in high frequency use variable DCT to obtain quantization parameters and this paper proposes a method of quantization combining the straight-line with DCT. The measurement (If proposed method of quantization uses spectral distortion (SD) for spectral magnitudes. As a result, The proposed method of quantization improved 0.3dB in term of SD better than HVXC.

Noisy Speech Recognition Based on Spectral Mapping Techniques (스펙트럼사상기법을 기초로 한 잡음음성인식)

  • Lee, Ki-Young
    • The Journal of the Acoustical Society of Korea
    • /
    • v.14 no.1E
    • /
    • pp.39-45
    • /
    • 1995
  • This paper presents noisy speech recognition method based on spectral mapping techniques of speaker adaptation method. In the presented method, the spectral mapping training makes the spectral distortion of noisy speech reduced, and for the more correctively spectral mapping, let the adjustment window;s slope be adaptive to several word lengths. As a result of recognition experiment, the recognition rate is higher than that of the conventional method using VQ and DTW without noise processing. Even when SNR level is 0 dB, the recognition rate is 10 times more than that using the conventional method. It is confirmed that the speacker adaptation technique using the spectral mapping training has an ability to improve the recognition performance for noisy speech.

  • PDF

Analyzing Preprocessing for Correcting Lighting Effects in Hyperspectral Images (초분광영상의 조명효과 보정 전처리기법 분석)

  • Yeong-Sun Song
    • Journal of the Korean Society of Industry Convergence
    • /
    • v.26 no.5
    • /
    • pp.785-792
    • /
    • 2023
  • Because hyperspectral imaging provides detailed spectral information across a broad range of wavelengths, it can be utilized in numerous applications, including environmental monitoring, food quality inspection, medical diagnosis, material identification, art authentication, and crime scene analysis. However, hyperspectral images often contain various types of distortions due to the environmental conditions during image acquisition, which necessitates the proper removal of these distortions through a data preprocessing process. In this study, a preprocessing method was investigated to effectively correct the distortion caused by artificial light sources used in indoor hyperspectral imaging. For this purpose, a halogen-tungsten artificial light source was installed indoors, and hyperspectral images were acquired. The acquired images were then corrected for distortion using a preprocessing that does not require complex auxiliary equipment. After the corrections were made, the results were analyzed. According to the analysis, a statistical transformation technique using mean and standard deviation with reference to a reference signal was found to be the most effective in correcting distortions caused by artificial light sources.

SNR-based Weight Control for the Spatially Preprocessed Speech Distortion Weighted Multi-channel Wiener Filtering (공간 필터와 결합된 음성 왜곡 가중 다채널 위너 필터에서의 신호 대 잡음 비에 의한 가중치 결정 방법)

  • Kim, Gibak
    • Journal of Broadcast Engineering
    • /
    • v.18 no.3
    • /
    • pp.455-462
    • /
    • 2013
  • This paper introduces the Spatially Preprocessed Speech Distortion Weighted Multi-channel Wiener Filter (SP-SDW-MWF) for multi-microphone noise reduction and proposes a method to determine the speech distortion weights. The SP-SDW-MWF is known as a robust noise reduction algorithm against the error caused by the mismatch in microphones. The SP-SDW-MWF adopts weights which determine the amount of noise reduction at the expense of introducing speech distortion in the noise-suppressed speech. In this paper, we use the error of power spectral density between the estimated signal and the desired signal as the evaluation measure. Thus the a priori SNR is used to control the speech distortion weights in the frequency domain. In the experimental results, the proposed method yields better result in terms of MFCC distortion compared to the conventional method.

Korean Digit Recognition Under Noise Environment Using Spectral Mapping Training (스펙트럼사상학습을 이용한 잡음환경에서의 한국어숫자음인식)

  • Lee, Ki-Young
    • The Journal of the Acoustical Society of Korea
    • /
    • v.13 no.3
    • /
    • pp.25-32
    • /
    • 1994
  • This paper presents the Korean digit recognition method under noise environment using the spectral mapping training based on static supervised adaptation algorithm. In the presented recognition method, as a result of spectral mapping from one space of noisy speech spectrum to another space of speech spectrum without noise, spectral distortion of noisy speech is improved, and the recognition rate is higher than that of the conventional method using VQ (vector quatization) and DTW(dynamic time warping) without noise processing, and even when SNR level is 0dB, the recognition rate is 10 times of that using the conventional method. It has been confirmed that the spectral mapping training has an ability to improve the recognition performance for speech in noise environment.

  • PDF

Destripe Hyperspectral Images with Spectral-spatial Adaptive Unidirectional Variation and Sparse Representation

  • Zhou, Dabiao;Wang, Dejiang;Huo, Lijun;Jia, Ping
    • Journal of the Optical Society of Korea
    • /
    • v.20 no.6
    • /
    • pp.752-761
    • /
    • 2016
  • Hyperspectral images are often contaminated with stripe noise, which severely degrades the imaging quality and the precision of the subsequent processing. In this paper, a variational model is proposed by employing spectral-spatial adaptive unidirectional variation and a sparse representation. Unlike traditional methods, we exploit the spectral correction and remove stripes in different bands and different regions adaptively, instead of selecting parameters band by band. The regularization strength adapts to the spectrally varying stripe intensities and the spatially varying texture information. Spectral correlation is exploited via dictionary learning in the sparse representation framework to prevent spectral distortion. Moreover, the minimization problem, which contains two unsmooth and inseparable $l_1$-norm terms, is optimized by the split Bregman approach. Experimental results, on datasets from several imaging systems, demonstrate that the proposed method can remove stripe noise effectively and adaptively, as well as preserve original detail information.

A Study on a Improvement of the Speech Quality by Spectrum Analysis with Variable Window in CELP Vocoder (가변 윈도우 스펙트럼 분석을 이용한 CELP 부호화기의 음질 향상에 관한 연구)

  • 나덕수;민소연;배명진
    • Proceedings of the IEEK Conference
    • /
    • 2000.06d
    • /
    • pp.106-109
    • /
    • 2000
  • There have been proposed two types of low bit rate vocoder upto now : One is MBE type using the spectrum modeling and another is CELP type using the hybrid coding method. CELP type vocoder has mainly studied between them. Specially, much of intensity is concentrated in CELP vocoder due to the emergence of Internet Phone and PCS in a domestic. In order to improve the speech quality in CELP vocoder, in this paper, we proposed a new spectrum analysis algorithm with variable window, In CELP vocoder, the spectrum of the synthesised speech signal is distorted because the fixed size windows is used for spectrum analysis. So we have measured the spectral leakage and in order to minimize the spectral leakage have adjusted the window size. Applying this method G.723.1 ACELP, we can get SD(Spectral Distortion) reduction 0.084(dB), residual energy reduction 6.3% and MOS(Mean Opinion Score) improvement 0.1.

  • PDF

A Line Spectrum Frequency Pairs Representation for Spectral Envelop Quantization

  • Park, Youngho;Lee, Won-Cheol;Bae, Myung-Jin
    • Proceedings of the IEEK Conference
    • /
    • 2000.09a
    • /
    • pp.787-790
    • /
    • 2000
  • This paper introduces a new type of representation of the LSPs as a promising alternative used for transmitting the LPC parameters. Major contribution in this paper is that the vocal track information embedded on the spectral envelope can be represented in terms of the reduced number of LSF compared tn the conventional. Hence, it provides a possibility that LPC parameters could be quantized at a reduced bit rate without causing any major spectral distortion. The simulation result illustrates the capability of the proposed LSPs representation as an efficient quantization method via a proper rejection of the redundant pairs of pole and zero along the unit circle.

  • PDF

Harmonics-based Spectral Subtraction and Feature Vector Normalization for Robust Speech Recognition

  • Beh, Joung-Hoon;Lee, Heung-Kyu;Kwon, Oh-Il;Ko, Han-Seok
    • Speech Sciences
    • /
    • v.11 no.1
    • /
    • pp.7-20
    • /
    • 2004
  • In this paper, we propose a two-step noise compensation algorithm in feature extraction for achieving robust speech recognition. The proposed method frees us from requiring a priori information on noisy environments and is simple to implement. First, in frequency domain, the Harmonics-based Spectral Subtraction (HSS) is applied so that it reduces the additive background noise and makes the shape of harmonics in speech spectrum more pronounced. We then apply a judiciously weighted variance Feature Vector Normalization (FVN) to compensate for both the channel distortion and additive noise. The weighted variance FVN compensates for the variance mismatch in both the speech and the non-speech regions respectively. Representative performance evaluation using Aurora 2 database shows that the proposed method yields 27.18% relative improvement in accuracy under a multi-noise training task and 57.94% relative improvement under a clean training task.

  • PDF