• Title/Summary/Keyword: pitch period

Search Result 188, Processing Time 0.026 seconds

A Study on the Pitch Detection of Speech Harmonics by the Peak-Fitting (음성 하모닉스 스펙트럼의 피크-피팅을 이용한 피치검출에 관한 연구)

  • Kim, Jong-Kuk;Jo, Wang-Rae;Bae, Myung-Jin
    • Speech Sciences
    • /
    • v.10 no.2
    • /
    • pp.85-95
    • /
    • 2003
  • In speech signal processing, it is very important to detect the pitch exactly in speech recognition, synthesis and analysis. If we exactly pitch detect in speech signal, in the analysis, we can use the pitch to obtain properly the vocal tract parameter. It can be used to easily change or to maintain the naturalness and intelligibility of quality in speech synthesis and to eliminate the personality for speaker-independence in speech recognition. In this paper, we proposed a new pitch detection algorithm. First, positive center clipping is process by using the incline of speech in order to emphasize pitch period with a glottal component of removed vocal tract characteristic in time domain. And rough formant envelope is computed through peak-fitting spectrum of original speech signal infrequence domain. Using the roughed formant envelope, obtain the smoothed formant envelope through calculate the linear interpolation. As well get the flattened harmonics waveform with the algebra difference between spectrum of original speech signal and smoothed formant envelope. Inverse fast fourier transform (IFFT) compute this flattened harmonics. After all, we obtain Residual signal which is removed vocal tract element. The performance was compared with LPC and Cepstrum, ACF. Owing to this algorithm, we have obtained the pitch information improved the accuracy of pitch detection and gross error rate is reduced in voice speech region and in transition region of changing the phoneme.

  • PDF

A Study on Improving Voice Quality and Pitch Searching of the VSELP Coder (VSELP 부호화기의 음질 및 주기탐색 개선에 관한 연구)

  • 성기철;문상재
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.19 no.4
    • /
    • pp.740-749
    • /
    • 1994
  • This paper presents method for improving the performance of the VSELP speech coder. The hybrid method is employed for pitch period searching. Pitch searching time is reduced and pitch detection error, caused by quantization error of excitation signal of encoder in VSELP coder, is reduced by this method. This paper also adopts a pitch period enhancement filter and an adaptive first order filter. In this result, pitch period searching time is reduced to 26%, and MOS of reconstructed speech signal is increased by 3.19 to 4.04.

  • PDF

A Study on Speech Period and Pitch Detection for Continuous Speech Recognition (연속음성인식을 위한 음성구간과 피치검출에 관한 연구)

  • Kim Tai Suk;Chang jong chil
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.1
    • /
    • pp.56-61
    • /
    • 2005
  • In this thesis, propose speech period and pitch detection for continuous speech recognition. This mathod is distinguishes between vowel and consonant to frame unit in continuous speech, for distinguishable voice. Powerful extraction of speech period could threshold energy make use of input signal to real noise environment. Also algorithm of this method distinguish between vowel and consonant at the same time in voice make use of zero crossing rate and short time energy to extractible speech period.

  • PDF

Robust Backward Adaptive Pitch Prediction for Tree Coding (트리 코팅에서 전송에러에 강한 역방향 적응 피치 예측)

  • 이인성
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.19 no.8
    • /
    • pp.1587-1594
    • /
    • 1994
  • The pitch predictor is one of the most important part for the robust tree coder. The hybrid backward pitch adapation which is a combination of a block adaptation and a recursive adaptation is used for the pitch predictor. In order to improve the error performance and track the pitch period change of the input speech, it is proposed to smooth the input of the pitch predictor. The smoother with three taps can have fixed coefficients or variable coefficients depending on the estimated autocorrelation function of the output of the pitch synthesizer. The inclusion of a variable smoother can track the pitch period change within a block and reduce the effect of channel errors.

  • PDF

Segmentation of the Korean speech signals into phonetic units using the super resolution pitch determination (고해상 피치검출을 이용한 한국어 음성신호의 음소분리)

  • 이응구;이두수
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.18 no.2
    • /
    • pp.270-278
    • /
    • 1993
  • This paper is presented the phonetic segmentation alg9rithm of the Korean speech signals which is finded the exact pitch using the super resoluton pitch determination and is compared corss-correlation to threshold each pitch period. The features of the proposed algorithm are infinite resolution and high reliability, and also can separate transient or silent segment. The algorithm is instrumental to speech processing applications which require vector quantization and speech recognition. The presented algorithm is implemented by 386-MATLAB on PC 386/DX and is verified the exact pitch period and the phonetic segmentation of speech signals.

  • PDF

On Altering the Pitch of Speech Signals in Waveform Coding -(Altering Method by the LPC and the Pitch Halving)- (음성 파형코딩의 음원피치 변경에 관한 연구 - LPC와 주기반분법에 의한 피치변경법 -)

  • 민경중
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1991.06a
    • /
    • pp.45-49
    • /
    • 1991
  • In area of the speech synthesis, the waveform coding with high quality are mainly used to the synthesis by analysis. However, it is difficult to applying the waveform coding to the synthesis by rule, because the parameters of this coding are not classified as either excitation parameters and vocal tract parameters. In this paper, we proposed a new pitch change method that can alter the pitch periods in the waveform coding. The proposed method expands the pitch period by the LPC synthesis method, and then the period is compressed by the waveform halving technique. Thus, it is possible that the waveform coding is carried out the synthesis by rule in speech processing.

  • PDF

A Stable Pitch ]Determination via Dyadic Wavelet Transform (DyWT) (Dyadic Wavelet Transform 방식의 Pitch 주기결정)

  • Kim Namhoon;Yoon Gibum;Ko Hanseok
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.197-200
    • /
    • 2000
  • This paper presents a time-based Pitch Determination Algorithm (PDA) for reliable estimation of pitch Period (PP) in speech signal. In proposed method, we use the Dyadic Wavelet Transform (DyWT), which detects the presence of Glottal Closure Instants (GCI) and uses the information to determine the pitch period. And, the proposed method also uses the periodicity property of DyWT to detect unsteady GCI. To evaluate the performance of the proposed methods, that of other PDAs based on DyWT are compared with what this paper proposed. The effectiveness of the proposed method is tested with real speech signals containing a transition between voiced and the unvoiced interval where the energy of voiced signal is unsteady. The result shows that the proposed method provides a good performance in estimating the both the unsteady GCI positions as well as the steady parts.

  • PDF

A Study on Longitudinal Phugoid Mode Affected by Application of Nonlinear Control Laws

  • Kim, Chong-Sup;Hur, Gi-Bong;Kim, Seung-Jun
    • International Journal of Aeronautical and Space Sciences
    • /
    • v.8 no.1
    • /
    • pp.21-31
    • /
    • 2007
  • Relaxed Static Stability (RSS) concept has been applied to improve aerodynamic performance of modern version supersonic jet fighter aircraft. The T-50 advanced supersonic trainer employs the RSS concept in order to improve the aerodynamic performance. And the flight control system stabilizes the unstable aircraft and provides adequate handling qualities. The T-50 longitudinal control laws employ a proportional-plus-integral type controller based on a dynamic inversion method. The longitudinal dynamic modes consist of short period with high frequency and phugoid mode with low frequency. The design goal of longitudinal control law is optimization of short period damping ratio and frequency using Lower Order Equivalent System (LOES) complying the requirement of MIL-F-8785C. This paper addresses phugoid mode characteristics such as damping ratio and natural frequency that is affected by the nonlinear control laws such as angle of attack limiter, auto pitch attitude command system and autopilot of pitch attitude hold.

On a Study of the Reduction of Bit Rate by the Preprocessing of PSOLA Coding Technique in the G. 723.1 Vocoder (PSOLA 전처리과정을 이용한 G.723.1 보코더의 전송률 감소에 관한 연구)

  • 장경아;조성현;배명진
    • Proceedings of the IEEK Conference
    • /
    • 2002.06d
    • /
    • pp.401-404
    • /
    • 2002
  • In general, speech coding methods are classified into the following three categories: the waveform coding, the source coding and the hybrid coding. In this paper, First, the reference waveform is detected after searching the pitch period by NAMDF similarity and similarity between the reference waveform and the waveform each pitch period. It made a decision whether the waveform is compressed with the threshold of similarity. If the waveform is compressed only magnitude and pitch information is transmitted into the input of G.723.1 vocoder. Performing through the G.723.1 vocoder, the waveform is restored with the magnitude and pitch information by PSOLA synthesis method. The result of simulation with proposed algorithm has a 31% reduction of bit rate than the standard 5.3kbps G.723.1 ACELP vocoder.

  • PDF

A High Speed Pitch Extraction Method Based on Peak Detection and AMDF (Peak 검출과 AMDF에 의한 고속도 음성주기 추출방법)

  • 성원용;은종관
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.17 no.4
    • /
    • pp.38-44
    • /
    • 1980
  • We present a high speed pitch estimation algorithm that is based on peak detection and average magnitude difference function (AMDF). A few pitch candidates are first estimated from the low-pass filtered (800 Hz) speech by a peak detection algorithm. AMDF values of the pitch candidatestare then calculated, and the pitch candidate that yields the minimum AMDF value is chosen as the desired pitch period. The new method requires far less computation time than other pitch estimation algorithms, while it yields fairly accurate results.

  • PDF