• Title/Summary/Keyword: pitch alteration

Search Result 23, Processing Time 0.024 seconds

On a Pitch Alteration Method by Time-axis Scaling Compensated with the Spectrum for High Quality Speech Synthesis (고음질 합성용 스펙트럼 보상된 시간축조절 피치 변경법)

  • Bae, Myung-Jin;Lee, Won-Cheol;Im, Sung-Bin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.14 no.4
    • /
    • pp.89-95
    • /
    • 1995
  • The waveform coding technique has concerned with simply preserving the waveform shape of speech signal through a redundancy reduction process. In the case of speech synthesis, the waveform coding with high sound quality is mainly used to the synthesis by analysis. However, since the parameters of this coding are not classified into either excitation or vocal tract parameters, it is difficult to applying the waveform coding to the synthesis by rule. In order to apply the waveform coding to the synthesis by rule, the pitch alteration technique is required in prosody control. In this paper, we propose a new pitch alteration method that can change the pitch period in waveform coding by scaling the time-axis and compensating the spectrum. This is relevant to the time-frequency domain method were the phase components of the waveform is preserved with a little spectrum distortion of 2.5 % and less for 50% pitch change.

  • PDF

On a Cepstral Pitch Alteration Technique for Prosody Control in the Speech Synthesis System with High Quality

  • Kim, Kyu-Hong;Baek, Seong-Joon;Bae, Myung-Jin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.18 no.1E
    • /
    • pp.32-36
    • /
    • 1999
  • In the area of the speech synthesis techniques, the waveform coding methods maintain the intelligibility and naturalness of synthetic speech. In order to apply the waveform coding techniques to synthesis by rule, we must be able to alter the pitches of synthetic speech. In this paper, we propose a new pitch altering method that compensates phase distortion of the cepstral pitch alteration method with time scaling method in the time domain. This method can remove some spectrum distortion which is occurred in conjunction point between the waveforms. For performance test the spectrum distortion rate was used as objective criterion and the MOS(Mean Opinion Score) was used as subjective criterion. As a result, the spectrum distortion and MOS are obtained by 0.66% and 3.9, respectively.

  • PDF

On a Pitch Alteration Method using Scaling the Harmonics Compensated with the Phase for Speech Synthesis (위상 보상된 고조파 스케일링에 의한 음성합성용 피치변경법)

  • Bae, Myung-Jin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.13 no.6
    • /
    • pp.91-97
    • /
    • 1994
  • In speech processing, the waveform codings are concerned with simply preserving the waveform of signal through a redundancy reduction process. In the case of speech synthesis, the waveform codings with high quality are mainly used to the synthesis by analysis. Because the parameters of this coding are not classified as both excitation and vocal tract, it is difficult to apply the waveform coding to the synthesis by rule. Thus, in order to apply the waveform coding to synthesis by rule, it is necessary to alter the pitches. In this paper, we proposed a new pitch alteration method that can change the pitch period in waveform coding by dividing the speech signals into the vocal tract and excitation parameters. This method is a time-frequency domain method preserving the phase component of the waveform in time domain and the magnitude component in frequency domain. Thus, it is possible that the waveform coding is carried out the synthesis by rule in speech processing. In case of using the algorithm, we can obtain spectrum distortion with $2.94\%$. That is, the spectrum distortion is decreased more $5.06\%$ than that of the pitch alteration method in time domain.

  • PDF

A Study on Real Time Pitch Alteration of Speech Signal (음성신호의 실시간 피치변경에 관한 연구)

  • 김종국;박형빈;배명진
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.1
    • /
    • pp.82-89
    • /
    • 2004
  • This paper describes how to reduce the effect of an occupation threshold by that the transform of mixture components of HMM parameters is controlled in hierarchical tree structure to prevent from over-adaptation. To reduce correlations between data elements and to remove elements with less variance, we employ PCA (principal component analysis) and ICA (independent component analysis) that would give as good a representation as possible, and decline the effect of over-adaptation. When we set lower occupation threshold and increase the number of transformation function, ordinary WLLR adaptation algorithm represents lower recognition rate than SI models, whereas the proposed MLLR adaptation algorithm represents the improvement of over 2% for the word recognition rate as compared to performance of SI models.

An Amplitude Warping Approach to Intra-Speaker Normalization for Speech Recognition (음성인식에서 화자 내 정규화를 위한 진폭 변경 방법)

  • Kim Dong-Hyun;Hong Kwang-Seok
    • Journal of Internet Computing and Services
    • /
    • v.4 no.3
    • /
    • pp.9-14
    • /
    • 2003
  • The method of vocal tract normalization is a successful method for improving the accuracy of inter-speaker normalization. In this paper, we present an intra-speaker warping factor estimation based on pitch alteration utterance. The feature space distributions of untransformed speech from the pitch alteration utterance of intra-speaker would vary due to the acoustic differences of speech produced by glottis and vocal tract. The variation of utterance is two types: frequency and amplitude variation. The vocal tract normalization is frequency normalization among inter-speaker normalization methods. Therefore, we have to consider amplitude variation, and it may be possible to determine the amplitude warping factor by calculating the inverse ratio of input to reference pitch. k, the recognition results, the error rate is reduced from 0.4% to 2.3% for digit and word decoding.

  • PDF

On Altering the Pitch of Speech Signals in Waveform Coding -Alteration Method by the LPC and the Pitch Halving- (음성 파형코딩 음원피치 변경에 관한 연구 -LPC와 주기반분법에 의한 피치변경법-)

  • 배명진;윤희상;안수길
    • The Journal of the Acoustical Society of Korea
    • /
    • v.10 no.5
    • /
    • pp.11-19
    • /
    • 1991
  • 음성 신호의 합성기법들 중에서 파형코딩법은 음질이 우수하기 때문에 분석에 의한 합성법으로 많이 사용하고 있다. 그렇지만 음원과 성도의특성을 분리하지 않고 파형의 잉여분만을 제거한 후에 파 형자체를 저장하기 때문에 규칙에 의한 합성기법으로 사용하기에는 어려움이 많다. 본 논문은 파형코딩 법 중 선형 PCM 코딩법으로 저장된 음성파형에 대해 피치를 양분할 수 있는 주기반분법을 제안하여 파형자체의 음원을 분리하지 않고 피치 주기를 변경시킬 수 있는 새로운 피치 변경법을 제안하였다. 따 라서 음질이 우수한 파형코딩 합성법으로 규칙에 의한 합성을 수행할 수 있다.

  • PDF

On the Pitch Alteration Methods for a High Quality Speech Synthesis (고음질 합성을 위한 피치변경법)

  • 배명진
    • The Journal of the Acoustical Society of Korea
    • /
    • v.12 no.2
    • /
    • pp.66-77
    • /
    • 1993
  • 고음질 합성을 위해서는 파형부호화법이 바람직하다. 파형부호화법을 규칙에 의한 음성합성기법에 적용하기 위해서는 메모리용량의 문제와 피치변경법이 해결되어져야 한다.메모리 용량의 문제는 최근 반도체 기술에 의해 극복되어 졌으며 이제는 음원피치변경의 문제가 남아있다. 따라서 본 논문에서는 성도 포먼트의 특성은 변화시키지 않고, 음원피치를 변경시키는 문제에 대해 정리하였다. 먼저 기존의 제안된 몇가지 기법들의 장단점들을 열거한 다음에 우리 연구실에서 제안했던 방법들에 대해 논의하고자 한다.

  • PDF

On a Pitch Alteration Technique in the V/UV Spectrum for High Quality Speech Synthesis Technique (고음질 합성방식용 V/UV 스펙트럼상의 피치변경법에 관한 연구)

  • Jo, Wang-Rae;Bae, Myung-Jin;Kim, Dong-Sung
    • The Journal of the Acoustical Society of Korea
    • /
    • v.15 no.6
    • /
    • pp.99-103
    • /
    • 1996
  • Most waveform coding techniques attempt to reduce redundancy of speech signal while preserving the shape of the waveform. In speech synthesis, wavefrom coding methods are used to the synthesis by rule for high quality speech. However, it is difficult to apply the waveform coding to the synthesis by rule because the parameters of the wavefrom coding cannot be classified as either the excitation or the vocal tract parameters. The proposed method shows little spectrum distortion of 2.7% or less for 50% pitch changes. It also achieves smooth connection of wavefrom magnitudes among the frames by compensating the phase in time domain.

  • PDF

Analytical Study on Equivalent Shear Modulus according to Shape of Egg-box Core (에그-박스 코어 형상 변화에 따른 등가 전단 탄성계수 수치 해석 연구)

  • Lee, SangYoun;Yun, Su-Jin;Park, DongChang;Hwang, Kiyoung
    • Journal of the Korean Society of Propulsion Engineers
    • /
    • v.18 no.2
    • /
    • pp.73-79
    • /
    • 2014
  • The sandwich shell with Egg-box core has been used for the combustion chamber case of air breathing propulsion system. The alteration on pitch length and thickness of Egg-box core was required to be lighter and save manufacturing time and cost of combustion chamber case. In this paper, the finite element analysis method which simulated bending test was used to predict the equivalent shear modulus which affect structural stability of sandwich shell in short time. The result of FE calculation on sandwich panel with homogeneous material, H130-foam core, showed a good agreement with the values available in the reference. The equivalent shear modulus of Egg-box core according to the variation of pitch length and thickness can be obtained.

A Study on the Pitch Alteration Technique by Sub-band Linear Approximation in Spectrum (서브밴드 선형근사에 의한 피치변경법에 관한 연구)

  • 김영규;김봉영;배명진
    • Proceedings of the IEEK Conference
    • /
    • 2003.07e
    • /
    • pp.2423-2426
    • /
    • 2003
  • 음성합성은 합성방식에 따라 파형부호화법, 신호원부호화법, 혼성부호화법으로 분류할 수 있다. 특히 고음질 합성을 위해서는 파형부호화를 이용한 합성방식이 적합하다 하지만 파형부호화를 이용한 합성법은 여기 성분과 여파기 성분을 분리하지 않고 처리하기 때문에 음절단위나 음소단위의 합성기법으로는 바람직하지 못하다. 따라서 파형부호화법을 규칙에 의한 합성에 적용되도록 음원피치를 변경시키기 위한 피치 변경법이 필요하게 된다. 본 논문에서는 스펙트럼 왜곡을 최소화하기 위해 서브 선형근사에 의하여 스펙트럼 평탄화 시킨 후 스펙트럼 스케일링을 이용하여 피치를 변경하는 방법에 대하여 제안하였다. 기존 방법인 LPC법, Cepstrum법과 비교하여 어느 정도의 우수성을 보이는지 평가하였고 평가방법은 각각의 평탄화 된 신호의 분산을 구하여 평탄화의 정도를 측정하였다. 이때 평탄화 된 신호는 최고점이 영이 되도록 정규화 시키고 평균이 영인 분산을 계산하였다. 제안한 방법의 성능을 평가하기 위해 스펙트럼 왜곡율을 측정하여 본 결과 평균 스펙트럼 왜곡율은 평균 2.12% 이하로 유지되었으며 실험결과 제안한 방법이 기존의 방법보다 우수함을 보여주었다.

  • PDF