통합 검색 | Korea Science

음성 파형코딩의 음원피치 변경에 관한 연구 - LPC와 주기반분법에 의한 피치변경법 - (On Altering the Pitch of Speech Signals in Waveform Coding -(Altering Method by the LPC and the Pitch Halving)-)

민경중
- 한국음향학회:학술대회논문집
- /
- 한국음향학회 1991년도 학술발표회 논문집
- /
- pp.45-49
- /
- 1991
In area of the speech synthesis, the waveform coding with high quality are mainly used to the synthesis by analysis. However, it is difficult to applying the waveform coding to the synthesis by rule, because the parameters of this coding are not classified as either excitation parameters and vocal tract parameters. In this paper, we proposed a new pitch change method that can alter the pitch periods in the waveform coding. The proposed method expands the pitch period by the LPC synthesis method, and then the period is compressed by the waveform halving technique. Thus, it is possible that the waveform coding is carried out the synthesis by rule in speech processing.
PDF

포만트 분석/합성 시스템 구현 (Implementation of Formant Speech Analysis/Synthesis System)

이준우;손일권;배건성
- 음성과학
- /
- 제1권
- /
- pp.295-314
- /
- 1997
In this study, we will implement a flexible formant analysis and synthesis system. In the analysis part, the two-channel (i.e., speech & EGG signals) approach is investigated for accurate estimation of formant information. The EGG signal is used for extracting exact pitch information that is needed for the pitch synchronous LPC analysis and closed phase LPC analysis. In the synthesis part, Klatt formant synthesizer is modified so that the user can change synthesis parameters arbitarily. Experimental results demonstrate the superiority of the two-channel analysis method over the one-channel(speech signal only) method in analysis as well as in synthesis. The implemented system is expected to be very helpful for studing the effects of synthesis parameters on the quality of synthetic speech and for the development of Korean text-to-speech(TTS) system with the formant synthesis method.
PDF

음성인식 후처리에서 음소 유사율을 이용한 오류보정에 관한 연구 (A Study on Error Correction Using Phoneme Similarity in Post-Processing of Speech Recognition)

한동조;최기호
- 한국ITS학회 논문지
- /
- 제6권3호
- /
- pp.77-86
- /
- 2007
최근 텔레매틱스 단말기 등과 같이 음성인식을 인터페이스로 하는 음성기반 검색시스템들이 많이 개발되고 있다. 그러나 음성인식에는 여전히 많은 오류가 존재하며, 이에 오류보정에 대한 여러 가지 연구가 진행되고 있다. 본 논문에서는 한국어의 음소가 갖는 특징을 기반으로 음성인식 후처리에서의 오류보정을 제안하였다. 이를 위해 한국어 음소의 특징을 고려한 음소 유사율을 사용하였다. 음소 유사율은 훈련데이터를 모노폰으로 훈련시켜 한국어 음소 각각에 대하여 MFCC와 LPC 특징추출방법을 사용하여 특징추출을 수행하고, 바타차랴 거리 측정법을 사용하여 각 음소 사이의 유사율을 구하였다. 음소 유사율과 신뢰도를 이용하여 오류보정률을 구하였으며, 이를 사용하여 음성인식 과정에서 오류로 판명된 어절에 대하여 오류보정을 수행하고, 음절 복원과 형태소 분석을 재수행하는 과정을 거쳤다. 실험 결과 MFCC와 LPC 각각 7.5%와 5.3%의 인식 향상률을 보였다.
PDF

태평소의 음향분석을 통한 팔랑 특성 추출 (Extraction of Characteristics Corresponding to Bell of Taepyeongso Based on Acoustical Analysis)

변중배;조상진;홍연우;정의필
- 한국음향학회지
- /
- 제27권1호
- /
- pp.12-17
- /
- 2008
태평소는 고려 말경 원나라로부터 소개된 이후로 대취타, 풍물놀이, 범패, 종묘제례악, 시나위 등에 널리 쓰여 왔고 최근들어 대중가요에 사용되며 비교적 쉽게 연주할 수 있어 일반인들에게 주목받고 있다. 본 연구는 물리적 모델링을 이용하여 태평소를 전자화 하기위한 일환으로 태평소를 분석한다. 이를 위해 율명에 따른 분석을 통해 태평소의 공명 특성을 추출하였고, 팔랑, 관대, 조롱목에 대하여 FFT 및 LPC곡선을 이용하여 분석하였다. 그 결과 팔랑은 관대와 팔랑 사이의 반사필터와 2극점 필터로 표현할 수 있었다.
https://doi.org/10.7776/ASK.2008.27.1.012 인용 PDF KSCI

Performance Analysis, Real Time Simulation and Control of Medium-Scale Commercial Aircraft Turbofan Engine

Kong, Chang-Duk;Jayoung Ki;Chung, Suk-Chou
- Journal of Mechanical Science and Technology
- /
- 제15권6호
- /
- pp.776-787
- /
- 2001
The turbofan engine performance analysis for a medium scale commercial aircraft was carried out and the LQR control scheme for performance optimization was studied. By using scaled component maps from well-known CF6 engine characteristics, the steady-state performance analysis result was compared with BR715-56 engine performance data. The transient performance analysis was performed with four fuel schedules. The linear simulation was done at the maximum take-off condition. The real time linear simulation was performed by interpolation of the system matrices, which used the least square method as the function of LPC rotational speed. By using linear system matrices of design point, the LQR controller which used control variables for the fuel flow and the LPC bleed air was designed.
PDF

Split Model Speech Analysis Techniques for Wideband Speech Signal

Park YoungHo;Ham MyungKyu;You KwangBock;Bae MyungJin
- 한국음향학회:학술대회논문집
- /
- 한국음향학회 1999년도 학술발표대회 논문집 제18권 1호
- /
- pp.20-23
- /
- 1999
In this paper, The Split Model Analysis Algorithm, which can generate the wideband speech signal from the spectral information of narrowband signal, is developed. The Split Model Analysis Algorithm deals with the separation of the $10^{th}$ order LPC model into five cascade-connected $2^{nd}$ order model. The use of the less complex $2^{nd}$ order models allows for the exclusion of the complicated nonlinear relationships between model parameters and all the poles of the LPC model. The relationships between the model parameters and its corresponding analog poles is proved and applied to each $2^{nd}$ order model. The wideband speech signal is obtained by changing only the sampling rate
PDF

Split Model Speech Analysis Techniques for Speech Signal Enhancement

Park, Young-Ho;You, Kwang-Bock;Bae, Myung-Jin
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 1999년도 추계종합학술대회 논문집
- /
- pp.1135-1138
- /
- 1999
In this paper, The Split Model Analysis Algorithm, which can generate the wideband speech signal from the spectral information of narrowband signal, is developed. The Split Model Analysis Algorithm deals with the separation of the 10$\^$th/ order LPC model into five cascade-connected 2$\^$nd/ order model. The use of the less complex 2$\^$nd/ order models allows for the exclusion of the complicated nonlinear relationships between model parameters and all the poles of the LPC model. The relationships between the model parameters and its corresponding analog poles is proved and applied to each 2$\^$nd/ order model. The wideband speech signal is obtained by changing only the sampling rate.
PDF

스펙트럼 형태 불변 실시간 음성 변환 시스템 (Spectral Shape Invariant Real-time Voice Change System)

김원구
- 한국지능시스템학회논문지
- /
- 제15권1호
- /
- pp.48-52
- /
- 2005
본 논문에서는 음성의 스펙트럼 형태는 유지하면서 음성을 기계적인 음성으로 변환시키기는 실시간 음성 변환 방법을 제안하였다. 이러한 목적을 위하여 LPC 분석 및 합성 방법을 사용하여 변환된 음성의 스펙트럼은 유지하였고 합성된 음성의 피치는 자유롭게 변경되도록 하였다. 제안된 방법에서는 변환된 음성이 보다 자연스럽게 들리게 하기 위하여 여기 신호 발생기에 이득 정합 방법을 적용하였다. 제안된 방법의 성능을 평가하기 위하여 음성 변환 실험을 수행하였다. 실험 결과에서 원 음성 신호는 원 화자의 신원을 알기가 어려운 기계적인 음성 신호로 바뀌는 것을 알 수 있었고 피치의 심한 변화에도 변환된 음성의 의미는 정확히 전달될 수 있었다. 제안된 시스템은 시스템의 실시간으로 구현될 수 있는지 확인하기 위하여 TI TMS320C6711DSK 보드를 사용하여 구현되었다.
https://doi.org/10.5391/JKIIS.2005.15.1.048 인용 PDF KSCI

디지털 이동통신을 위한 음성 부호기의 성능 분석 (A Performance Analysis of the Speech Coders for Digital Mobile Radio)

정영모;이상욱
- 대한전자공학회논문지
- /
- 제27권4호
- /
- pp.491-501
- /
- 1990
Recently, four speech coding techniques, namely, SBC-APCM(sub-band coding adaptive PCM), RPE-LPC(regualr pulse excitation linear predictive codec), MPE-LTP(multi-pulse excited long-term prediction) and CELP (code-excited linear prediction) are proposed for digital mobile radio applications. However, a performance comparison of these coders in the Rayleigh fading environment has not been made yet. In this paper, the performances of the four spech coders in the random bit error and burst error environment are investigated. For the channel coding of SBC-APCM, RPE-LPC and MPE-LTP, the sensitivity of output bit stream is measured and a bit selective forward error correction is provided acording to the measured bit sensitivity. And for an attempt to improve the performance of CELP, an optimum quantizer is applied for transmitting scalar quantities in CELP. However, an improvement over the conventional approach is found to be negligible. For the channel coding of CELP, Reed-Solomon code, Golay code, convolutional code of rate 1/2 shows the best performance. Finally, from the simulation results, it is concluded that CELP is the best candidate for digital mobile radio and is followed by MPE-LTP, SBC-APCM and RPE-LPC.
PDF

LPC를 이용한 평안방언의 음향지표에 관한 연구 (A Study for Acoustic Cues of Pyoung-An Do Dialect Using LPC)

송철규;이명호;김영배
- 대한의용생체공학회:의공학회지
- /
- 제13권3호
- /
- pp.195-200
- /
- 1992
This paper deal with the acoustic cues of Pyoung-An Do dialect using linear prediction. Also, this paper descrbes a statistical comparison between standard tone speech data and Pyoung-An Do dia lects. The analysis done mainly focused on the distribution of formants and pitch periods accord to ac- cents variation. For the purpose of objective comparison, the experiments are performed by extracts for- mant LPC spectrum and pithch periods from average magnitude difference function waveforms. Summing up the results, In disyllable words (VCV pattern) , prepositioned vowels have longer phona lion time than postpositioned vowels and the intrin, iii phonation time is whore longer in the low vowels than in the high ones. The africative consonants show the mixed characteristics of the plosive and frlc ative consonants. The remarkable acoustic cues are the low frequency noise-like waves just before the 1st formants in the plosive consonants, the high frequency noise-like waves in the fricative consonants, and phonation time is not affected by the kinds of prepositioned or postpositioned vowels.
PDF

검색결과 95건 처리시간 0.022초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)