Search | Korea Science

Flattening Techniques for Pitch Detection (피치 검출을 위한 스펙트럼 평탄화 기법)

김종국;조왕래;배명진
- Proceedings of the IEEK Conference
- /
- 2002.06d
- /
- pp.381-384
- /
- 2002
In speech signal processing, it Is very important to detect the pitch exactly in speech recognition, synthesis and analysis. but, it is very difficult to pitch detection from speech signal because of formant and transition amplitude affect. therefore, in this paper, we proposed a pitch detection using the spectrum flattening techniques. Spectrum flattening is to eliminate the formant and transition amplitude affect. In time domain, positive center clipping is process in order to emphasize pitch period with a glottal component of removed vocal tract characteristic. And rough formant envelope is computed through peak-fitting spectrum of original speech signal in frequency domain. As a results, well get the flattened harmonics waveform with the algebra difference between spectrum of original speech signal and smoothed formant envelope. After all, we obtain residual signal which is removed vocal tract element The performance was compared with LPC and Cepstrum, ACF 0wing to this algorithm, we have obtained the pitch information improved the accuracy of pitch detection and gross error rate is reduced in voice speech region and in transition region of changing the phoneme.
PDF

An Acoustic Analysis of Vowels for Severe-profound Hearing Impaired Children (최고도이상의 청력손실을 가진 아동의 모음음형대 분석)

Huh, Myung-Jin
- Speech Sciences
- /
- v.14 no.2
- /
- pp.65-71
- /
- 2007
The severe-profound hearing impaired children have various disorders in everday communication due to the lack of hearing feedback. Especially, their speech produced unstable voice, omission and distortion of articulation, pitch break, cul-de-sac voice, and so on so that they were difficult to accurately deliver an intended message. This study attempts to analyze the acoustic characteristics of 4 vowel sounds produced by 35 severe-profound hearing impaired children using CSL(Computerized Speech Lab, Model 4300b). The formant data were obtained from the spectrogram and analyzed data by 12 formant filter and auto-correlation among the formants. Results showed that the hearing impaired children's formant values came out very high. They produced the vowels at the mode of hypertension with unstable voice. In order to improve their speech, they would need some adequate auditory feedback.
PDF

NOISE ROBUST FORMANT FREQUENCY ESTIMATION BASED ON COMPLEX AUTOCORRELATION FUNCTION

Diankha, Ousmane;Shimamura, Tetsuya
- Proceedings of the IEEK Conference
- /
- 2002.07c
- /
- pp.1799-1802
- /
- 2002
This paper proposes an improved method for formant frequencies estimation based on the complex autocorrelation function of the speech signal. Instead of using the incoming signal as an input fur the LPC analysis, the analytic signal of the autocorrelation function of the speech signal is computed and itself used as an input for the LPC analysis. Due to the properties of the analytic signal, which occupies half of the bandwidth of the original signal, the required model order for the LPC analysis is halved. The accuracy of the proposed method in noisy environments is examined on five natural vowels. The effectiveness of the proposed method is shown by the estimated spectral shapes and the estimation errors of the formant frequencies.
PDF

Nursing and Suckling Behaviour in Domestic Pigs 1. Characteristics of the Grunting Sound of the Sow(Landrace $\times$ Yorkshire) during Nursing Behaviour (돼지의 수.포유 행동 I. 수유 행동에서 모돈(랜드레이스$\times$요크셔) 발성음의 특성)

장홍희;연성찬
- Journal of Veterinary Clinics
- /
- v.19 no.2
- /
- pp.191-194
- /
- 2002
The nursing vocalization of domestic pigs(Landrace$\times$Yorkshire) was investigated with respect to common features. All vocalizations uttered during nursings in 5 sows at 5 days after farrowing were recorded and 305 grunts were processed in a spectrograph. The sow's repeated grunting during nursing can be regarded as a contact call and a signal of the mother to start and synchronize the suckling behavior of the piglets. Analysis in the time domain revealed the gross structure of the call, whereas in the frequency domain the fine structure of single grunts was investigated. Nursing interval, duration of nursing behavior, duration of grunt, grunt rate per 10 seconds, fundamental frequency, 1 formant, 2 formant, 3 formant, 4 formant and spectrum were investigated. The results showed that mean interval between the nursing following one another was 25, 4.6 min and duration of nursing behavior was 3.2 $\pm$ 0.7 min. Average duration of grunt was 203.9 $\pm$ 63.6 ms. The formant contours could be identified. The nursing behavior might be disturbed by the grunts of alien sow.
PDF KSCI

Long Term Average Spectral Analysis for Acoustical Discrimination of Korean Nasal Consonants (한국어 비음의 음향학적 구분을 위한 장구간 스펙트럼(LTAS) 분석)

Choi, Soon-Ai;Seong, Cheol-Jae
- MALSORI
- /
- no.60
- /
- pp.67-84
- /
- 2006
The purpose of this study is to find some acoustic parameters on frequency domain to distinguish the Korean nasals, $/m,\;n,\;{\eta}/$ from each other. The new parameters are devised on the basis of LTAS (Long Term Average Spectrum). The maximum peak amplitude and the relevant formant frequency are measured in low and high frequency range, respectively. The frequency of spectral valley and its energy level are also obtained in the specific frequency range of the spectrum. Spectral slope, total energy value in specific frequency range, statistical distribution of spectral energy like centroid, skewness, and kurtosis are suggested as new parameters as well. The parameters that show statistically significant differences across nasals are summerized as follows. 1) in syllable initial positions: the total energy value from 1,500 to 2,200 Hz(zeroENG); 2) in syllable final positions: the peak amplitude of the first formant(peak1_a), the formant frequency with maximum peak amplitude from 4,000 to 8,000 Hz(peak2_f), the maximum peak amplitude of the formant frequency from 4,000 to 8,000 Hz(peak2_a), and the total energy value from 1,500 to 2,200 Hz(zeroENG).
PDF

Measurement of Cardiac Function Improvement by Auricular Acupuncture Applying Speech Signal Analysis (음성신호 분석을 적용한 이침요법(耳針療法)에 따른 심장 기능 향상 측정)

Kim, Bong-Hyun;Cho, Dong-Uk;Han, Kil-Sung
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.12 no.12
- /
- pp.5588-5593
- /
- 2011
In this paper, measure of change the speech analysis parameter by stimulating ears blood points corresponding to cardiac. To do this, we collected voice of before and after a stimulation corresponding points to ears to select normal heart having 10 subjects. We analyzed changes before and after corresponding points to ear in cardiac to apply Jitter, the second zFormant Frequency Bandwidths related to heart of elements of voice analysis. As a result of us experiment, we were able to analyze correlation of voice with cardiac according to corresponding points to ears decreased values of Jitter, the Second Formant Frequency Bandwidths of 90% of subjects. Finally, the effectiveness of proposed method is demonstrated by several experiments.
https://doi.org/10.5762/KAIS.2011.12.12.5588 인용 PDF KSCI

The Study of Tonsil Affected Voice Quality after Tonsillectomy (편도적출술로 음성변화가 올 수 있는 편도 상태에 관한 연구)

안철민;정덕희
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
- /
- v.9 no.1
- /
- pp.32-37
- /
- 1998
Tonsillectomy is the one of operation that is performed the most commonly in otolaryngology field. Many changes that include range of voice, tone, voice quality and resonance were made by tonsillectomy. Sometimes, any patients taken tonsillectomy has suffer from these voice problem after tonsillectomy. However there are less study for these problems until now. Then, we studied to find the anatomical findings that affected the voice quality when tonsillectomy was performed. We evaluated the voice in 2 groups, one is the group showed the normal pharyngeal space by using the transnasal fiberscopy, the other is group showed medially bulging tonsil at pharyngeal cavity by using same method, with perceptual evaluation, nasalance score, nasality, oral formant and nasal formant. We used the computerized speech analysis system, the nasometer and the spectrogram in the CSL program. We could not find any differences in perceptual evaluation between two groups. But objective measures were provided. Nasalance score and nasality on the nasometric analysis were increased significantly and oral formant on the spectrogram was changed singnificantly after tonsillectomy in Group 2. Authors thought medially bulging tonsil in the pharynx is able to affect the voice quality after tonsillectomy when we evaluted through the nasal cavity by the using of fiberscopy and this evaluation would be important especially in singers.
PDF

An Analysis of the Vowel Formants of the Young Males in the Buckeye Corpus (벅아이 코퍼스에서의 젊은 성인 남성의 모음 포먼트 분석)

Yoon, Kyu-Chul;Noh, Hye-Uk
- Phonetics and Speech Sciences
- /
- v.4 no.2
- /
- pp.41-49
- /
- 2012
The purpose of this paper is to extract the vowel formants of the ten young male speakers from the Buckeye Corpus of Conversational Speech [1] and to analyze them in comparison to earlier works in terms of various phonetic factors that are expected to affect the realization of the formant distribution. The first two formant frequency values were automatically extracted with a Praat script along with such factors as the place of articulation, the content versus function word information, syllabic stress information, the location in a word, location in utterance, speech rate of three consecutive words, and the word frequency in the corpus. The results indicated that the formant patterns from the corpus were very different from those of earlier works although the overall pattern was similar and that the factors were strongly responsible for the realization of the two formants. The purpose of this paper is to extract the vowel formants of the ten young male speakers from the Buckeye Corpus of Conversational Speech [1] and to analyze them in comparison to earlier works in terms of various phonetic factors that are expected to affect the realization of the formant distribution. The first two formant frequency values were automatically extracted with a Praat script along with such factors as the place of articulation, the content versus function word information, the syllabic stress information, the location in a word, the location in an utterance, the speech rate of the three consecutive words, and the word frequency in the corpus. The result indicated that the formant patterns from the corpus were very different from those of earlier works although the overall pattern was similar and that the factors were strongly responsible for the realization of the two formants.
https://doi.org/10.13064/KSSS.2012.4.2.041 인용 PDF

Correlation Analysis of Between Paranasal Sinuses and Formant Frequency According to External Stimulation (외부 자극에 따른 부비동과 포먼트주파수와의 상관성 분석)

Kim, Bong-Hyun
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.17 no.8
- /
- pp.1955-1961
- /
- 2013
Paranasal sinuses of the empty space is filled with air that exists in the bones in the face. However, the pus becomes inflamed paranasal sinuses sinusitis onset brings the voice of change, and complained of headaches and lethargy. Therefore, in this paper, paranasal sinuses related diseases to predict voice analysis parameter as measured by changes in paranasal sinuses through external stimuli is investigated and carried out a study to analysis the function consisting of the frontal sinus, ethmoid sinus, maxillary sinus, sphenoid sinus. From this, cold pack stimulation in the paranasal sinus area for stimulation before and after voice was performed by measuring formant frequency and external stimuli through correlation analysis of the mutual impact on paranasal sinuses were analyzed.
https://doi.org/10.6109/jkiice.2013.17.8.1955 인용 PDF KSCI

Performance Evaluation of Cochlear Implants Speech Processing Strategy Using Neural Spike Train Decoding (Neural Spike Train Decoding에 기반한 인공와우 어음처리방식 성능평가)

Kim, Doo-Hee;Kim, Jin-Ho;Kim, Kyung-Hwan
- Journal of Biomedical Engineering Research
- /
- v.28 no.2
- /
- pp.271-279
- /
- 2007
We suggest a novel method for the evaluation of cochlear implant (CI) speech processing strategy based on neural spike train decoding. From formant trajectories of input speech and auditory nerve responses responding to the electrical pulse trains generated from a specific CI speech processing strategy, optimal linear decoding filter was obtained, and used to estimate formant trajectory of incoming speech. Performance of a specific strategy is evaluated by comparing true and estimated formant trajectories. We compared a newly-developed strategy rooted from a closer mimicking of auditory periphery using nonlinear time-varying filter, with a conventional linear-filter-based strategy. It was shown that the formant trajectories could be estimated more exactly in the case of the nonlinear time-varying strategy. The superiority was more prominent when background noise level is high, and the spectral characteristic of the background noise was close to that of speech signals. This confirms the superiority observed from other evaluation methods, such as acoustic simulation and spectral analysis.
https://doi.org/10.9718/JBER.2007.28.2.271 인용 PDF KSCI

Search Result 191, Processing Time 0.033 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)