Search | Korea Science

The Correlation between Speech Intelligibility and Acoustic Measurements in Children with Speech Sound Disorders (말소리장애 아동의 말명료도와 음향학적 측정치 간 상관관계)

Kang, Eunyeong
- Journal of The Korean Society of Integrative Medicine / v.6 no.4 / pp.191-206 / 2018
Purpose: This study investigated the correlation between speech intelligibility and acoustic measurements of speech sounds produced by children with speech sound disorders and by children without any diagnosed speech sound disorder. Methods: A total of 60 children with and without speech sound disorders were the subjects of this study. Speech samples were obtained by having the subjects speak meaningful words. Acoustic measurements were analyzed on a spectrogram using the Multi-speech 3700 program. Speech intelligibility was determined according to a listener's perceptual judgment. Results: Children with speech sound disorders had significantly lower speech intelligibility than those without speech sound disorders. The intensity of the vowel /u/, the duration of the vowel /${\omega}$/, and the second formant of the vowel /${\omega}$/ differed significantly between the two groups. There was no difference in voice onset time between the groups. Acoustic measurements were correlated with speech intelligibility. Conclusion: The results of this study showed that the speech intelligibility of children with speech sound disorders was affected by intensity, word duration, and formant frequency. In clinical settings, results should be complemented with acoustic measurements in addition to the evaluation of speech intelligibility.
https://doi.org/10.15268/ksim.2018.6.4.191

A Study on the Relation Between the LSF's and Spectral Distribution of Speech Signals (Line Spectral Frequency와 음성신호의 주파수 분포에 관한 연구)

이동수;김영화
- Journal of the Korean Institute of Telematics and Electronics / v.25 no.4 / pp.430-436 / 1988
The LSF (Line Spectral Frequency) representation derived from LPC is known as a very useful transmission parameter for speech signals, because it has good linear interpolation characteristics and low spectral distortion in low-bit-rate coding. This paper shows that the formant frequencies of speech signals can be extracted directly from the LSF parameters, without applying the FFT algorithm, by comparing the distribution of the LSF parameters with the frequency distribution of the analysis filter. It also proposes an improved algorithm that speeds up convergence of the analytic solution method. In addition, for flexibility in the choice of parameters, a procedure for transforming LSF to LPC is presented.
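The LSF-to-LPC transformation mentioned in this abstract can be illustrated with the textbook sum/difference polynomial construction. The sketch below is not the paper's algorithm; the function names lpc_to_lsf and lsf_to_lpc are hypothetical, and it assumes a minimum-phase A(z) with LSFs given in radians.

```python
import numpy as np

def lpc_to_lsf(a):
    """Return sorted LSFs (radians) of A(z) = 1 + a1 z^-1 + ... + ap z^-p.

    Assumes A(z) is minimum phase, so the roots of P and Q lie on the unit circle.
    """
    a = np.asarray(a, dtype=float)
    ext = np.concatenate([a, [0.0]])
    p_poly = ext + ext[::-1]  # P(z) = A(z) + z^-(p+1) A(1/z), symmetric
    q_poly = ext - ext[::-1]  # Q(z) = A(z) - z^-(p+1) A(1/z), antisymmetric
    angles = np.angle(np.concatenate([np.roots(p_poly), np.roots(q_poly)]))
    # Keep one angle per conjugate pair and drop the trivial roots at z = +1 and z = -1.
    return np.sort(angles[(angles > 1e-6) & (angles < np.pi - 1e-6)])

def lsf_to_lpc(lsf):
    """Rebuild the LPC coefficients [1, a1, ..., ap] from LSFs (radians)."""
    lsf = np.sort(np.asarray(lsf, dtype=float))
    p_poly = np.array([1.0])
    q_poly = np.array([1.0])
    for i, w in enumerate(lsf):
        factor = np.array([1.0, -2.0 * np.cos(w), 1.0])  # conjugate root pair on the unit circle
        if i % 2 == 0:
            p_poly = np.convolve(p_poly, factor)  # 1st, 3rd, ... LSFs belong to P
        else:
            q_poly = np.convolve(q_poly, factor)  # 2nd, 4th, ... LSFs belong to Q
    if len(lsf) % 2 == 0:
        p_poly = np.convolve(p_poly, [1.0, 1.0])   # trivial root of P at z = -1
        q_poly = np.convolve(q_poly, [1.0, -1.0])  # trivial root of Q at z = +1
    else:
        q_poly = np.convolve(q_poly, [1.0, 0.0, -1.0])  # trivial roots of Q at z = +1 and z = -1
    a = 0.5 * (p_poly + q_poly)  # A(z) = (P(z) + Q(z)) / 2
    return a[:-1]                # the highest-order term cancels to zero
```

A round trip lsf_to_lpc(lpc_to_lsf(a)) should reproduce a stable coefficient vector a up to numerical error, which is a convenient sanity check before interpolating or quantizing the LSFs.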

CHARACTERISTICS OF COW'S VOICES IN TIME AND FREQUENCY DOMAINS FOR RECOGNITION

Ikeda, Y.;Ishii, Y.
- Proceedings of the Korean Society for Agricultural Machinery Conference / 2000.11b / pp.196-203 / 2000
On the assumption that the voices of cows are produced by a linear prediction filter, we characterized the cows' voices. The order of this filter is determined by examining the voice characteristics in both the time and frequency domains. The proposed order of the linear prediction filter for modeling the voice production of the cow is 15. The combination of two parameters (the fundamental frequency and the slope of the straight line regressed from the log-log spectra of the amplitude envelope) with a single coefficient of the linear prediction filter can differentiate the two cows.
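As a rough illustration of fitting a linear prediction filter of a given order (for example the order 15 proposed above), the following sketch uses the standard autocorrelation method with the Levinson-Durbin recursion. It is not the authors' procedure; the function name lpc_autocorr is hypothetical, and windowing and pre-emphasis are left out for brevity.

```python
import numpy as np

def lpc_autocorr(signal, order):
    """Estimate A(z) = 1 + a1 z^-1 + ... + ap z^-p by the autocorrelation method.

    Returns (a, err): the coefficient vector [1, a1, ..., ap] and the final
    prediction error energy. Assumes the signal is not all zeros.
    """
    x = np.asarray(signal, dtype=float)
    # Autocorrelation at lags 0..order.
    r = np.array([np.dot(x[:len(x) - k], x[k:]) for k in range(order + 1)])
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        # Reflection coefficient for stage i of the Levinson-Durbin recursion.
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err
        prev = a.copy()
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a, err
```

Calling lpc_autocorr(frame, 15) on a short voiced frame would give sixteen coefficients, from which a spectral envelope or individual coefficients can be compared between animals.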

A Study on Monitoring of Liver Function Based on Voice Signal Analysis for u-Health System (u-Health 시스템을 위한 음성신호 분석 기반의 간 기능 모니터링에 관한 연구)

Kim, Bong-Hyun;Cho, Dong-Uk
- The KIPS Transactions:PartB / v.18B no.6 / pp.389-396 / 2011
Various liver diseases are worsening in modern society owing to changes in eating habits, stress, alcohol, and other factors. We therefore propose a methodology for the early diagnosis of liver disease by studying how liver disease influences the voice. To this end, we collected voice samples from liver disease inpatients and from patients who had received treatment, and applied voice analysis parameters to them. In particular, we applied pronunciation element values and third formant frequency bandwidths to velar sounds, which oriental medicine associates with the liver, in order to derive objective indices of how liver disease influences the resonance cavity and vocalization. In addition, based on the experimental results, we studied the design of a system for monitoring liver function in a u-Health environment.
https://doi.org/10.3745/KIPSTB.2011.18B.6.389

A Proposal for Effect Analysis Techniques of Kidney Hand Acupuncture through Face Image and Voice Signal Measurement (얼굴 영상 및 음성신호 측정을 통한 신장 수지침 효과 분석 기법의 제안)

Kim, Bong-Hyun;Cho, Dong-Uk
- The Journal of Korean Institute of Communications and Information Sciences / v.37 no.3C / pp.217-223 / 2012
In this paper, we propose techniques for analyzing the effect of kidney-associated hand acupuncture stimulation by measuring changes in facial images and voice signals. To this end, we measured the color change of the JIGAK (jaw) area, which is associated with the kidney, in facial images, and measured voice signals before and after kidney-associated hand acupuncture stimulation. As voice analysis elements connected with the kidney, we measured changes in the first formant frequency bandwidth and in shimmer. Following stimulation, we measured reductions in the first formant frequency bandwidth, in shimmer, and in the black color of the JIGAK area. Finally, we aim to demonstrate the objective effect of kidney-associated hand acupuncture through statistical significance analysis of the facial image and voice signal measurements.
https://doi.org/10.7840/KICS.2012.37C.3.217

The Difference between Acoustic Characteristics of Acute Epiglottitis and Peritonsillar Abscess (급성 후두개염과 편도주위 농양 환자의 발화시 조음 및 음성의 차이)

Lee, Nam-Hoon;Lee, Jae-Yeon;Lee, Sang-Hyuck;Choi, Jung-Im;Song, Yun-Kyung;Jin, Sung-Min
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.21 no.1 / pp.48-53 / 2010
Background and Objectives: Voice changes can occur in acute epiglottitis or peritonsillar abscess, and in both conditions the change is labeled a "muffled voice" or "hot potato voice". The aim of this study was to investigate differences in the changes in acoustic features of the voice before and after treatment in patients with acute epiglottitis or peritonsillar abscess. Subjects and Method: Thirteen patients with acute epiglottitis and 12 patients with peritonsillar abscess were enrolled in the study. Acoustic analysis of the sustained Korean vowels /${\alpha}$/, /u/ and /i/ was performed before and after treatment. Results: In patients with acute epiglottitis, the first formant frequency (F1) of /${\alpha}$/ was increased, and the second formant frequency (F2) of /i/ was decreased. In patients with peritonsillar abscess, F1 and F2 of /${\alpha}$/ were decreased; F1 of /i/ and /u/ was increased, while F2 was decreased. Conclusion: The anatomical and functional changes of the oropharynx and larynx caused by acute epiglottitis and peritonsillar abscess can produce different changes in resonance and speech quality. We suggest that these changes could be the cause of the 'muffled voice' in patients with acute epiglottitis or peritonsillar abscess, but the different phonation characteristics of each disease should be distinguished.

Long Term Average Spectrum Characteristics of Speaking Voice of Western Operatic Singers (Long Term Average Spectrum을 이용한 성악가들의 Speaking Voice 분석)

Lee, Kyung-Chul;Hong, Seok-Jin;Jin, Sung-Min
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.15 no.2 / pp.122-127 / 2004
Background and Objectives: Many studies have described and analyzed the singer's formant, and it has been shown that the epilaryngeal tube in the human airway is responsible for vocal ring, or the singer's formant. A similar phenomenon produced by trained singers in their speech led some authors to examine the speaker's ring. This study was designed to analyze the speaking voice of singers and the speaker's ring. Materials and Methods: Ten tenors, fifteen baritones, fifteen sopranos and ten mezzo-sopranos attending the department of vocal music of a music college were chosen for this study. Fifteen male and fifteen female untrained normal speakers were chosen as the control group. Each subject was asked to produce a sustained spoken vowel /ah/ for at least five seconds and to read the sentence 'Kaeul'. The sound data were analyzed using the Fast Fourier Transform (FFT)-based power spectrum and the long term average (LTA) power spectrum, computed with the FFT algorithm of the Computerized Speech Lab (CSL, Kay Elemetrics, Model 4300B, USA). Statistical analysis was performed using the Mann-Whitney test in the Statistical Package for Social Sciences (SPSS). Results: For the LTA power spectrum of the /ah/ sound, a significant increase was seen in the 2,500-3,500 Hz region (p<0.01) in the four trained singer groups compared with the untrained speaker group, and a significant increase in the 9,000-10,000 Hz region (p<0.01) in the soprano group. Similarly, for the sentence 'Kaeul', there was a significant increase in energy in the 2,500-3,500 Hz region (p<0.01) in the tenor, baritone, and mezzo-soprano groups compared with the untrained speaker group, and a significant increase in all frequency regions (p<0.01) in the soprano group. Conclusions: The LTA power spectrum suggests that trained singers show greater energy concentration in the 'singer's formant' region in the speaking voice, and the authors believe this region to be the 'speaker's ring'. Further research is needed on the effect of singing training on the resonance of the speaking voice.
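A long term average spectrum of the kind described above can be approximated by averaging short-time power spectra over a recording and then summing the energy in a band such as 2,500-3,500 Hz. The sketch below is only an illustration under assumed settings (Hann window, 1024-sample frames, 50% overlap); it does not reproduce the CSL analysis, and the names ltas and band_level_db are hypothetical.

```python
import numpy as np

def ltas(signal, fs, frame_len=1024, hop=512):
    """Average the power spectra of Hann-windowed frames over the whole signal."""
    x = np.asarray(signal, dtype=float)
    n_frames = 1 + (len(x) - frame_len) // hop
    if n_frames < 1:
        raise ValueError("signal is shorter than one frame")
    window = np.hanning(frame_len)
    power = np.zeros(frame_len // 2 + 1)
    for i in range(n_frames):
        frame = x[i * hop:i * hop + frame_len] * window
        power += np.abs(np.fft.rfft(frame)) ** 2
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / fs)
    return freqs, power / n_frames

def band_level_db(freqs, power, f_lo, f_hi):
    """Total energy between f_lo and f_hi (Hz), expressed in dB."""
    mask = (freqs >= f_lo) & (freqs <= f_hi)
    return 10.0 * np.log10(np.sum(power[mask]) + 1e-12)
```

Comparing band_level_db(freqs, power, 2500, 3500) between a singer and a control speaker, ideally after normalizing overall level, is one simple way to quantify extra energy in the singer's formant region.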

A Comparison Study on the Speech Signal Parameters for Chinese Leaners' Korean Pronunciation Errors - Focused on Korean /ㄹ/ Sound (중국인 학습자의 한국어 발음 오류에 대한 음성 신호 파라미터들의 비교 연구 - 한국어의 /ㄹ/ 발음을 중심으로)

Lee, Kang-Hee;You, Kwang-Bock;Lim, Ha-Young
- Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology / v.7 no.6 / pp.239-246 / 2017
This paper compares speech signal parameters between Korean and Chinese speakers for the Korean sound /ㄹ/, which Chinese learners frequently mispronounce. The allophones of /ㄹ/ in Korean are divided into a lateral group and a tap group. The reasons for these errors have been investigated by studying the similarities and differences between Korean /ㄹ/ and its corresponding Chinese pronunciation. In this paper, from a phonological perspective, speech signal parameters such as signal energy, the waveform in the time domain, the spectrogram in the frequency domain, the pitch (F0) based on the autocorrelation function (ACF), and the formant frequencies (f1, f2, f3, and f4) are measured and compared. The data, a group of Korean words selected through a philological investigation, are used in the simulations. According to the simulation results for energy and the spectrogram, there are meaningful differences between native Korean speakers and Chinese learners in the pronunciation of Korean /ㄹ/. The simulation results also show some differences in the other parameters. Chinese learners could be expected to reduce their errors considerably by exploiting the parameters used in this paper.
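Pitch estimation based on the autocorrelation function, one of the parameters listed above, can be sketched as picking the lag with the largest autocorrelation inside a plausible pitch range. This is a minimal illustration, not the paper's implementation; the function name estimate_f0_acf and the 75-400 Hz search range are assumptions, and the frame is assumed to be voiced.

```python
import numpy as np

def estimate_f0_acf(frame, fs, f_min=75.0, f_max=400.0):
    """Estimate F0 (Hz) of a voiced frame from the peak of its autocorrelation."""
    x = np.asarray(frame, dtype=float)
    x = x - np.mean(x)                                    # remove DC offset
    acf = np.correlate(x, x, mode="full")[len(x) - 1:]    # lags 0 .. len(x)-1
    lag_min = int(fs / f_max)                             # shortest period considered
    lag_max = min(int(fs / f_min), len(acf) - 1)          # longest period considered
    lag = lag_min + int(np.argmax(acf[lag_min:lag_max + 1]))
    return fs / lag
```

The resolution is limited to whole-sample lags, so interpolation around the peak and a voicing decision would be needed before using this on real recordings.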
https://doi.org/10.14257/ajmahs.2017.06.56

A SOUND SPECTROGRAPHICAL STUDY ON THE KOREAN VOWELS AND CONSONANTS PRONOUNCED BY OPENBITE PATIENTS - Frequency Analysis - (SOUND SPECTROGRAPH를 이용한 개교환자의 한국어 자·모음의 발성에 관한 연구 - 주파수 분석을 중심으로 -)

Kim, Ki-Dal;Yang, Won Sik
- The Korean Journal of Orthodontics / v.15 no.1 / pp.55-66 / 1985
The study was undertaken to ascertain the speech defects of patients with malocclusion, especially openbite patients, by means of spectral analysis. The experimental group was composed of ten female openbite patients with a mean age of 13.8 years. The control group was composed of ten girls with a mean age of 13.7 years. The speech material consisted of eight Korean monophthongs, two Korean fricatives, and two affricates. Speech was recorded and then analyzed with a Kay 7800 digital sonagraph. Formant frequency level or range was used as the phonemic parameter. The results were as follows: 1. Among vowels, /a:/: $F_1$, $F_3$, and $F_1/F_2$ showed abnormality; /o:/ and /w:/: $F_2$, $F_2 - F_1$, and $F_1/F_2$ showed abnormality. 2. Among consonants, /S/ and /h/: the upper and lower borders of the frequency range showed abnormality; (equation omitted): the lower border of the frequency range showed abnormality; /$C^{h}$/: the upper and lower borders of the frequency range and the concentration point showed abnormality.

F-ratio of Speaker Variability in Emotional Speech

Yi, So-Pae
- Speech Sciences / v.15 no.1 / pp.63-72 / 2008
Various acoustic features were extracted and analyzed to estimate the inter- and intra-speaker variability of emotional speech. Tokens of vowel /a/ from sentences spoken with different modes of emotion (sadness, neutral, happiness, fear and anger) were analyzed. All of the acoustic features (fundamental frequency, spectral slope, HNR, H1-A1 and formant frequency) indicated greater contribution to inter- than intra-speaker variability across all emotions. Each acoustic feature of speech signal showed a different degree of contribution to speaker discrimination in different emotional modes. Sadness and neutral indicated greater speaker discrimination than other emotional modes (happiness, fear, anger in descending order of F-ratio). In other words, the speaker specificity was better represented in sadness and neutral than in happiness, fear and anger with any of the acoustic features.
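One common way to express how well a feature separates speakers is an F-ratio: the variance of the per-speaker means divided by the average within-speaker variance. The sketch below assumes that definition with population variances; the abstract does not state the exact formula used, and the function name f_ratio is hypothetical.

```python
import numpy as np

def f_ratio(samples_by_speaker):
    """Between-speaker variance of means divided by mean within-speaker variance.

    samples_by_speaker maps a speaker label to a sequence of feature values
    (for example F0 measurements of the vowel /a/ in one emotional mode).
    Assumes at least two speakers and nonzero within-speaker variance.
    """
    groups = [np.asarray(v, dtype=float) for v in samples_by_speaker.values()]
    between = np.var([g.mean() for g in groups])   # spread of the speaker means
    within = np.mean([g.var() for g in groups])    # average spread inside each speaker
    return between / within
```

A larger value means the feature varies more across speakers than within a speaker, which matches the sense in which the abstract ranks emotional modes by speaker discrimination.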
