• Title/Summary/Keyword: 발화속도

Search Result 127, Processing Time 0.023 seconds

Speaker age estimation and acoustic characteristics: According to pitch and speech rate (화자 연령 지각과 음성적 특성: 음높이와 발화 속도를 중심으로)

  • Seo, YoonJeong;Shin, Jiyoung
    • Phonetics and Speech Sciences
    • /
    • v.11 no.4
    • /
    • pp.9-18
    • /
    • 2019
  • This study aimed to investigate the correlation between speaker's chronological age (CA) and perceived age (PA) and to specify the effect of pitch and speech rate as acoustic cue on judging age, using perceptual testing and acoustic analysis. Three tasks were conducted to identify the degree of listener's accuracy about age estimation. Three perception tasks were conducted to measure the accuracy of 80 Korean listeners when presented with different types of speech. In all the tasks, participants listened to speech samples and gave their estimate of the speaker's age in figures. It was found that Korean listeners are able to gauge the age of a speaker fairly precisely. CA and mean PA were positively correlated in all three tasks. It is clear that the amount and type of information included in the voice samples affected the accuracy of a listener's judgement. Moreover, the result revealed that listeners make use of acoustic information such as pitch and speech rate to estimate speaker's age.

Improvement in Korean Speech Recognition using Dynamic Multi-Group Mixture Weight (동적 다중 그룹 혼합 가중치를 이용한 한국어 음성 인식의 성능향상)

  • 황기찬;김종광;김진수;이정현
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2002.10d
    • /
    • pp.544-546
    • /
    • 2002
  • 본 논문은 CDHMM(Continuous Density Hidden Markov Model)의 훈련하는 방법을 동적 다중 그룹 혼합 가중치(Dynamic Mutli-Group mixture weight)을 이용하여 재구성하는 방법을 제안한다. 음성은 Hidden 상태열에 의하여 특성화되고, 각 상태는 가중된 혼합 가우시안 밑도 함수에 의해 표현된다. 음성신호를 더욱더 정확하게 계산하려면 각 상태를 위한 가우시안 함수를 더욱더 많이 사용해야 하며 이것은 많은 계산량이 요구된다. 이러한 문제는 가우시안 분포 확률의 통계적인 평균을 이용하면 계산량을 줄일 수 있다. 그러나 이러한 기존의 방법들은 다양한 화자의 발화속도와 가중치의 적용이 적합하지 못하여 인식률을 저하시키는 단점을 가지고 있다. 이 문제를 다양한 화자의 발화속도에 적합하도록 화자의 화자의 발화속도에 따라 동적으로 5개의 그룹으로 구성하고 동적 다중 그룹 혼합 가중치를 적용하여 CDHMM 파라미터를 재구성함으로써 8.5%의 인식율이 증가되었다.

  • PDF

Guidance of Web Document Structure and Voice Firing Rate Control in the Voice Web Browser (음성 웹브라우저에서의 문서구조안내 및 발화속도제어)

  • 조철환;최훈일;연제용;장영건
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2002.04a
    • /
    • pp.415-417
    • /
    • 2002
  • 본 논문은 HTML을 분석하여 추출된 내용을 트리로 표현하여 문서구조안내에 이용하고, 웹 문서의 내용의 숙독 필요성에 따라 실시간으로 음성 발화속도를 제어할 수 있는 음성 웹 브라우저의 설계와 구현에 관한 것이다. 이 시스템의 특징은 웹 브라우저 상에 태그로 표현된 모든 요소를 추출하고, 이러한 정보를 트리로 표현하고 음성인식으로 정보를 선택하도록 하고, 선택한 정보의 이도와 필요성에 따라 사용자가 실시간 발화속도제어를 통하여 정보를 쉽게 알 수 있도록 했다. 이 방식은 문서의 내용에 따른 구조를 쉽게 인식하여 사용자가 빠른 시간 내에 필요한 정보를 수집할 수 있고, 문서가 발음되는 것을 청취하여 문서의 필요성을 인식하고, 숙독 필요성에 따라 실시간으로 낭독 속도를 제어할 수 있는 장점이 있다.

  • PDF

The influences of speech rate, utterance length and sentence complexity of disfluency in preschool children who stutter and children who do not stutter (문장 따라말하기에서 말속도, 발화길이 및 통사적 복잡성에 따른 말더듬 아동과 일반아동의 비유창성 비교)

  • Kim, Yesul;Sim, Hyunsub
    • Phonetics and Speech Sciences
    • /
    • v.13 no.1
    • /
    • pp.53-64
    • /
    • 2021
  • According to Demand and Capacity Model (DCM), external and internal environments influence the disfluency of children who stutter (CWS). This study investigated the effects of simultaneous changes in motoric and linguistic demands on CWS and children who do not stutter (CWNS). Participants were 4-6 years old CWS and CWNS. A sentence imitation task with changes in speech rate, utterance length, and sentence complexity was used to examine their effects on children's disfluency. When the utterance length changed, CWS showed more disfluency regardless of utterance length and as the speech rate changed, CWS showed more disfluency at fast speech rate than CWNS. When the utterance length and speech rate changed, at fast speech rate, CWS showed more disfluency in both utterances than CWNS. When sentence complexity changed, CWS showed more disfluency than CWNS in complex sentences. Changes in linguistic elements such as speech rate, utterance length, and sentence complexity affect disfluency in CWS, especially when they were exposed to faster, longer, and more complex sentences. This indicates that CWS are vulnerable to fast and complex speech motor control and language processing ability than CWNS. Thus, this study suggests that parents and therapists consider both the speech rate and the utterance length when talking with CWS.

Voice Features Extraction of Lung Diseases Based on the Analysis of Speech Rates and Intensity (발화속도 및 강도 분석에 기반한 폐질환의 음성적 특징 추출)

  • Kim, Bong-Hyun;Cho, Dong-Uk
    • The KIPS Transactions:PartB
    • /
    • v.16B no.6
    • /
    • pp.471-478
    • /
    • 2009
  • The lung diseases classifying as one of the six incurable diseases in modern days are caused mostly by smoking and air pollution. Such causes the lung function damages, and results in malfunction of the exchange of carbon dioxide and oxygen in an alveolus, which the interest is augment with risk diseases of life prolongation. With this in the paper, we proposed a diagnosis method of lung diseases by applying parameters of voice analysis aiming at the getting the voice feature extraction. Firstly, we sampled the voice data from patients and normal persons in the same age and sex, and made two sample groups from them. Also, we conducted an analysis by applying the various parameters of voice analysis through the collected voice data. The relational significance between the patient and normal groups can be evaluated in terms of speech rates and intensity as a part of analized parameters. In conclusion, the patient group has shown slower speech rates and bigger intensity than the normal group. With this, we propose the method of voice feature extraction for lung diseases.

Korean prosodic properties between read and spontaneous speech (한국어 낭독과 자유 발화의 운율적 특성)

  • Yu, Seungmi;Rhee, Seok-Chae
    • Phonetics and Speech Sciences
    • /
    • v.14 no.2
    • /
    • pp.39-54
    • /
    • 2022
  • This study aims to clarify the prosodic differences in speech types by examining the Korean read speech and spontaneous speech in the Korean part of the L2 Korean Speech Corpus (speech corpus for Korean as a foreign language). To this end, the articulation length, articulation speed, pause length and frequency, and the average fundamental frequency values of sentences were set as variables and analyzed via statistical methodologies (t-test, correlation analysis, and regression analysis). The results found that read speech and spontaneous speech were structurally different in the form of prosodic phrases constituting each sentence and that the prosodic elements differentiating each speech type were articulation length, pause length, and pause frequency. The statistical results show that the correlation between articulation speed and articulation length was highest in read speech, explaining that the longer a given sentence is, the faster the speaker speaks. In spontaneous speech, however, the relationship between the articulation length and the pause frequency in a sentence was high. Overall, spontaneous speech produces more pauses because short intonation phrases are continuously built to make a sentence, and as a result, the sentence gets lengthened.

Characteristic of Thermal Decomposition and Ignition Temperature of Magnesium Particles (마그네슘 분진의 열분해 및 발화온도 특성)

  • Han, Ou-Sup;Lee, Jung-Suk
    • Journal of the Korean Institute of Gas
    • /
    • v.17 no.5
    • /
    • pp.69-74
    • /
    • 2013
  • The study was conducted experimentally on characteristic of thermal decomposition and minimum ignition temperature of magnesium dusts. For this purpose, three different Mg dusts of mean diameter (38, 142, $567{\mu}m$) were used. Experimental investigations were conducted by using TGA(Thermo gravimetric analysis) and MIT(Minimum Ignition Temperature) apparatus made in accordance with IEC 61241-2-1 standard. As the results, temperature of weight gain in Mg dust layers increased with increasing of heating rates in air and, under the same heating rate condition, minimum ignition temperature increased with particle size. Also the MIT of suspended Mg dust clouds tended to increase with increasing of mean diameter.

Diagnosis and Evaluation of Humanities Therapy: The Phonetic Analysis of Speech Rates and Fundamental Frequency According to Preferred Sensation Type (인문치료의 진단 및 평가: 감각유형에 따른 말속도와 기본주파수의 실험음성학적 분석)

  • Lee, Chan-Jong;Heo, Yun-Ju
    • The Journal of the Acoustical Society of Korea
    • /
    • v.30 no.4
    • /
    • pp.231-237
    • /
    • 2011
  • The purpose of this study is to examine the correlation between the preferred sensation type and speech sounds, especially on $F_0$ and the speech rates. Data for the sensation types and speech sounds were collected from 36 undergraduate and graduate students (17 male, 19 female). Subjects were asked to read a given text (400 syllables), describe a drawing, and give answers to some questions. We measured speakers' $F_0$ and speech rates. The results show that type V (Visual) has the correlation with the speech rates when type D (Digital) was ruled out, and type A (Auditory) has the correlation with the speech rates when type D was included. Furthermore, the analysis of the mean values of V, A, K (Visual, Auditory, Kinethetic) indicates that type V is characterized with faster speech rates and higher $F_0$ in all parts except for interview and the same is true for that of V, A, K, D (Visual, Auditory, Kinethetic, Digital) in all parts. In conclusion, this study proved that the preferred sensation type has the correlation with $F_0$ and speech rates. Based on the results of this study, $F_0$ and speech rates can be used to analyze the sensation types for individualized education as well as consultation. In addition, this study has great significance in that it lays a foundation for the study on the correlation between a preferred sensation type and speech sounds.

Spontaneous Combustion of Various Fuels of Carbonization Rank (탄화도별 발전연료의 자연발화 특성 평가)

  • Kim, Jae-Kwan;Park, Seok-Un;Jeong, Jae-Hyeok;Shin, Dong-Ik;Hong, Jun-Seok;Hong, Jin Pyo
    • Journal of Energy Engineering
    • /
    • v.26 no.3
    • /
    • pp.78-89
    • /
    • 2017
  • Spontaneous combustion propensity of various coals of carbonization grade as a pulverized fuel of coal fired power plant has been tested from an initial temperature of $25^{\circ}C$ to $600^{\circ}C$ by heated in an oven with air to analyze an self oxidation starting temperature. This tests produce a CPT(Cross Point Temperature), IT(Ignition temperature) and CPS(Cross Point Slope) by calculated as the slope of time taken a rapid exothermic oxidation reaction at CPT base. CPS show a carbonization rank dependence, whereby wood pellet has the highest propensity to spontaneous combustion of $20.995^{\circ}C/min$. A subbituminous KIDECO coal shows an CPS values of $15.370^{\circ}C/min$ whereas it of pet coke of the highest carbonization rank has $20.950^{\circ}C/min$. The nature of this trend is most likely a concentration of volatile matter and oxygen functional groups of coal surface that governs the available component for oxidation as well as surface area of fuel char, and constant pressure molar heat.

A Study on Thermal Characteristics and Ignitability of Dead Leaves and Living Leaves for Main Species of Trees in Youngdong Areas (영동지역의 주요 수종별 낙엽과 생업의 열적특성 및 발화특성에 관한 연구)

  • Lee, Hae-Pyeong;Lee, Si-Young;Park, Young-Ju
    • Fire Science and Engineering
    • /
    • v.23 no.1
    • /
    • pp.21-32
    • /
    • 2009
  • In order to inspect the danger of forest fires, the thermal characteristics and the ignitability of the dead leaves and the living leaves for the main species of trees in Youngdong areas have been studied by the TG/DTA and the group flammability tester. From this work, the thermal delay has been increased with the increase of the heating rate. The fractions of the thermal weight loss for the dead leaves and the living leaves of the coniferous trees were higher than those of the broadleaf trees. Also, it was confirmed that the ignitable dangers of the dead leaves and the coniferous trees were higher than those of the living leaves and the broadleaf trees, due to the low auto ignition temperature and thermal resistance.