• Title/Summary/Keyword: Korean speech

Comparison of overall speaking rate and pause between children with speech sound disorders and typically developing children (말소리장애 아동과 일반 아동의 발화 속도와 쉼 비교)

  • Lee, HeungIm;Kim, SooJin
    • Phonetics and Speech Sciences
    • /
    • v.9 no.2
    • /
    • pp.111-118
    • /
    • 2017
  • This study compares speech rate, articulatory rate, and pauses in a sentence repetition task between children with mild and moderate speech sound disorder (SSD) and typically developing (TD) children of the same chronological age. The results were analyzed across the three groups in terms of speech rate and articulatory rate. There was no difference between the two SSD groups, that is, between the mild and moderate groups. However, both SSD groups were significantly slower than the TD group in speech rate and articulatory rate. Likewise, there was no significant difference in the length and frequency of pauses between the moderate group and the mild group, but both differed substantially from the TD group. This study provides basic data for evaluating children's speech rate and implies that children with SSD have limitations in speech rate.
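
The abstract does not define its measures, so the following is only a minimal sketch under a common convention: speech rate counts syllables over the whole utterance including pauses, while articulatory rate excludes pause time. The function name, the syllables-per-second unit, and the sample numbers are illustrative assumptions, not taken from the study.

```python
def speaking_rates(n_syllables, total_duration_s, pause_durations_s):
    """Return (speech_rate, articulatory_rate) in syllables per second.

    Assumed convention (not stated in the abstract):
      speech rate       = syllables / total time, pauses included
      articulatory rate = syllables / time spent articulating, pauses excluded
    """
    pause_total = sum(pause_durations_s)
    articulation_time = total_duration_s - pause_total
    if total_duration_s <= 0 or articulation_time <= 0:
        raise ValueError("durations must leave a positive articulation time")
    speech_rate = n_syllables / total_duration_s
    articulatory_rate = n_syllables / articulation_time
    return speech_rate, articulatory_rate


# Hypothetical utterance: 12 syllables in 4.0 s with two 0.5 s pauses.
rates = speaking_rates(12, 4.0, [0.5, 0.5])
```

Because pauses only lengthen the denominator of the speech rate, the articulatory rate is never lower than the speech rate for the same utterance.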

Furlow Palatoplasty in Submucous Cleft Palate-Timing of Operation (점막하 구개열에서 Furlow 구개성형술의 수술시기)

  • Kim, Suk Wha;Park, Joon Kyu
    • Archives of Plastic Surgery
    • /
    • v.34 no.6
    • /
    • pp.741-747
    • /
    • 2007
  • Purpose: To determine how the timing of operation affects speech outcome in submucous cleft palate, we reviewed our experience with Furlow palatoplasty over the last 11 years. Methods: From March 1996 to March 2006, 38 patients with submucous cleft palate underwent Furlow palatoplasty. Ten developmentally delayed patients were excluded and 5 patients were lost to follow-up; the remaining 23 patients were reviewed. Speech was evaluated preoperatively and postoperatively, and speech therapy was performed accordingly. Perceptual speech assessment covered hypernasality, nasal emission, and articulation disorder. Cinefluorography was performed to aid perceptual assessment. Based on the timing of operation, the patients were divided into 3 groups: Group A, under 24 months (8 patients); Group B, 25 to 48 months (6 patients); and Group C, over 49 months (9 patients). Excluding 1 patient who was still undergoing speech therapy, the resulting speech was compared. Results: The rate of abnormal speech was higher in Group C (3/9, 33.3%) than in Group A (0%) or Group B (0%). All 3 patients whose speech therapy had been discontinued at the parents' discretion had abnormal speech. The reason for the discontinuation was that regular speech therapy was a burden at school age. All patients who continued speech therapy had normal speech. Conclusion: Our results show that operative timing is associated with speech development. Maintenance of speech therapy was an important factor in normal speech development. Performing palatoplasty before 48 months of age will help patients complete speech therapy before school age.

Formant Locus Overlapping Method to Enhance Naturalness of Synthetic Speech (합성음의 자연도 향상을 위한 포먼트 궤적 중첩 방법)

  • 안승권;성굉모
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.28B no.10
    • /
    • pp.755-760
    • /
    • 1991
  • In this paper, we propose a new formant locus overlapping method that can effectively enhance the naturalness of synthetic speech produced by a demisyllable-based Korean text-to-speech system. First, Korean demisyllables are divided into several segments that have linear formant transition characteristics. Then a database composed of the start point and length of each formant segment is built. When synthesizing speech from this demisyllable database, we concatenate the formant loci using the proposed overlapping method, which can closely simulate the human articulation mechanism. We implemented a Korean text-to-speech system using this method and showed that the formant loci of the synthetic speech are similar to those of natural speech. Finally, we illustrate that the spectrograms produced by the proposed method are more similar to natural speech than those of the conventional method.
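
The abstract does not give the overlapping rule itself, so the sketch below only illustrates the general idea under stated assumptions: each formant track is stored as linear segments of (start frequency, length in frames), and two tracks are joined by a linear cross-fade over a few overlapping frames. The function names, the cross-fade weighting, and the sample values are hypothetical and should not be read as the authors' method.

```python
def expand_segments(segments, end_hz):
    """Expand [(start_hz, length_frames), ...] into a frame-by-frame track.

    Each segment moves linearly from its own start value toward the start
    of the next segment; the last segment moves toward end_hz.
    """
    starts = [start for start, _ in segments] + [end_hz]
    track = []
    for i, (start_hz, length) in enumerate(segments):
        target = starts[i + 1]
        for k in range(length):
            track.append(start_hz + (target - start_hz) * k / length)
    return track


def overlap_concat(left, right, overlap):
    """Join two formant tracks, cross-fading over `overlap` frames."""
    if not 0 < overlap <= min(len(left), len(right)):
        raise ValueError("overlap must fit inside both tracks")
    blended = []
    for k in range(overlap):
        w = (k + 1) / (overlap + 1)  # weight of the right-hand track
        blended.append((1 - w) * left[len(left) - overlap + k] + w * right[k])
    return left[:-overlap] + blended + right[overlap:]


# Hypothetical first-formant tracks for two adjacent demisyllables.
left = expand_segments([(500.0, 2), (700.0, 2)], end_hz=600.0)
right = [600.0, 600.0, 600.0, 600.0]
joined = overlap_concat(left, right, overlap=2)
```

Storing only start points and lengths keeps the database small, at the cost of assuming that formant transitions are well approximated by straight lines.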

Emotion Robust Speech Recognition using Speech Transformation (음성 변환을 사용한 감정 변화에 강인한 음성 인식)

  • Kim, Weon-Goo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.20 no.5
    • /
    • pp.683-687
    • /
    • 2010
  • This paper studies methods that use frequency warping, one of the speech transformation methods, to develop a speech recognition system that is robust to emotional variation. For this purpose, the effect of emotional variation on the speech signal was studied using a speech database containing various emotions. It was observed that the speech spectrum is affected by emotional variation and that this effect is one of the reasons the performance of the speech recognition system degrades. A new training method that uses frequency warping in the training process is presented to reduce the effect of emotional variation, and a speech recognition system based on vocal tract length normalization is developed for comparison with the proposed system. Experimental results from isolated word recognition using HMMs showed that the new training method reduced the error rate of the conventional recognition system on speech signals containing various emotions.
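
The abstract does not say which warping function was used. As a rough illustration only, the sketch below implements a piecewise-linear frequency warp of the kind commonly used for vocal tract length normalization: frequencies are scaled by a factor `alpha` up to a knee, then mapped linearly so that the Nyquist frequency stays fixed. The function name, the 8000 Hz Nyquist frequency, and the knee ratio of 0.85 are assumptions.

```python
def warp_frequency(f_hz, alpha, f_nyquist=8000.0, knee_ratio=0.85):
    """Piecewise-linear frequency warp (a common VTLN-style choice).

    Below the knee, the frequency is scaled by alpha. Above it, a second
    linear piece joins the knee to (f_nyquist, f_nyquist), so the warped
    axis still covers exactly 0..f_nyquist.
    """
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    knee = knee_ratio * f_nyquist * min(1.0, 1.0 / alpha)
    if f_hz <= knee:
        return alpha * f_hz
    slope = (f_nyquist - alpha * knee) / (f_nyquist - knee)
    return alpha * knee + slope * (f_hz - knee)


# Hypothetical warp factors: 1.1 stretches the spectrum, 0.9 compresses it.
stretched = warp_frequency(1000.0, alpha=1.1)
compressed = warp_frequency(1000.0, alpha=0.9)
```

In a training setup of the kind the abstract describes, a warp like this would be applied to the filterbank center frequencies or the spectrum before feature extraction; how the warp factors are chosen per utterance is not stated in the abstract.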

Two Cases of Aphasic Stroke Patients treated with Speech Therapy and Korean Medical Therapy (언어치료와 한방치료를 병행한 중풍 실어증환자 치험 2례)

  • Yeo, Jin-Ju;Lee, Tae-Ho;Yu, Gyung;Kim, Lak-Hyung;Seo, Eui-Seok;Jang, In-Soo
    • The Journal of Internal Korean Medicine
    • /
    • v.25 no.3
    • /
    • pp.662-668
    • /
    • 2004
  • Cerebrovascular accident (CVA) is a leading cause of death, and severe sequelae such as motor disturbance, mental disorder, dysphagia, recognition disorder, and speech disorder (aphasia) often occur. Most medical treatment of CVA sequelae emphasizes motor disturbance, so speech disorder (aphasia) has been neglected. However, speech disorder therapy is essential for social rehabilitation. Recently, various clinical approaches and potential treatments for speech disorder (aphasia) have been researched inside and outside South Korea. In Korean Medicine, there have been only a few papers on speech disorders. This study reports two cases of aphasic stroke patients who were treated for speech and language disorders with Korean medical therapy.

The realization of English rhythm by Busan Korean speakers

  • Choe, Wook Kyung
    • Phonetics and Speech Sciences
    • /
    • v.11 no.4
    • /
    • pp.81-87
    • /
    • 2019
  • The purpose of the current study is to investigate the realization of speech rhythm in English as spoken by Korean learners of English. The study particularly aims to examine the rhythm metrics of English read speech by learners who speak Busan or the South Kyungsang dialect of Korean. Twenty-four learners whose L1 is Busan Korean and eight native speakers of English read a passage wherein five sentences were segmented and labeled as vocalic and intervocalic intervals. Various rhythm metrics such as %V, Varcos, and Pairwise Variability Indexes (PVIs) were calculated. The results show that Korean learners read English sentences with significantly more vocalic and consonantal intervals at a slower speech rate than native English speakers. The analyses of rhythm metrics revealed that when the speech rate was not normalized, Korean learners' English showed more variability in the length of consonantal and vocalic intervals. However, speech-rate-normalized rhythm metrics for vocalic intervals indicated that Korean learners transferred their L1 rhythmic structures (a syllable-timed language) into their L2 speech (a stress-timed language). Overall, the results suggest that Korean learners' English reflects the rhythmic characteristics of their L1. The effect of the learners' L1 dialect on the realization of L2 speech rhythm is also speculated.
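
For readers unfamiliar with the metrics named in this abstract, the sketch below computes %V, Varco, and the raw and normalized Pairwise Variability Indexes from lists of interval durations, using their usual definitions in the rhythm literature. The abstract does not give its exact formulas, so details such as the use of the population standard deviation are assumptions, and the sample durations are made up.

```python
from statistics import mean, pstdev


def percent_v(vocalic, consonantal):
    """%V: share of total duration taken up by vocalic intervals."""
    return 100.0 * sum(vocalic) / (sum(vocalic) + sum(consonantal))


def varco(durations):
    """Varco: standard deviation normalized by the mean, times 100."""
    return 100.0 * pstdev(durations) / mean(durations)


def raw_pvi(durations):
    """rPVI: mean absolute difference between successive intervals."""
    pairs = zip(durations, durations[1:])
    return mean([abs(a - b) for a, b in pairs])


def normalized_pvi(durations):
    """nPVI: like rPVI, but each difference is divided by the pair mean."""
    pairs = zip(durations, durations[1:])
    return 100.0 * mean([abs(a - b) / ((a + b) / 2.0) for a, b in pairs])


# Hypothetical interval durations in seconds.
vocalic = [0.1, 0.2, 0.1, 0.2]
consonantal = [0.1, 0.1, 0.1, 0.1]
metrics = {
    "%V": percent_v(vocalic, consonantal),
    "VarcoV": varco(vocalic),
    "nPVI-V": normalized_pvi(vocalic),
    "rPVI-C": raw_pvi(consonantal),
}
```

Varco and nPVI divide by a local or global mean duration, which is why they count as speech-rate-normalized, whereas rPVI grows with slower speech.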

A Study on the Endpoint Detection Algorithm (끝점 검출 알고리즘에 관한 연구)

  • 양진우
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1984.12a
    • /
    • pp.66-69
    • /
    • 1984
  • This paper studies endpoint detection for Korean speech recognition. For speech signal processing, the analysis parameters considered were the zero crossing rate (ZCR), log energy (LE), and energy of the prediction error (Ep), and the basic Korean spoken digits /영/ through /구/ (zero through nine) were selected as data for speech recognition. The main goal of this paper is to develop techniques and a system for speech input to machines. To detect endpoints, log energy was chosen from among the analyzed parameters, as it is a very effective parameter for classifying speech and nonspeech segments. The analysis resulted in an error rate of 1.43%.
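
As a rough illustration of log-energy endpoint detection, the sketch below frames a signal, computes the log energy of each frame, and marks as speech every frame whose energy exceeds a noise floor by a fixed margin. The abstract gives no thresholds or frame sizes, so the frame length, the noise-floor estimate from the leading frames, and the 10 dB margin are all assumptions.

```python
import math


def log_energy(frame, eps=1e-10):
    """Log energy of one frame in dB; eps avoids log(0) on silence."""
    return 10.0 * math.log10(sum(x * x for x in frame) + eps)


def detect_endpoints(samples, frame_len=160, noise_frames=5, margin_db=10.0):
    """Return (start_sample, end_sample) of the speech region, or None.

    Assumes the first `noise_frames` frames contain only background noise
    and uses their mean log energy as the noise floor.
    """
    frames = [samples[i:i + frame_len]
              for i in range(0, len(samples) - frame_len + 1, frame_len)]
    if len(frames) <= noise_frames:
        raise ValueError("signal too short for noise-floor estimation")
    energies = [log_energy(frame) for frame in frames]
    threshold = sum(energies[:noise_frames]) / noise_frames + margin_db
    speech = [i for i, e in enumerate(energies) if e > threshold]
    if not speech:
        return None
    return speech[0] * frame_len, (speech[-1] + 1) * frame_len


# Synthetic signal: low-level noise, a loud burst, then noise again.
signal = [0.001] * 1600 + [0.5] * 800 + [0.001] * 800
endpoints = detect_endpoints(signal)
```

A single energy threshold tends to miss weak fricatives and final stops, which is one reason energy is often combined with other parameters such as the zero crossing rate.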

A Study on the Voice Conversion with HMM-based Korean Speech Synthesis (HMM 기반의 한국어 음성합성에서 음색변환에 관한 연구)

  • Kim, Il-Hwan;Bae, Keun-Sung
    • MALSORI
    • /
    • v.68
    • /
    • pp.65-74
    • /
    • 2008
  • Statistical parametric speech synthesis based on hidden Markov models (HMMs) has grown in popularity over the last few years because, compared with a corpus-based unit concatenation text-to-speech (TTS) system, it needs less memory, has lower computational complexity, and is suitable for embedded systems. It also has the advantage that the voice characteristics of the synthetic speech can be modified easily by transforming the HMM parameters appropriately. In this paper, we present experimental results of voice characteristics conversion using an HMM-based Korean speech synthesis system. The results show that voice characteristics can be converted using a few sentences uttered by a target speaker. Synthetic speech generated from models adapted with only ten sentences was very close to that from speaker-dependent models trained on 646 sentences.

A Study of Decision Tree Modeling for Predicting the Prosody of Corpus-based Korean Text-To-Speech Synthesis (한국어 음성합성기의 운율 예측을 위한 의사결정트리 모델에 관한 연구)

  • Kang, Sun-Mee;Kwon, Oh-Il
    • Speech Sciences
    • /
    • v.14 no.2
    • /
    • pp.91-103
    • /
    • 2007
  • The purpose of this paper is to develop a model for predicting the prosody of Korean text-to-speech synthesis using the CART and SKES algorithms. CART tends to favor predictor variables that have many instances. Therefore, an F-test-based partition method was applied to CART, reducing the number of instances by grouping phonemes. Furthermore, the quality of the text-to-speech synthesis was evaluated after applying the SKES algorithm to data of the same size. For the evaluation, MOS tests were performed with 30 men and women in their twenties. The results showed that applying the SKES algorithm made the synthesized speech clearer and more natural.

Word class information in perception of prosodic prominence by Korean learners of English

  • Im, Suyeon
    • Phonetics and Speech Sciences
    • /
    • v.11 no.4
    • /
    • pp.1-8
    • /
    • 2019
  • This study aims to investigate how prosodic prominence is perceived in relation to word class information (or parts-of-speech) by Korean learners of English compared with native English speakers in public speech. Two groups, Korean learners of English and native English speakers, were asked to mark the words they perceived as prominent while listening to a speech. Parts-of-speech and three acoustic cues (i.e., max F0, mean phone duration, and mean intensity) were analyzed for each word in the speech. The results showed that content words tended to be higher in pitch and longer in duration than function words. Both groups of listeners rated content words as prominent more frequently than function words. This tendency, however, was significantly greater for Korean learners of English than for native English speakers. Among the parts-of-speech of the content words, Korean learners of English were more likely than native English speakers to judge nouns and verbs as prominent. This study presents evidence that Korean learners of English consider most, if not all, content words as landing locations of prosodic prominence, in line with a previous study on the production of prominence.