• 제목/요약/키워드: Korean speech

검색결과 5,286건 처리시간 0.029초

파킨슨병 환자의 말 특성과 언어치료 관련 국내문헌연구 (A Study of Korean Literature Review Related to Speech Characteristics and Speech Therapy in Patients with Parkinson Disease)

  • 강하늘;유재연
    • 대한후두음성언어의학회지
    • /
    • 제30권2호
    • /
    • pp.87-94
    • /
    • 2019
  • The purpose of this study was to investigate the speech characteristics and speech therapy of Parkinson disease (PD). This study selected 28 papers published in Korea from 1998 to 2018 after searching the terms 'Parkinson voice' and 'Parkinson speech therapy.' Literature review had been conducted in the two aspects of speech characteristics and speech therapy. The speech characteristics were divided into respiration, phonation, articulation, prosody, vowel production, and voice questionnaire. Speech therapy was divided into Lee Sliverman voice treatment (LSVT) and other voice therapy. PD patients did not differ in respiration function compared to normal elderly people, but their speech and articulation function were poorer. There was also a difference in the speech rate, frequency of pause, and accuracy of vowel production compared with normal elderly people. PD had a lower VHI score and their voice related quality of life was a little poorer. The LSVT was typically used in speech therapy for PD. The methods of speech therapy for PD have been shown to improve respiration and phonation. It is necessary to establish voice norms in PD patients and develop effective speech therapy in the following study.

네트워크 환경에서 서버용 음성 인식을 위한 MFCC 기반 음성 부호화기 설계 (A MFCC-based CELP Speech Coder for Server-based Speech Recognition in Network Environments)

  • 이길호;윤재삼;오유리;김홍국
    • 대한음성학회지:말소리
    • /
    • 제54호
    • /
    • pp.27-43
    • /
    • 2005
  • Existing standard speech coders can provide speech communication of high quality while they degrade the performance of speech recognition systems that use the reconstructed speech by the coders. The main cause of the degradation is that the spectral envelope parameters in speech coding are optimized to speech quality rather than to the performance of speech recognition. For example, mel-frequency cepstral coefficient (MFCC) is generally known to provide better speech recognition performance than linear prediction coefficient (LPC) that is a typical parameter set in speech coding. In this paper, we propose a speech coder using MFCC instead of LPC to improve the performance of a server-based speech recognition system in network environments. However, the main drawback of using MFCC is to develop the efficient MFCC quantization with a low-bit rate. First, we explore the interframe correlation of MFCCs, which results in the predictive quantization of MFCC. Second, a safety-net scheme is proposed to make the MFCC-based speech coder robust to channel error. As a result, we propose a 8.7 kbps MFCC-based CELP coder. It is shown from a PESQ test that the proposed speech coder has a comparable speech quality to 8 kbps G.729 while it is shown that the performance of speech recognition using the proposed speech coder is better than that using G.729.

  • PDF

Speech Outcomes in 5-Year-Old Korean Children with Bilateral Cleft Lip and Palate

  • Kyung S. Koh;Seungeun Jung;Bo Ra Park;Tae-Suk Oh;Young Chul Kim;Seunghee Ha
    • Archives of Plastic Surgery
    • /
    • 제51권1호
    • /
    • pp.80-86
    • /
    • 2024
  • Background Among the cleft types, bilateral cleft lip and palate (BCLP) generally requires multiple surgical procedures and extended speech therapy to achieve normal speech development. This study aimed to describe speech outcomes in 5-year-old Korean children with BCLP and examine whether normal speech could be achieved before starting school. Methods The retrospective study analyzed 52 children with complete BCLP who underwent primary palatal surgery at a tertiary medical center. Three speech-language pathologists made perceptual judgments on recordings from a speech follow-up assessment of 5-year-old children. They assessed the children's speech in terms of articulation, speech intelligibility, resonance, and voice using the Cleft Audit Protocol for Speech-Augmented-Korean Modification. Results The results indicated that at the age of five, 65 to 70% of children with BCLP presented articulation and resonance within normal or acceptable ranges. Further, seven children with BCLP (13.5%) needed both additional speech therapy and palatal surgery for persistent velopharyngeal insufficiency and speech problems even at the age of five. Conclusion This study confirmed that routine follow-up speech assessments are essential as a substantial number of children with BCLP require secondary surgical procedures and extended speech therapy to achieve normal speech development.

연변 조선족 방언 음성의 실험적 연구 (Experimental Phonetic Study of Yanjin Sino-Korean Dialect)

  • 김현기
    • 말소리와 음성과학
    • /
    • 제1권1호
    • /
    • pp.47-52
    • /
    • 2009
  • The speech of Sino-Korean has been evolved from geopolitical cause since 1945. The aim of this study is to collect Yanji dialectal speech and to compare with South Korean dialectal speech. Twenty Yanbian university students participated as informants. Acoustic speech informations are analyzed using the Multi-Speech Windows Vista version. Dialectal speech characteristics of Yanji sino-Korean showed posterior vowel /${\alpha}$/, neutralization of mid-vowel /o/ between /o/ and /Ɔ/. Lenis stop sound showed the tendency of glottalization based on VOT value. Sibilant sound contains aspiration following constriction and lateral /l/ realized the approximant /r/.

  • PDF

한국어 아동 지향어에 나타난 폐쇄음의 음향 음성학적 특성 (Acoustic Characteristics of Korean Stops in Korean Child-directed Speech)

  • 김민정
    • 말소리와 음성과학
    • /
    • 제1권3호
    • /
    • pp.117-122
    • /
    • 2009
  • A variety of cross-linguistic studies has documented that the acoustic properties of speech addressed to young children include exaggeration of pitch contours and acoustically salient features of phonetic units. It has been suggested that phonetic modifications of child-directed speech facilitate young children's learning of speech sounds by providing detailed phonetic information about the target word. While there are several studies reporting vowel modifications in speech to infants (i.e., hyper-articulated vowels), there has been little research about consonant modifications in speech to young children (except for VOT). The present study examines acoustic properties of Korean stops in Korean mothers' speech to their children (seven children aged 27 to 38 months). Korean tense, lax, and aspirated stops are all voiceless in word-initial position, and are perceptually differentiated by several acoustic parameters including VOT, $f_0$ of the following vowel, and the amplitude difference of the first and second harmonics at the voice onset of the following vowel. This study compares values of these parameters in Korean child-directed speech to those in adult-directed speech from same speakers. Conclusions focus on the acoustic properties of Korean stops in child-directed speech and how they are modified to help Korean young children learn the three-way phonetic contrast.

  • PDF

How Different are Vowel Epentheses in Learner Speech and Loanword Phonology?

  • Park, Mi-Sun;Kim, Jong-Mi
    • 음성과학
    • /
    • 제15권2호
    • /
    • pp.33-51
    • /
    • 2008
  • Difference of learner speech and loanword phonology is investigated in terms of Korean learners' speech and their loanword adaptation of English words with a post-vocalic word-final stop. When we compared the speech of 12 Korean learners in mid-intermediate level with that of eight English speakers, the learner speech did not reflect loanword phonology of the vowel insertion after a voiced word-final stop (e.g., rib$[\dotplus]$, bad$[\dotplus]$, gag$[\dotplus]$ vs. tip[=], cat[=], book[=]), but, instead, the target phonology of vowel lengthening before a voiced word-final stop (e.g., rib[r.I:b], CAD$[k{\ae}:d]$, bag$[b{\ae}:g]$ vs. rip[rI.p], cat$[k{\ae}t]$, back$[b{\ae}k])$. A longitudinal study of learner speech before and after instruction showed some development toward the acquisition of target phonology. The results indicate that learner speech departs from loanword phonology, and approaches to target speech in a faster rate than direct ratio. Thus, native phonology predicts loanword phonology, but lends little support to learner speech. Our results also indicate that loanword phonology is constant, while learner speech changes toward the acquisition of target phonology.

  • PDF

한국 표준어 화자의 유창성과 말속도에 관한 연구 (Fluency and Speech Rate for the Standard Korean Speakers)

  • 심홍임
    • 음성과학
    • /
    • 제11권3호
    • /
    • pp.193-200
    • /
    • 2004
  • This was a preliminary study for standardizing speech rate and fluency of normal adult Korean speakers and comparing speech rate and fluency of normal speakers with those of professional speakers. The purposes of this study were to investigate (a) the speech rates (the overall speech rate and the articulation rate) and the disfluency characteristics of normnal adult speakers and (b) the speech rates (the overall speech rate and the articulation rate) and the disfluency characteristics between normal adult speakers and professional speakers. The results were as follows: The most frequent disfluency type was 'interjection' in story-telling, 'revision' in text reading and announcing of professional speakers. The professional speakers had the fastest speech rates (overall speech rate and articulation rate) among the 3 groups.

  • PDF

식도발성 남성 발화의 말 속도 (Speech Rates of Male Esophageal Speech)

  • 박원경;심희정;고도흥
    • 말소리와 음성과학
    • /
    • 제4권3호
    • /
    • pp.143-149
    • /
    • 2012
  • The purpose of this study is to investigate the speech rate of an esophageal speech group that is capable of vocalization after surgery. The subjects in this experiment were 10 male esophageal speakers and 10 male laryngeal speakers. Each group read a reading passage that was recorded by a DAT recorder (Rolando, EDIROL R-09). These records were analyzed by using CSL (Computerized Speech Lab, model 4150). The results were as follows: (1) the overall speech rate of esophageal speech was 2.50 SPS (syllable per second) while the overall speech rate of laryngeal speech was 4.23 SPS. (2) The articulatory rate of esophageal speech was 3.14 SPS (syllable per second) while the articulatory rate of laryngeal speech was 4.75 SPS. Speech rates as well as articulatory rates of esophageal speech were significantly lower than laryngeal speech. These differences between the two groups may be due to reduced efficiency of airflows across the pharyngeal-esophageal segment for esophageal speakers when compared to airflow through the glottis for laryngeal speakers. These results would provide a guideline in speech rates for esophageal speakers in clinical settings.

파킨슨 환자의 클리어 스피치 전후 음향학적 공기역학적 특성 (An aerodynamic and acoustic characteristics of Clear Speech in patients with Parkinson's disease)

  • 신희백;고도홍
    • 말소리와 음성과학
    • /
    • 제9권3호
    • /
    • pp.67-74
    • /
    • 2017
  • An increase in speech intelligibility has been found in Clear Speech compared to conversational speech. Clear Speech is defined by decreased articulation rates and increased frequency and length of pauses. The objective of the present study was to investigate improvement in immediate speech intelligibility in 10 patients with Parkinson's disease (age range: 46 to 75 years) using Clear Speech. This experiment has been performed using the Phonatory Aerodynamic System 6600 after the participants read the first sentence of a Sanchaek passage and the "List for Adults 1" in the Sentence Recognition Test (SRT) using casual speech and Clear Speech. Acoustic and aerodynamic parameters that affect speech intelligibility were measured, including mean F0, F0 range, intensity, speaking rate, mean airflow rate, and respiratory rate. In the Sanchaek passage, use of Clear Speech resulted in significant differences in mean F0, F0 range, speaking rate, and respiratory rate, compared with the use of casual speech. In the SRT list, significant differences were seen in mean F0, F0 range, and speaking rate. Based on these findings, it is claimed that speech intelligibility can be affected by adjusting breathing and tone in Clear Speech. Future studies should identify the benefits of Clear Speech through auditory-perceptual studies and evaluate programs that use Clear Speech to increase intelligibility.

Syllable-timing Interferes with Korean Learners' Speech of Stress-timed English

  • Lee, Ok-Hwa;Kim, Jong-Mi
    • 음성과학
    • /
    • 제12권4호
    • /
    • pp.95-112
    • /
    • 2005
  • We investigate Korean learners' speech-timing of English before and after instruction in comparison with native speech, in an attempt to resolve disagreements in the literature as to whether speech-timing is measurable (Lehiste, 1977; Roach, 1982; Dauer, 1983 vs. Low et al., 2000; Yun 2002; Jian, 2004). We measured the pair-wise variability between the adjacent stressed and unstressed syllables within a foot as well as that among adjacent feet in approximately 555 English sentences, which were read by 29 native speakers and 41 Korean learners in the intermediate proficiency level. The results show that in comparison with native American English, Korean learner speech is before instruction significantly (p<.001) smaller for the pair-wise variability between the adjacent stressed and unstressed syllables within a foot; and significantly (p=.01) bigger for the variability among adjacent feet within the utterance. The learner speech after instruction showed significant (p=.01) improvement in the pair-wise variability of syllable sequence toward native speech values. The variability among adjacent feet was progressively smaller for learner speech before and after instruction and for native speech (p=.03). We thus conclude that the speech timing difference between Korean English and American English is measurable in terms of the duration. of stressed and unstressed syllables and that the latter is stress-timed and the former is syllable-timing interfered.

  • PDF