• Title/Summary/Keyword: Speech Recording

Perceptual Boundary on a Synthesized Korean Vowel /o/-/u/ Continuum by Chinese Learners of Korean Language (/오/-/우/ 합성모음 연속체에 대한 중국인 한국어 학습자의 청지각적 경계)

  • Yun, Jihyeon;Kim, EunKyung;Seong, Cheoljae
    • Phonetics and Speech Sciences
    • /
    • v.7 no.4
    • /
    • pp.111-121
    • /
    • 2015
  • The present study examines the auditory boundary between Korean /o/ and /u/ on a synthesized vowel continuum as perceived by Chinese learners of Korean. Previous studies have reported that Chinese learners have difficulty pronouncing the Korean monophthongs /o/ and /u/. In this experiment, a nine-step continuum was resynthesized in Praat from a vowel token produced in isolation by a male announcer. F1 and F2 were shifted synchronously in equal quarter-tone (qtone) steps, while F3 and F4 were held constant across all stimuli. A forced-choice identification task was performed by advanced learners whose native language is Mandarin Chinese, and their data were compared with those of a native Korean group. ROC (Receiver Operating Characteristic) analysis and logistic regression were performed to estimate the perceptual boundary. The results indicate that the learner group uses a different auditory criterion on the continuum from the native Korean group. This suggests that more importance should be placed on hearing and listening training in order to acquire the phoneme categories of the two vowels.
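
The stimulus design and boundary estimate described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' procedure: the endpoint formant values are hypothetical placeholders, a quarter tone is taken as 1/24 of an octave, and the boundary is read off an already fitted logistic model of the assumed form logit(p) = intercept + slope * step. The function names are invented for this example.

```python
import math

QUARTER_TONES_PER_OCTAVE = 24  # assumption: one quarter tone = 1/24 octave


def hz_to_qtone(freq_hz, ref_hz=1.0):
    """Convert a frequency in Hz to quarter tones relative to ref_hz."""
    return QUARTER_TONES_PER_OCTAVE * math.log2(freq_hz / ref_hz)


def qtone_to_hz(qtone, ref_hz=1.0):
    """Convert quarter tones relative to ref_hz back to Hz."""
    return ref_hz * 2.0 ** (qtone / QUARTER_TONES_PER_OCTAVE)


def formant_continuum(start_hz, end_hz, n_steps=9):
    """Return n_steps frequencies equally spaced on the quarter-tone scale."""
    start_qt = hz_to_qtone(start_hz)
    end_qt = hz_to_qtone(end_hz)
    step = (end_qt - start_qt) / (n_steps - 1)
    return [qtone_to_hz(start_qt + i * step) for i in range(n_steps)]


def logistic_boundary(intercept, slope):
    """Continuum step at which the fitted response probability equals 0.5."""
    return -intercept / slope


# Hypothetical endpoint values, chosen only for illustration.
f1_steps = formant_continuum(450.0, 350.0)
f2_steps = formant_continuum(800.0, 900.0)

# F1 and F2 are shifted together: stimulus i uses (f1_steps[i], f2_steps[i]).
stimuli = list(zip(f1_steps, f2_steps))
```

Because the steps are equal on a logarithmic scale, the ratio between neighbouring formant values is constant, while the difference in Hz is not.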

The Acoustic Analysis of Diphthongs of Jeju Dialect Speakers in their 20s, 50s, and 70s and their Diphthong Inventories (제주방언화자의 세대별(20대, 50대, 70대) 이중모음의 음향분석과 이중모음체계)

  • Kim, Won-Bo;Byun, Gil-Ja;Ko, Mi-Sook
    • Speech Sciences
    • /
    • v.14 no.4
    • /
    • pp.99-114
    • /
    • 2007
  • This study acoustically analyzed the diphthongs of native Jeju speakers in their 70s, 50s, and 20s to observe their diphthong inventories. Ninety-six subjects participated in the recording, which used a set of picture cards. Results show that Jeju dialect speakers in their 70s distinguish /we/ and /wɛ/, and /yo/ and /yc/, but have difficulty pronouncing /ye/, /yɛ/, and /iy/ correctly. The diphthong inventory of Jeju dialect speakers in their 50s is in a transitional stage. They share the inventory of speakers in their 70s, which contains /we/ and /wɛ/, and /yo/ and /yc/, but they do not produce these diphthongs as clearly as the older group. Speakers in their 50s also share a feature with those in their 20s, because both age groups show the same pronunciation of /iy/. The youngest group seldom pronounces /we/ and /wɛ/, or /yo/ and /yc/, correctly, unlike native Jeju speakers in their 70s, but they can easily pronounce the diphthong /iy/ like standard Korean speakers.

A Comparison of Resonance Parameters before and after Pharyngeal Flap Surgery:A Preliminary Report (인두피판술 전.후의 공명파라미터의 비교: 예비연구)

  • Kang, Young-Ae;Kang, Nak-Heon;Lee, Tae-Yong;Seong, Cheol-Jae
    • Phonetics and Speech Sciences
    • /
    • v.1 no.3
    • /
    • pp.133-144
    • /
    • 2009
  • Pharyngeal flap surgery changes the space and shape of the oral cavity and vocal tract, and these changes alter resonance. The purpose of this study was to determine the most reliable and valuable parameters for evaluating hypernasality, in order to distinguish two patients before and after pharyngeal flap surgery. Each patient was asked to speak the vowels /a/, /i/, /u/, /e/, /o/ clearly for voice recording. Nine parameters were measured for each vowel: formants (F1, F2, F3), bandwidths (BW1, BW2, BW3), LPC energy slope ($\Delta$ |A2-A1/F2-F1|), and band energy (0-500 Hz, 500-1000 Hz). In discriminant analyses of the acoustic parameters, the vowels /a/ and /e/ were not significant, but the vowels /i/, /u/, and /o/ separated the two conditions effectively. Recognition scores of 95%, 100%, and 100% were reached for the vowels /i/, /u/, and /o/, respectively. The results showed that F2, BW3, and LPC slope are more important parameters than the others. Finally, a least-squares fit showed a relation between the perceptual evaluation score and the LPC energy slope.
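
One possible reading of the LPC energy slope notation in this abstract is sketched below. It assumes that A1 and A2 are the LPC spectral amplitudes (in dB) at the first two formant peaks and that the slope is the absolute amplitude difference divided by the frequency difference. The abstract does not spell this out, so both the interpretation and the function name `lpc_energy_slope` are assumptions made for illustration.

```python
def lpc_energy_slope(a1_db, a2_db, f1_hz, f2_hz):
    """Absolute slope |(A2 - A1) / (F2 - F1)| in dB per Hz.

    a1_db, a2_db: assumed LPC spectral amplitudes at the F1 and F2 peaks.
    f1_hz, f2_hz: the first and second formant frequencies.
    """
    if f2_hz == f1_hz:
        raise ValueError("F1 and F2 must differ to define a slope")
    return abs((a2_db - a1_db) / (f2_hz - f1_hz))


# Hypothetical values: a 10 dB drop between peaks that are 1000 Hz apart.
example_slope = lpc_energy_slope(30.0, 20.0, 500.0, 1500.0)
```

A steeper slope under this reading means the spectral envelope falls off faster between the first two formant peaks.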

The Comprehension and Production of Tense Markings in Language Delayed Children and Typically Developing Children (언어발달지체아동과 일반아동의 시제 표지 이해 및 산출 특성)

  • Jo, Miok;Choi, Soyoung;Hwang, Mina
    • Phonetics and Speech Sciences
    • /
    • v.6 no.2
    • /
    • pp.123-131
    • /
    • 2014
  • The purpose of this study is to investigate the comprehension and production of various tense markings in Korean-speaking children with and without language delay. Thirty children with language delay (LD) and 30 typically developing (TD) children participated in the study. In each group, half were 4 years old and the other half 7 years old. In both the comprehension and production tasks, 28 verbs containing four types of tense markings were used: past tense '-et ta', two present progressives '-ko itta' and '-enta', and future tense '-elyeko hanta'. In the comprehension task, the children were presented with three printed still scenes from a video recording of a verb action, representing the future, present progressive, and past tense of the verb, respectively. They then listened to the action verb with one of the four tense markings and had to pick the scene that matched the verb tense. In the production task, the children were given one of the three scenes and asked to produce the verb with the appropriate tense marking. In both tasks, the LD children performed significantly worse than the TD children, and the older children performed significantly better than the younger children. Interestingly, the pattern of performance across the different types of tense markings at the two language-age levels was closely similar in LD and TD children. This similarity between groups seemed stronger in the comprehension task than in the production task.

Pilot Study on the Classification for Sasangin by the Voice Analysis (음성분석에 의한 체질진단에 관한 연구)

  • Lee Eui-Ju;Song Kwang-Bin;Choi Hwan-Soo;Yoo Jung-Hee;Kwak Chang-Kyu;Sohn Eun-Hae;Koh Byung-Hee
    • The Journal of Korean Medicine
    • /
    • v.26 no.1 s.61
    • /
    • pp.93-102
    • /
    • 2005
  • Objective: This research was conducted to evaluate the method of sasangin classification by voice analysis. Two pilot tests were designed to answer the following questions: 'What are the conditions for classifying sasangin by voice analysis?' and 'What are the important variances of the /a/ parameters?'. Methods: 122 disease-free, healthy volunteers were diagnosed as sasangin types by QSCC II. First, they said /a/ three times for 2 seconds in their usual voice. Second, they said /a/ for 2 seconds in a high tone, a mid tone, and a low tone. The sounds were collected with a recording program (Cool Edit 2000) through a Sony microphone (ecm-26l). We analyzed the voices with MATLAB, a simulation tool. Results: There were no differences, and there were correlations, when subjects said /a/ three times for 2 seconds in the usual voice. Some parameters were correlated when subjects said /a/ three times for 2 seconds in high, usual, and low speech; the others showed no correlation. We evaluated the sasangin classification method using /a/ voice analysis alone. The average hit ratio was 66.3% (soyangin 67.9%, taeumin 68.0%, soeumin 63.9%). Conclusion: Conditions must be established before the method of sasangin classification by voice analysis is used. The sasangin classification method using /a/ voice analysis alone had a hit ratio of 66.3%.

Optimize the Acoustic Environment Using a Sound Masking Effects of the Audio Signal Compression Principle (음성신호의 압축원리를 이용한 사운드 마스킹 효과로 음향 환경 최적화)

  • Ann, Sook-Hyang
    • Journal of the Korean Institute of Electrical and Electronic Material Engineers
    • /
    • v.28 no.11
    • /
    • pp.748-751
    • /
    • 2015
  • Sound masking system technology artificially generates a constant shielding sound that is uniform across all bands, so that people cannot hear or recognize speech through the noise generated inside the room. It prevents hearing or recognition by continuously emitting a constant, uniform shielding sound across the audible frequency band (20 Hz to 20 kHz), which covers the secure content of speech. As a result, it interferes with or delays decoding in cases of interception, laser eavesdropping, internal solicitation, and recording. The perceived noise disturbance index was also calculated: the average stress index was 10.16, and after the system was applied the average index was 3.07, a significantly lower stress level that indicates improved noise conditions.

The Effects of the Methods of Disguised Voice on the Aural Decision (위장 발화 방법의 차이가 청취 판단에 미치는 영향)

  • Song Min-Chang;Shin Jiyoung;Kang SunMee
    • MALSORI
    • /
    • no.46
    • /
    • pp.25-35
    • /
    • 2003
  • This study deals with disguised voice (or voice disguise) in the field of forensic phonetics. In particular, we studied the effects of different methods of voice disguise on aural decisions. Within the area of nonelectronic, deliberate voice disguise, the methods include lowered pitch, pinched nostrils, falsetto, and whisper. Ten Seoul speakers (5 male, 5 female) recorded 16 sentences. In the aural test, 30 subjects listened to normal and disguised voices and were asked to decide whether or not the speakers were identified. The result is as follows: speaker verification was more difficult for falsetto and whisper than for lowered pitch and pinched nostrils.

A Study On Voice and Articulation in Children with Hearing Impairment (청각장애아동의 음성 및 조음 특성 연구)

  • Park HeeJung;Chae JungHee;Park Hyun;Shin HyeJung;Seok DongIl
    • Proceedings of the KSPS conference
    • /
    • 2003.10a
    • /
    • pp.129-132
    • /
    • 2003
  • The purpose of this study was to investigate the fundamental frequency (F0) of the voice signal, the first to third formants (F1-F3), and duration in children with hearing impairment. Each subject recorded sustained /i/ and /a/, four VbV sequences, and four VsV sequences. Praat 4.1.6 was used for analysis. The results of this study were as follows: First, the F0 of children with hearing impairment was higher than that of normal children. Second, for the vowel /a/, F1, F2, and duration were higher than in normal children. Third, for the vowel /i/, F1 and duration were higher than in normal children, whereas F2 was lower. Therapeutic implications have been drawn.

An Analysis of Pronunciation Errors in Word-initial Onglides in English and a Suggestion of Teaching Method (어두에 나타나는 상향 이중모음의 오류분석 및 지도방안 연구)

  • Choi, Ju-Young;Park, Han-Sang
    • Proceedings of the KSPS conference
    • /
    • 2007.05a
    • /
    • pp.183-186
    • /
    • 2007
  • This study analyzes Korean high school students' pronunciation errors in word-initial onglides in English. For this study, 24 Korean high school students read 34 English words including glide-vowel sequences in word-initial positions and vowel-initial words in a frame sentence. The results showed 2 different error types: glide deletion and vowel distortion. After the analysis of the first recording, the subjects were taught how to pronounce glide-vowel sequences properly in a 60-minute class. Comparison of the analyses of the first and second recordings showed that the subjects improved on the pronunciation of glide-vowel sequences. After the training, the pronunciation errors of diphthongs unique to English, [jɪ], decreased substantially. However, most subjects still had difficulties in pronouncing [wʊ], [wu], and [wo]. There was no significant correlation between English course grade and error reduction.

A study about five-sounds(Gong, Sang, jiao, zhi, yu) of Sasang constitutional sound analysis (오음의 사상의학적 음성분석과 고찰)

  • Kim, Dal-Rea
    • Journal of Sasang Constitutional Medicine
    • /
    • v.15 no.1
    • /
    • pp.50-59
    • /
    • 2003
  • Purpose: The sounds of five animals, which correspond to the five sounds (Gong, Sang, jiao, zhi, yu), are compared with the musical scale in order to look for similarity between the five animals' sounds and the musical scale. Methods (recording): 1. The sounds of five animals (cattle, horse, pheasant, pig, sheep) were recorded on tape. 2. The recordings were transferred to CSL (Computerized Speech Lab). 3. They were analyzed for pitch, formants 1, 2, and 3, and energy. 4. The analysis results (pitch, formants 1, 2, and 3, energy ratio) for the five animals were calculated and compared with the five-note musical scale (five sounds). Result: The ratios of the five animals' sounds are not consistent with the musical scale on any of the five items (pitch, formants 1, 2, and 3, energy). Conclusion: 1. The five-note musical scale has no similarity with the five animals' sounds. 2. The five sounds are presumed to originate from the theoretical background of five-going and to have no relation to the five animals' sounds.
