• Title/Abstract/Keywords: vowel addition

Acoustic Characteristics of the Voices of Korean Normal Adults by Gender on MDVP

  • 김재옥
    • 말소리와 음성과학 / Vol. 1, No. 4 / pp. 147-157 / 2009
  • The purpose of this study is to develop a normal voice database and to analyze the acoustic characteristics of Korean adults' voices by gender using MDVP. Eight categories covering the 34 MDVP parameters were analyzed in recordings of the vowel /a/ produced by 170 Korean normal adults. Among them, the Fundamental Frequency Parameters and Frequency Perturbation Parameters differed significantly by gender. In addition, the Fundamental Frequency Parameters in our data were remarkably different from the reference data suggested in the MDVP program, which is currently used in clinics. Therefore, the data obtained in the current study can be used effectively as standard MDVP parameter values for diagnosing voice disorders in Korean adults.

A Vowel Discrimination of Korean Monophthongs [i, e, a, o, u, ɯ] Using Vocal Tract Magnetic Resonance Image and F1/F2

  • 성철재;박종원;김귀룡
    • 대한음성학회지:말소리 / No. 56 / pp. 103-125 / 2005
  • We present a new method of measuring the volume and cross-sectional area of the vocal tract from magnetic resonance images. The vocal tract was divided at two constriction points on the horizontal and vertical planes. The ratios of the volumes of the vocal tract segments to the volume of the entire vocal tract play a crucial role in discriminating Korean monophthongs: the vowels were successfully discriminated by these ratios. The discriminant analysis also demonstrated that the acoustic parameters F1 and F2, in addition to the segment volumes, serve as significant parameters in discriminating Korean monophthongs.

ACOUSTIC CHARACTERISTICS OF KOREAN TRADITIONAL SINGING VOICE: A PRELIMINARY REPORT

  • Moon, Seung-Jae
    • 대한음성학회:학술대회논문집 / 대한음성학회 October 1996 Conference Proceedings / pp. 367-371 / 1996
  • Most Koreans agree that the Korean traditional singing voice has a very peculiar sound compared to the Western singing voice. The goal of this paper is to investigate the acoustic characteristics of the Korean traditional singing voice called 'Pansori'. Materials from three male and four female professional singers were analyzed. Their singing was compared with their own conversation and with the conversation of other non-singers. Long-term average spectra indicated that all the singers showed much less spectral tilt than non-singers. The phenomenon was prevalent for professional singers not only in their singing but also in their conversation. This suggests that it is not the result of a temporary effort but may involve a certain permanent change in their physiological configuration. (To assess this hypothesis, the voice source should be examined directly. Therefore, the use of a Rothenberg mask (Rothenberg, 1973) is strongly recommended in further research.) In addition to the long-term average spectra, individual vowel formants will be studied later.

Recognition of Virtual Written Characters Based on Convolutional Neural Network

  • Leem, Seungmin;Kim, Sungyoung
    • Journal of Platform Technology / Vol. 6, No. 1 / pp. 3-8 / 2018
  • This paper proposes a technique, based on a convolutional neural network (CNN), for recognizing online handwritten cursive data obtained by tracing the motion trajectory of a user writing in 3D space. Recognizing a virtual character input by the user in 3D space is difficult because the input includes both character strokes and movement strokes. In this paper, we divide each syllable into consonant and vowel units by using a labeling technique, in addition to the localization of letter strokes and movement strokes from the previous study. The coordinate information of the separated consonants and vowels is converted into image data, and Korean handwriting recognition is performed using a convolutional neural network. After training the neural network on 1,680 syllables written by five writers, the accuracy was measured on new writers who did not participate in writing the training data. The accuracy of the CNN-based phoneme-level recognition is 98.9%. The proposed method has the advantage of drastically reducing the amount of training data compared to syllable-based learning.

A Comparative Analysis on English Vowels of Korean Students by Formant Frequencies

  • 황영순
    • 음성과학 / Vol. 8, No. 4 / pp. 221-228 / 2001
  • The purpose of this study is to analyze, by measuring formant frequencies, the problems that Korean students, who are accustomed to the acoustic structure of Korean vowels, have when they pronounce English vowels. The experimental results show that the pronunciation of English vowels by Korean students is partially influenced by their Korean vowels. There is little distinction between /i/ and /I/ or between /U/ and /u/ because Korean pronunciation lacks a contrast between short and long vowels. Also, as observed in typical Korean vowel pronunciation, there is little difference between the F1 values of /ɛ/ and /æ/ for Korean speakers, resulting in inaccurate English pronunciation. In addition, compared to native English speakers, Korean speakers show the biggest difference in the F1 value of /ɔ/. The fact that their pronunciation of /ɔ/ covers the /e/, /ʌ/ and /ɔ/ positions probably accounts for this phenomenon. The results of this experiment show the interference of Korean in some English vowels produced by native Korean speakers.

Reduction of Unstressed Prevocalic /u/ in English

  • Hwangbo, Young-Shik
    • 영어영문학 / Vol. 55, No. 6 / pp. 1139-1161 / 2009
  • This paper deals with the reduction of unstressed prevocalic /u/ and the appearance of /w/, which are observed in such word pairs as ambiguity [ˌæm bǝ ˈgju: ǝ ti] - ambiguous [æm ˈbɪ gjǝ wǝs]. This phenomenon is recorded in Merriam-Webster Online Dictionary, Webster's Third New International Dictionary, Unabridged, and the draft revisions of Oxford English Dictionary Online. Since this phenomenon has not been studied in detail until now, this paper aims 1) to collect the data related to the reduction of unstressed prevocalic /u/, 2) to classify them systematically, and 3) to explain the phenomenon in terms of Optimality Theory. In the course of the analysis, Prevocalic Lengthening, which is crucial to the preservation of unstressed prevocalic /u/, is reinterpreted as one of the ways to prevent hiatus (annual /æ nju: ǝl/). /w/-insertion is another way to prevent hiatus (annual /æ njǝ wǝl/). In addition, it is argued that prevocalic /u/ behaves differently from prevocalic /i/ due to the difference in the articulators involved.

The Effects of Pitch Increasing Training (PIT) on Voice and Speech of a Patient with Parkinson's Disease: A Pilot Study

  • Lee, Ok-Bun;Jeong, Ok-Ran;Shim, Hong-Im;Jeong, Han-Jin
    • 음성과학 / Vol. 13, No. 1 / pp. 95-105 / 2006
  • The primary goal of therapeutic intervention in dysarthric speakers is to increase speech intelligibility. Deciding which features are critical to increasing intelligibility is very important in speech therapy. The purpose of this study is to examine the effects of pitch increasing training (PIT) on the speech of a subject with Parkinson's disease (PD). The PIT program focuses on increasing pitch while a vowel is sustained at the same loudness. The loudness level is somewhat higher than the habitual loudness. A 67-year-old female with PD participated in the study. Speech therapy was conducted in 4 sessions (200 minutes) over one week. Before and after the treatment, acoustic, perceptual and speech naturalness evaluations were performed for data analysis. A speech and voice satisfaction index (SVSI) was obtained after the treatment. Results showed improvements in voice quality and speech naturalness. In addition, the satisfaction ratings (SVSI) indicated a positive relationship between improved speech production and the satisfaction of the patient and caregivers.

Monophone and Biphone Compound Unit for Korean Vocabulary Speech Recognition

  • 이기정;이상운;홍재근
    • 한국컴퓨터산업학회논문지 / Vol. 2, No. 6 / pp. 867-874 / 2001
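  • In this paper, we propose a recognition-unit representation that takes the pronunciation characteristics of Korean into account, reducing recognition time while also reflecting coarticulation. The proposed recognition unit is a compound of monophones and biphones: monophone units are applied to vowels, which exhibit stable characteristics, and biphone units are applied to consonants, which vary with adjacent vowels. In word recognition experiments on the PBW455 database, the compound unit representation achieved a recognition rate similar to that of triphone units while reducing recognition time by 57%, and it achieved an improved recognition rate with similar recognition time compared to syllable units. It also required fewer models than triphone or syllable units, which reduced memory usage.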
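
The compound-unit idea in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the romanized vowel set, the `to_compound_units` helper, the `+`/`-` label notation, and the rule for choosing which adjacent vowel supplies the context (the following vowel if there is one, otherwise the preceding vowel) are all assumptions made for illustration.

```python
# Hypothetical romanized vowel inventory; not taken from the paper.
VOWELS = {"a", "e", "i", "o", "u", "eo", "eu"}


def to_compound_units(phones):
    """Map a phone sequence to monophone (vowel) and biphone (consonant) labels.

    Assumed rule: a consonant takes the following vowel as context when there
    is one, otherwise the preceding vowel. The abstract does not specify this.
    """
    units = []
    for idx, phone in enumerate(phones):
        if phone in VOWELS:
            # Vowels are treated as stable, so they stay context-independent.
            units.append(phone)
            continue
        nxt = phones[idx + 1] if idx + 1 < len(phones) else None
        prev = phones[idx - 1] if idx > 0 else None
        if nxt in VOWELS:
            units.append(f"{phone}+{nxt}")   # consonant with right vowel context
        elif prev in VOWELS:
            units.append(f"{prev}-{phone}")  # consonant with left vowel context
        else:
            units.append(phone)              # no adjacent vowel: plain monophone
    return units


if __name__ == "__main__":
    print(to_compound_units(["h", "a", "n", "g", "u", "k"]))
```

Because vowels never carry context, the number of distinct labels grows only with consonant-vowel pairs, which is in line with the abstract's remark that the compound representation needs fewer models than triphone units.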

Development of Parameters for Diagnosing Laryngeal Diseases

  • Kim, Yong-Ju;Wang, Soo-Geun;Kim, Gi-Ryun;Kwon, Soon-Bok;Jeon, Kye-Rok;Back, Moo-Jin;Yang, Byung-Gon;Jo, Cheol-Woo;Kim, Hyung-Soon
    • 음성과학 / Vol. 10, No. 1 / pp. 117-129 / 2003
  • Many people suffer from various laryngeal diseases. Since voice changes are easy to notice, acoustic analysis can help diagnose these diseases. Several attempts have been made to clarify the relation between acoustic parameters and the state of diseased vocal folds, but no decisive parameters have been found yet. The purpose of this study was to select and develop parameters useful for diagnosing and differentiating laryngeal diseases. We examined eight MDVP parameters and two additional parameters, MFCC and LPC, obtained from the production of an open vowel by 252 subjects with or without laryngeal diseases. Using a statistical procedure based on artificial neural networks, we attempted to differentiate laryngeal disease groups. Results showed that the LPC parameters yielded the highest differentiation rate by the networks, followed by the MFCC and the MDVP parameters. In addition, Jita, Shim and NHR proved to be the better parameters among the MDVP parameters for diagnosing laryngeal diseases.

Comparison of English and Korean speakers for the nasalization of English stops

  • Yun, Ilsung
    • 말소리와 음성과학 / Vol. 7, No. 3 / pp. 3-11 / 2015
  • This study compared English and Korean speakers with regard to the nasalization of the English stops /b, d, g, p, t, k/ before a nasal within and across a word boundary. Nine English and thirty Korean speakers participated in the experiment. We used 37 speech items with different grammatical structures. Overall, the English informants rarely nasalized the stops, while the Korean informants generally nasalized them to a great extent, though the degree varied widely from no nasalization to almost complete nasalization. In general, voiced stops were more likely to be nasalized than voiceless stops. Also, the alveolar stops /d, t/ tended to be nasalized the most, the bilabial stops /b, p/ the second most, and the velar stops /g, k/ the least. Moreover, the closer the grammatical relationship between neighboring words, the more likely stop nasalization was to occur. In contrast, Korean syllabification - the addition of the vowel /i/ to the final stops - worked against stop nasalization. On the other hand, the different stress (accent) or rhythm effects of the two languages are assumed to contribute to the significantly different nasalization between English and Korean speakers. The spectrum of stop nasalization obtained from this study can be used as an index to measure how close a given Korean speaker's stop nasalization is to that of English speakers.