• Title/Summary/Keyword: phonetic analysis

The Study on Intraoral Pressure, Closure Duration, and VOT During Phonation of Korean Bilabial Stop Consonants (한국어 양순 파열음 발음시 구강내압과 폐쇄기, VOT에 대한 연구)

  • Pyo Hwa Young;Choi Hong Shik
    • Proceedings of the KSPS conference / 1996.10a / pp.390-398 / 1996
  • An acoustic analysis was performed on 20 normal subjects, who spoke nonsense syllables composed of the Korean bilabial stops (/p/, /$p^{*}$/, /ph/) and a preceding and/or following vowel /a/ (that is, [pa, $p^{*}a$, pha, apa, $ap^{*}a$, apha]) with an ultraminiature pressure sensor in their mouths. The speech materials were phonated twice, once with a moderate voice and once with a loud voice. The acoustic signal and intraoral pressure were recorded simultaneously on a computer. With these procedures we measured the intraoral pressure, closure duration, and VOT of Korean bilabial stops, and compared the values according to the intensity of phonation and the position of the target consonant. Intraoral pressure was measured as the peak value of the intraoral pressure wave; closure duration as the time interval between the onset of intraoral pressure build-up and the burst marking the release of the closure; and voice onset time (VOT) as the time interval between the burst and the onset of glottal vibration. The heavily aspirated bilabial stop /ph/ showed the highest intraoral pressure, the unaspirated /$p^{*}$/ the second highest, and the slightly aspirated /p/ the lowest. Syllable-initial bilabial stops showed higher intraoral pressure than word-initial stops, and loudly phonated consonants showed higher values than moderately phonated ones. Closure duration was longest for /$p^{*}$/ and shortest for /p/, and it was longer in word-initial position and in the moderate voice. VOT, from longest to shortest, followed the order /ph/, /p/, /$p^{*}$/, and it was shorter when the consonant was in intervocalic position and when it was phonated with a loud voice.
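
The three measurement definitions in the abstract above reduce to simple arithmetic on event times. The following is a minimal sketch, assuming the pressure onset, burst, and voicing onset have already been located in the recordings; the class name, field names, and sample values are hypothetical and do not come from the study.

```python
from dataclasses import dataclass


@dataclass
class StopToken:
    """One stop token; all field names are hypothetical, times are in seconds."""
    pressure_onset: float   # onset of intraoral pressure build-up
    burst: float            # release of the closure
    voicing_onset: float    # onset of glottal vibration after the burst
    pressure_samples: list  # intraoral pressure trace (arbitrary units)


def measure(token: StopToken) -> dict:
    """Apply the abstract's definitions of peak pressure, closure duration, and VOT."""
    return {
        "peak_pressure": max(token.pressure_samples),
        "closure_duration": token.burst - token.pressure_onset,
        "vot": token.voicing_onset - token.burst,
    }


# Made-up example values, for illustration only.
example = StopToken(0.100, 0.220, 0.290, [0.0, 3.1, 6.4, 5.0])
measurements = measure(example)
```

In practice the hard part is locating the three events reliably; this sketch only covers the step after that.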

Coarticulation and vowel reduction in the neutral tone of Beijing Mandarin

  • Lin Maocan
    • Proceedings of the KSPS conference / 1996.10a / pp.207-207 / 1996
  • The neutral tone is one of the most important distinguishing features of Beijing Mandarin, but there are two completely different views on its linguistic function: a special tone (Xu, 1980) versus weak stress (Chao, 1968). In this paper, the acoustic manifestation of the neutral tone is explored to show that it is closely related to weak stress. A total of 122 disyllabic words in which the second syllable carries the neutral tone, including 22 stress pairs, were uttered by a native male speaker of the Beijing dialect and analysed with a Kay Digital Sonagraph 5500-1. The results of the acoustic analysis are as follows: 1) The first two formants of the medial and the syllabic vowel move towards those of the central vowel by a greater magnitude in a syllable with the neutral tone than in a syllable with any of the four normal tones. In addition, the vowel ending and the nasal codas /n/ and /ŋ/ in a syllable with the neutral tone tend to be deleted. 2) In syllables with the neutral tone, there is strong carryover coarticulation between the medial and syllabic vowel and the preceding unvoiced consonant. In general, the vowel moves towards the position of the central vowel by a greater magnitude after a coronal consonant than after a labial or velar consonant. 3) In a syllable with the neutral tone, when and only when it precedes a syllable with tone 4, the high vowel following [f], [ts'], [s], [tʂ'], [ʂ], [tɕ'] or [ɕ] tends to be voiceless. 4) The acoustic results for the 22 stress pairs show that the duration of the syllable with the neutral tone is on average reduced to 55% of that of the syllable with the four normal tones, and the duration of the final in the syllable with the neutral tone is on average reduced to 45% of that of the final in the syllable with the four normal tones (Lin & Yan 1980). 5) The F0 contour of the neutral tone is highly dependent on the preceding normal tone (Lin & Yan 1993).
For a number of languages it has been found that the vowel space is reduced as the level of stress placed upon the vowel is reduced (Nord 1986). We therefore conclude that the syllable with the neutral tone is related to weak stress (Lin & Yan 1990). The neutral tone is not a special tone, because its F0 contour depends on the preceding normal tone.

Vowel Classification of Imagined Speech in an Electroencephalogram using the Deep Belief Network (Deep Belief Network를 이용한 뇌파의 음성 상상 모음 분류)

  • Lee, Tae-Ju;Sim, Kwee-Bo
    • Journal of Institute of Control, Robotics and Systems / v.21 no.1 / pp.59-64 / 2015
  • In this paper, we demonstrate the usefulness of the deep belief network (DBN) in the field of brain-computer interfaces (BCI), especially in relation to imagined speech. In recent years, growing interest in the BCI field has led to the development of a number of useful applications, such as robot control, game interfaces, and exoskeleton limbs. Imagined speech, which could be used in devices for communication or military purposes, is one of the most exciting BCI applications, but implementing such a system poses some problems. In a previous paper, we addressed some of the issues of imagined speech using the International Phonetic Alphabet (IPA), although that work needed to be extended to multi-class classification problems. This paper therefore proposes a solution for vowel classification in imagined speech. We used the DBN algorithm, a deep learning algorithm, for multi-class vowel classification, and selected four vowel pronunciations from the IPA: /a/, /i/, /o/, and /u/. For the experiment, we obtained 32-channel raw electroencephalogram (EEG) data from three male subjects, with electrodes placed on the scalp over the frontal lobe and both temporal lobes, which are related to thinking and verbal function. The eigenvalues of the covariance matrix of the EEG data were used as the feature vector for each vowel. For comparison with the DBN, we also report the classification results of a back-propagation artificial neural network (BP-ANN). The classification accuracy was 52.04% for the BP-ANN and 87.96% for the DBN, so the DBN was 35.92 percentage points more accurate in multi-class imagined speech classification. In addition, the DBN required much less total computation time. In conclusion, the DBN algorithm is efficient for BCI system implementation.
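
The 35.92 figure reported above is the absolute gap between the two accuracies, not a relative gain. A short worked calculation makes the distinction explicit; the function name is hypothetical, and only the two accuracy values come from the abstract.

```python
def accuracy_gain(baseline: float, improved: float) -> tuple:
    """Return (absolute gain in percentage points, relative gain in percent)."""
    absolute = improved - baseline
    relative = absolute / baseline * 100.0
    return absolute, relative


# Accuracies quoted in the abstract: BP-ANN 52.04%, DBN 87.96%.
absolute_gain, relative_gain = accuracy_gain(52.04, 87.96)
```

Here `absolute_gain` is the 35.92-point difference quoted in the abstract, while `relative_gain` expresses the same difference as a fraction of the BP-ANN accuracy.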

Web Contents Mining System for Real-Time Monitoring of Opinion Information based on Web 2.0 (웹2.0에서 의견정보의 실시간 모니터링을 위한 웹 콘텐츠 마이닝 시스템)

  • Kim, Young-Choon;Joo, Hae-Jong;Choi, Hae-Gill;Cho, Moon-Taek;Kim, Young-Baek;Rhee, Sang-Yong
    • Journal of the Korean Institute of Intelligent Systems / v.21 no.1 / pp.68-79 / 2011
  • This paper presents a system that extracts and analyzes opinion information through Web mining, based on statistics collected from Web content. Users' opinions, which are scattered across several websites, are extracted and analyzed automatically. The system provides an opinion search service that lets users search for positive and negative opinions in real time and check the associated statistics. Users can also search for and monitor other opinion information in real time by entering keywords into the system. Comparison experiments against other techniques showed that the proposed technique performs well. We evaluated the performance of the function that extracts positive/negative opinion information, of the dynamic window and tokenizer techniques applied to multilingual information retrieval, and of the technique that extracts exact multilingual phonetic translations. The experiments were carried out on typical movie review sentences and Wikipedia experiment data as application examples, and the results are analyzed.

The effect of word frequency on the reduction of English CVCC syllables in spontaneous speech

  • Kim, Jungsun
    • Phonetics and Speech Sciences / v.7 no.3 / pp.45-53 / 2015
  • The current study investigated CVCC syllables in spontaneous American English speech to find out whether such syllables are produced as phonological units with a string of segments, showing a hierarchical structure. Transcribed data from the Buckeye Speech Corpus was used for the analysis in this study. The result of the current study showed that the constituents within a CVCC syllable as a phonological unit may have phonetic variations (namely, the final coda may undergo deletion). First, voiceless alveolar stops were the most frequently deleted when they occurred as the second final coda consonants of a CVCC syllable; this deletion may be an intermediate process on the way from the abstract form CVCC (with the rime VCC) to the actual pronunciation CVC (with the rime VC), a production strategy employed by some individual speakers. Second, in the internal structure of the rime, the proportion of deletion of the final coda consonant depended on the frequency of the word rather than on the position of postvocalic consonants on the sonority hierarchy. Finally, the segment following the consonant cluster proved to have an effect on the reduction of that cluster; more precisely, the following contrast was observed between obstruents and non-obstruents, reflecting the effect of sonority: when the segment following the consonant cluster was an obstruent, the proportion of deletion of the final coda consonant was increased. Among these results, the effect of word frequency played a critical role for promoting the deletion of the second coda consonant for clusters in CVCC syllables in spontaneous speech. The current study implies that the structure of syllables as phonological units can vary depending on individual speakers' lexical representation.

A study on the determining of vertical dimension of occlusion of edentulous patients using korean phonetic patterns (한국어 음성모형을 이용한 총의치 환자의 교합고경 결정에 관한 연구)

  • Song, Kwang-Seob;Song, Kwang-Yeob;Cho, Kook-Hyeon
    • Journal of Dental Rehabilitation and Applied Science / v.16 no.3 / pp.187-196 / 2000
  • This study was performed to make it easier to determine the vertical dimension of occlusion of edentulous patients by investigating the interocclusal distances at the physiologic rest position, when speaking the /m/ sound, and when speaking some short Korean sounds, namely /mem/ and /beb/, which were identified in our previous study with dentulous subjects. Ten edentulous subjects (6 men and 4 women) were selected for this study. The frequencies when speaking the /m/, /mem/, and /beb/ sounds were analyzed with the Computerized Speech Lab (CSL™, Model 4300B, Software version 5.X, Kay Elemetrics Co., U.S.A.). The interocclusal distances at the physiologic rest position and when speaking the /m/, /mem/, and /beb/ sounds were measured with the K6 diagnostic system (Myo-tronics, Inc., U.S.A.). The results of this study were as follows: 1. In the acoustic analysis with the Computerized Speech Lab, the frequencies of the /m/, /mem/, and /beb/ sounds of edentulous subjects wearing complete dentures were similar to those of dentulous subjects. 2. In the linear correlation by Pearson's correlation coefficient, the interocclusal distance at the physiologic rest position was most similar to that when speaking the /mem/ sound, followed by the /m/ sound and then the /beb/ sound (p<0.05). In the reliability analysis by Cronbach's alpha, the results were reliable, with an alpha value of 0.97. 3. Levene's test for equality of variance showed that the difference between men and women in the interocclusal distances at the physiologic rest position and when speaking the /m/, /mem/, and /beb/ sounds was not statistically significant (p>0.05).
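
The reliability figure above is Cronbach's alpha, which for k sets of measurements is k/(k-1) × (1 − sum of per-set variances / variance of per-subject totals). The following is a minimal sketch of that standard formula, assuming each row holds one measurement condition and each column one subject; the function name and data are hypothetical and are not the study's measurements.

```python
from statistics import variance


def cronbach_alpha(items: list) -> float:
    """items[i][j] is measurement condition i for subject j (sample variances are used)."""
    k = len(items)
    if k < 2:
        raise ValueError("Cronbach's alpha needs at least two sets of measurements")
    totals = [sum(column) for column in zip(*items)]  # per-subject sums
    item_variance_sum = sum(variance(item) for item in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


# Made-up data: three perfectly consistent conditions for four subjects.
alpha = cronbach_alpha([[1, 2, 3, 4], [2, 3, 4, 5], [3, 4, 5, 6]])
```

With perfectly consistent rows, as in this toy data, alpha reaches its maximum of 1; noisier measurements give lower values.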

Real Time Lip Reading System Implementation in Embedded Environment (임베디드 환경에서의 실시간 립리딩 시스템 구현)

  • Kim, Young-Un;Kang, Sun-Kyung;Jung, Sung-Tae
    • The KIPS Transactions:PartB / v.17B no.3 / pp.227-232 / 2010
  • This paper proposes a real-time lip reading method for the embedded environment. An embedded environment has limited resources compared with a conventional PC environment, so it is hard to run a lip reading system designed for the PC in real time on an embedded device. To solve this problem, this paper proposes methods for lip region detection, lip feature extraction, and spoken word recognition that are suited to the embedded environment. First, the face region is detected using face color information; the exact lip region is then located by finding the positions of both eyes in the detected face region and using geometric relations. To extract features that are robust to lighting changes in the surroundings, histogram matching, lip folding, and a RASTA filter were applied, and the features extracted by principal component analysis (PCA) were used for recognition. In tests in an embedded environment with an 806 MHz CPU and 128 MB of RAM, the processing time ranged from 1.15 to 2.35 sec. depending on the utterance, and the recognition rate was 77%, with 139 of 180 words recognized.

Acoustic-phonetic characteristics of fricatives distortion in functional articulation disorders (기능적 조음음운장애아동의 치조 마찰음 왜곡의 음향음성학적 특성)

  • Yang, Minkyo;Choi, Yaelin;Kim, Eun Yeon;Yoo, Hyun Ji
    • Phonetics and Speech Sciences / v.10 no.4 / pp.127-134 / 2018
  • This study aims to explain the difficulties that children with articulation and phonological disorders have in producing alveolar fricative sounds. It uses five acoustic variables to compare how ordinary children produce alveolar fricatives with how children with articulation and phonological disorders produce them, and thereby identifies objective differences between the two groups. To this end, the study compared 10 children with articulation and phonological disorders and 10 ordinary children according to the phonation type of the alveolar fricative (/s/ and /$s^{*}$/), the type of vowel (/i/, /ε/, /u/, /o/, /ɯ/, /ʌ/, /ɑ/), and the syllable structure (CV, VCV), using acoustic variables including a central moment, skewness, kurtosis, center of gravity, and variance. The results indicate that children with articulation and phonological disorders, compared with ordinary children, have difficulty concentrating quick, momentary, and forceful friction when articulating alveolar fricatives, which require strong energy and are accompanied by tension. Furthermore, the values for the alveolar fricatives of children with articulation and phonological disorders were spread evenly around the average, which means that the overall range of standard deviation values for children with functional phonological disorders is wider than that of ordinary children. In future studies, comparing and analyzing mispronounced sounds involving omission, substitution, and addition across various target groups could be an effective way to help children with functional phonological disorders.
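
The acoustic variables named above are spectral moments. The following is a minimal sketch of how such moments are commonly computed, treating a power spectrum as a probability distribution over frequency. The function name and example spectrum are hypothetical, excess kurtosis is one common convention, and the abstract does not say which software or definitions the authors used.

```python
def spectral_moments(freqs: list, power: list) -> dict:
    """Center of gravity, variance, skewness, and excess kurtosis of a spectrum."""
    total = sum(power)
    if total <= 0:
        raise ValueError("spectrum must contain positive energy")
    weights = [p / total for p in power]  # normalise to a distribution
    cog = sum(f * w for f, w in zip(freqs, weights))
    var = sum((f - cog) ** 2 * w for f, w in zip(freqs, weights))
    m3 = sum((f - cog) ** 3 * w for f, w in zip(freqs, weights))
    m4 = sum((f - cog) ** 4 * w for f, w in zip(freqs, weights))
    return {
        "center_of_gravity": cog,
        "variance": var,
        "skewness": m3 / var ** 1.5,
        "kurtosis": m4 / var ** 2 - 3.0,  # excess kurtosis (assumed convention)
    }


# Made-up symmetric spectrum (Hz, arbitrary power units), for illustration only.
moments = spectral_moments([1000.0, 2000.0, 3000.0], [1.0, 2.0, 1.0])
```

A spectrum whose energy is tightly concentrated gives a small variance, whereas the diffuse frication described for the disordered group would show up as a larger variance.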

The Effects of Secondhand Smoking on Articulators Based on Phonetic Analysis (음성학적 분석 기반의 간접흡연이 조음기관에 미치는 영향)

  • Seo, Kyoung-Won;Kang, Deok-Hyun;Bae, Jung-Su;Jang, Yong-Jo;Yean, Yong-Hem;Lim, Soon-Yong;Min, Ji-Seon;Kim, Bong-Hyun;Ka, Min-Kyoung;Cho, Dong-Uk
    • Proceedings of the Korea Information Processing Society Conference / 2010.11a / pp.648-651 / 2010
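  • Riding the well-being trend, more people now manage their own health, and as negative perceptions of smoking grow, a strong wave of smoking cessation is under way. Even for those who quit, however, it is very difficult to escape tobacco smoke, because smoke in the surroundings still harms the body. In fact, people whose spouse smokes have a 40% higher incidence of heart disease and a 30% higher incidence of lung cancer than those whose spouse does not. In this paper, therefore, to analyze the effect of secondhand smoking on the human articulators, we conducted an experiment that measures, compares, and analyzes the changes in the voice caused by secondhand smoking. To this end, we collected voice samples before and after secondhand smoke exposure and studied the effect of secondhand smoking on the voice by applying vocal fold vibration measures from voice analysis, such as pitch, jitter, and shimmer, together with formants, which are used to analyze the resonating organs of the body.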
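
Jitter and shimmer, two of the vocal fold vibration measures named above, are cycle-to-cycle perturbation measures. The following is a minimal sketch of the common "local" definitions, assuming glottal periods and peak amplitudes have already been extracted. The function name and values are hypothetical, and the abstract does not state which variants the authors applied.

```python
def local_perturbation(values: list) -> float:
    """Mean absolute difference between consecutive values, as a percentage of their mean.

    Applied to glottal periods this gives local jitter;
    applied to cycle peak amplitudes it gives local shimmer.
    """
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return 100.0 * mean_diff / mean_value


# Made-up cycle data, for illustration only.
periods_s = [0.010, 0.011, 0.010, 0.011]  # glottal periods in seconds
amplitudes = [0.80, 0.80, 0.80, 0.80]     # peak amplitude per cycle
jitter_percent = local_perturbation(periods_s)
shimmer_percent = local_perturbation(amplitudes)
```

A before-and-after comparison like the one in the abstract would compute these values on each recording and compare them per speaker.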

Statistical Analysis of Korean Phonological Variations Using a Grapheme-to-phoneme System (발음열 자동 생성기를 이용한 한국어 음운 변화 현상의 통계적 분석)

  • 이경님;정민화
    • The Journal of the Acoustical Society of Korea / v.21 no.7 / pp.656-664 / 2002
  • We present a statistical analysis of Korean phonological variations using a Grapheme-to-Phoneme (GTP) system. The GTP system used for the experiments generates pronunciation variants by applying rules that model obligatory and optional phonemic changes and allophonic changes. These rules are derived from morphophonological analysis and the government's standard pronunciation rules. The GTP system is optimized for continuous speech recognition: it generates phonetic transcriptions for training and constructs a pronunciation dictionary for recognition. In this paper, we describe Korean phonological variations by analyzing the statistics of phonemic change rule applications for the 60,000 sentences in the Samsung PBS Speech DB. Our results show that the most frequent obligatory phonemic variations are, in order, liaison, tensification, aspiration, and nasalization of obstruents, and that the most frequent optional phonemic variations are, in order, initial consonant h-deletion, insertion of a final consonant with the same place of articulation as the next consonant, and deletion of a final consonant with the same place of articulation as the next consonant. These statistics can be used to improve the performance of speech recognition systems.
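
The rule-application statistics described above amount to counting how often each rule fires across a corpus and ranking the rules. The following is a minimal sketch, assuming a GTP system that reports the names of the rules it applied to each sentence. That output format, the function name, and the toy data are assumptions rather than details from the paper.

```python
from collections import Counter


def rule_statistics(applied_rules_per_sentence: list) -> list:
    """Rank rules by how often they were applied; returns (rule, count, share) tuples."""
    counts = Counter(
        rule for rules in applied_rules_per_sentence for rule in rules
    )
    total = sum(counts.values())
    if total == 0:
        return []
    return [(rule, n, n / total) for rule, n in counts.most_common()]


# Made-up rule logs for three sentences, for illustration only.
ranking = rule_statistics([
    ["liaison", "tensification"],
    ["liaison"],
    ["nasalization", "liaison"],
])
```

Keeping obligatory and optional rules in separate logs would reproduce the two rankings reported in the abstract.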