• Title/Summary/Keyword: phonetic system

Search Result 313, Processing Time 0.029 seconds

Speaker Identification using Phonetic GMM (음소별 GMM을 이용한 화자식별)

  • Kwon Sukbong;Kim Hoi-Rin
    • Proceedings of the KSPS conference
    • /
    • 2003.10a
    • /
    • pp.185-188
    • /
    • 2003
  • In this paper, we construct phonetic GMM for text-independent speaker identification system. The basic idea is to combine of the advantages of baseline GMM and HMM. GMM is more proper for text-independent speaker identification system. In text-dependent system, HMM do work better. Phonetic GMM represents more sophistgate text-dependent speaker model based on text-independent speaker model. In speaker identification system, phonetic GMM using HMM-based speaker-independent phoneme recognition results in better performance than baseline GMM. In addition to the method, N-best recognition algorithm used to decrease the computation complexity and to be applicable to new speakers.

  • PDF

Improving the Performance of the Continuous Speech Recognition by Estimating Likelihoods of the Phonetic Rules (음소변동규칙의 적합도 조정을 통한 연속음성인식 성능향상)

  • Na, Min-Soo;Chung, Min-Hwa
    • Proceedings of the KSPS conference
    • /
    • 2006.11a
    • /
    • pp.80-83
    • /
    • 2006
  • The purpose of this paper is to build a pronunciation lexicon with estimated likelihoods of the phonetic rules based on the phonetic realizations and therefore to improve the performance of CSR using the dictionary. In the baseline system, the phonetic rules and their application probabilities are defined with the knowledge of Korean phonology and experimental tuning. The advantage of this approach is to implement the phonetic rules easily and to get stable results on general domains. However, a possible drawback of this method is that it is hard to reflect characteristics of the phonetic realizations on a specific domain. In order to make the system reflect phonetic realizations, the likelihood of phonetic rules is reestimated based on the statistics of the realized phonemes using a forced-alignment method. In our experiment, we generates new lexica which include pronunciation variants created by reestimated phonetic rules and its performance is tested with 12 Gaussian mixture HMMs and back-off bigrams. The proposed method reduced the WER by 0.42%.

  • PDF

A Study on Consonant/Vowel/Unvoiced Consonant Phonetic Value Segmentation and Recognition of Korean Isolated Word Speech (한국어 고립 단어 음성의 자음/모음/유성자음 음가 분할 및 인식에 관한 연구)

  • Lee, Jun-Hwan;Lee, Sang-Beom
    • The Transactions of the Korea Information Processing Society
    • /
    • v.7 no.6
    • /
    • pp.1964-1972
    • /
    • 2000
  • For the Korean language, on acoustics, it creates a different form of phonetic value not a phoneme by its own peculiar property. Therefore, the construction of extended recognition system for understanding Korean language should be created with a study of the Korean rule-based system, before it can be used as post-processing of the Korean recognition system. In this paper, text-based Korean rule-based system featuring Korean peculiar vocal sound changing rule is constructed. and based on the text-based phonetic value result of the system constructed, a preliminary phonetic value segmentation border points with non-uniform blocks are extracted in Korean isolated word speech. Through the way of merge and recognition of the non-uniform blocks between the extracted border points, recognition possibility of Korean voice as the form of the phonetic vale has been investigated.

  • PDF

Phonetic Keyboard for International Korean Phonetic Alphabet (국제한글음성문자의 음성학적 자판배열)

  • LEE Hyun Bok;JO Unil
    • MALSORI
    • /
    • no.39
    • /
    • pp.43-51
    • /
    • 2000
  • The aim of this paper is to present a phonetically oriented keyboard array for the International Korean Phonetic Alphabet (IKPA). IKPA is a phonetic alphabet devised on the basis of Hangout (Korean alphabet) (Lee, 1999). Every computer has a keyboard as its input device and the English keyboard array is hewn as 'QWERTY' system, which represents the first six letters of the second line of the keyboard. This array is a traditional one devised to protect the congestion of the keys of the mechanical typewriter. To improve the anay of the keyboard, another system named 'Dvorak' has been devised. Likewise, a serious attempt has been made by the authors to work out an efficient keyboard for IKPA representing the manner of vowel and consonant classification. In the phonetic keyboard, the consonant symbols are arranged in the left hand side according to the Place and mauler of the articulation and the vowel symbols in the right hand side according to the vowel quadrilateral.

  • PDF

The Acoustic Analysis of the Diphthongs in Jeju Dialect (제주방언 이중모음의 음향분석)

  • Kim, Won-Bo
    • Speech Sciences
    • /
    • v.12 no.2
    • /
    • pp.29-41
    • /
    • 2005
  • This paper is to show the diphthong system of Jeju dialect speakers in their 70s or more on the basis of the acoustic analysis of their phonetic data. It is revealed through the analysis of their phonetic data that they clearly distinguish such diphthongs as [we], [w$\epsilon$], [yc] and [yo]. However, this paper shows that they are phonetically insensitive to the separation between [ye] and [y$\epsilon$] and they seldom make a precise pronunciation of diphthong [iy], which male speakers tend to pronounce to be [i] and female speakers to be [i].

  • PDF

Phonetic Transcription based Speech Recognition using Stochastic Matching Method (확률적 매칭 방법을 사용한 음소열 기반 음성 인식)

  • Kim, Weon-Goo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.17 no.5
    • /
    • pp.696-700
    • /
    • 2007
  • A new method that improves the performance of the phonetic transcription based speech recognition system is presented with the speaker-independent phonetic recognizer. Since SI phoneme HMM based speech recognition system uses only the phoneme transcription of the input sentence, the storage space could be reduced greatly. However, the performance of the system is worse than that of the speaker dependent system due to the phoneme recognition errors generated from using SI models. A new training method that iteratively estimates the phonetic transcription and transformation vectors is presented to reduce the mismatch between the training utterances and a set of SI models using speaker adaptation techniques. For speaker adaptation the stochastic matching methods are used to estimate the transformation vectors. The experiments performed over actual telephone line shows that a reduction of about 45% in the error rates could be achieved as compared to the conventional method.

Korean Native Speakers Auditory Cognitive Reactions to Chinese Korean-learners' Pronunciation: Centered on the utterance of consonants in the Korean Language (중국인 학습자의 한국어 발음에 대한 한국인 모어 화자의 청각 인지 반응 -중국인 학습자의 자음 발음을 중심으로-)

  • Kim, Ji-hyung
    • Journal of Korean language education
    • /
    • v.28 no.2
    • /
    • pp.37-60
    • /
    • 2017
  • This research has its basis with focus on the way Korean native speakers recognize Chinese Korean-learners' pronunciation. The objective of the study is to lay the cornerstone for establishing effective teaching-learning strategies for the education of the Korean phonetic system. In this study, the results of the experiment are presented which shows how native speakers of Korean identify Chinese Korean-learners' pronunciation of consonants. In the first place, stimulation tones were created from the original utterances of Chinese Korean-learners and seven scripts were made through the Pratt program. In addition, the subjects were asked to choose what the phonetic materials sounded like. The results of the research are represented as the ratio of frequency of Korean native speakers' response to each utterance to the total frequency. In addition, the paired t-test was taken in order to explore any relatedness to the changes in the level of proficiency of the Korean phonetic system, ranging from beginners to advanced learners. The outcome shows that the mistakes which Chinese Korean-learners make in pronouncing the consonants of Korean are relatively well-reflected in Korean native speakers' auditory cognitive reactions. To put it concretely, there is some difficulty in differentiating lax consonants from aspirates in the cases of plosives and affricates, but relatively little trouble with fortes. However, it is revealed that there is also a slight difference in relation to articulatory positions in detailed aspects. To provide an effective teaching method for the Korean phonetic system, it is essential to comprehend learners' phonetic mistakes through the precise analysis of data in terms of 'production.' Also, a more meticulous observation of 'phenomena' must be made through verification from the view of 'reception,' as attempted in this study. A more thorough diagnosis by applying methodology makes it possible to lay the foundation for developing effective teaching-learning strategies for the instruction of the Korean phonetic system. This study has its significance in making such attempts.

Study on the Recognition of Spoken Korean Continuous Digits Using Phone Network (음성망을 이용한 한국어 연속 숫자음 인식에 관한 연구)

  • Lee, G.S.;Lee, H.J.;Byun, Y.G.;Kim, S.H.
    • Proceedings of the KIEE Conference
    • /
    • 1988.07a
    • /
    • pp.624-627
    • /
    • 1988
  • This paper describes the implementation of recognition of speaker - dependent Korean spoken continuous digits. The recognition system can be divided into two parts, acoustic - phonetic processor and lexical decoder. Acoustic - phonetic processor calculates the feature vectors from input speech signal and the performs frame labelling and phone labelling. Frame labelling is performed by Bayesian classification method and phone labelling is performed using labelled frame and posteriori probability. The lexical decoder accepts segments (phones) from acoustic - phonetic processor and decodes its lexical structure through phone network which is constructed from phonetic representation of ten digits. The experiment carried out with two sets of 4continuous digits, each set is composed of 35 patterns. An evaluation of the system yielded a pattern accuracy of about 80 percent resulting from a word accuracy of about 95 percent.

  • PDF

The Study of Phonetic Research Methodology in Korean English Grammar ("선영문법(鮮英文法)"에 나타난 음성학 연구 방법에 대한 고찰)

  • Kim, Hyoung-Youb
    • Lingua Humanitatis
    • /
    • v.7
    • /
    • pp.291-309
    • /
    • 2005
  • It hasn't been long time since English language was introduced in Korea. At the end of the 18th century the importance of the way of using English properly started to be recognized as Chosun (former country in Korean peninsula) began to conclude a treaty with foreign countries. A lot of Koreans could learn the western culture by the acquired knowledge of English. One of the main factors opening the secluded nation to the world was the member of missionary from outside of Korea. As the number of missionaries increased those who already came to Korea found the necessity of wiring a sort of guidebook of Korean language for the newly dispatched missionaries. The book $\ulcorner$Korean English Grammar$\lrcorner$(written by Horace Grant Underwood in 1890), was the first one that linguistically compared the part of speech and the clausal structures of Korean and English. The revised one of the same book was written by the son, Horace Horton Underwood, in 1914. The revised one newly included the phonetic aspect of Korean language. In this paper the phonetic part of the book will be considered carefully in order to find how recent phonetic methodology has been applied to account for the Korean phonetic features.

  • PDF

A Study on Phonetic Value - Transcription Look-Up Table Generation for Postprocessing of Voice Recognition (음성인식 후처리를 위한 음가-표기 변환표 생성에 관한 연구)

  • 김경징;최영규;이상범
    • Journal of the Korea Computer Industry Society
    • /
    • v.3 no.5
    • /
    • pp.585-594
    • /
    • 2002
  • This paper, describes about creation and implementation of phonetic value- transcription conversion table for postprocessing of the voice recognition. Transcription set generator, which produces transcription set that is pronounced as recognized phonetic value, is designed and implemented to postprocess for the voice recognition system which recognizes syllable unit phonetic value Phonetic value-transcription conversion table is produced with transcription-phonetic value conversion table produced by modeling standard pronunciation on petrinet. To show that phonetic value-transcription conversion table produces correct transcription set, transcription set generator is designed and implemented. This paper proves that correct transcription set is produced, which is including pre-vocalization transcription as a result of experimenting standard pronunciation examples and the words randomly sampled from pronunciation dictionary.

  • PDF