• Title/Summary/Keyword: Phonemes


Improvement of Confidence Measure Performance in Keyword Spotting using Background Model Set Algorithm (BMS 알고리즘을 이용한 핵심어 검출기 거절기능 성능 향상 실험)

  • Kim Byoung-Don;Kim Jin-Young;Choi Seung-Ho
    • MALSORI / no.46 / pp.103-115 / 2003
  • In this paper, we apply the Background Model Set (BMS) algorithm, which is used in speaker verification, to improve the calculation of the confidence measure (CM) in speech recognition. The CM expresses the relative likelihood between the recognized models and the antiphone models. In the previous method of calculating the CM, the probability and standard deviation were computed using all phonemes that make up the antiphone models. In this process, the antiphone CM degraded the recognition results and also increased the recognition time. To solve this problem, we studied a method that re-estimates the mean and standard deviation using the BMS algorithm in the CM calculation.

  • PDF
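
The normalization idea in the abstract above can be sketched as follows. This is only an illustration under stated assumptions, not the authors' method: the function name `confidence_measure`, the rule of keeping the top-scoring antiphone models as the background set, and the parameter `set_size` are all hypothetical, since the abstract does not say how the background set is chosen.

```python
from statistics import mean, pstdev


def confidence_measure(target_loglik, antiphone_logliks, set_size):
    """Score the recognized model against a reduced background set.

    Assumption (not from the abstract): the background set consists of the
    `set_size` antiphone models with the highest log-likelihood.
    """
    # Keep only the closest competitors instead of every phoneme model.
    background = sorted(antiphone_logliks, reverse=True)[:set_size]

    # Mean and standard deviation are estimated on the reduced set only.
    mu = mean(background)
    sigma = pstdev(background)

    # Guard against a degenerate set where all scores are identical.
    if sigma == 0.0:
        return target_loglik - mu
    return (target_loglik - mu) / sigma


if __name__ == "__main__":
    # Made-up log-likelihoods, purely for illustration.
    score = confidence_measure(-10.0, [-12.0, -14.0, -30.0, -40.0], set_size=2)
    print(score)
```

Restricting the statistics to a small set means fewer models have to be scored per segment, which is consistent with the recognition-time concern raised in the abstract.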

A Study on the Consonant Classification Using Fuzzy Inference (퍼지추론을 이용한 한국어 자음분류에 관한 연구)

  • 박경식
    • Proceedings of the Acoustical Society of Korea Conference / 1992.06a / pp.71-75 / 1992
  • This paper proposes an algorithm to classify Korean consonant phonemes such as plosives, fricatives, and affricates into lax sounds, glottalized sounds, and aspirated sounds. These three kinds of sounds are a distinctive characteristic of the Korean language that does not exist in languages such as English. This is a study on the classification of 14 Korean consonants (k, t, p, s, c, k', t', p', s', c', kh, ph, ch) as a preliminary stage for Korean phone recognition. LPC cepstral analysis is used to obtain the feature sets for classification. The experiments have two stages. First, using short-time speech signal analysis and the Mahalanobis distance, consonant segments are detected from the original speech signal; then the consonants are classified by fuzzy inference. In computer simulations, the classification rate on the speech data reached 93.75%.

  • PDF

The Korean Word Length Effect on Auditory Word Recognition (청각단어 재인에서 나타난 한국어 단어 길이 효과)

  • Choi Wonil;Nam Kichun
    • MALSORI / no.44 / pp.33-46 / 2002
  • This study was conducted to examine the effect of word length on auditory word recognition. Word length can be defined by several sublexical units, such as letters, phonemes, and syllables. To find out which sublexical units are influential in auditory word recognition, an auditory lexical decision task was used. In Experiment 1, we examined the partial correlation between reaction time and the number of sublexical units, and in Experiment 2, we carried out an ANOVA to find out which sublexical length variable was an influential unit. From these two experiments, we concluded that syllable length was the most important variable in auditory word recognition.

  • PDF

The Statistical Study on the Patients with Functional Articulation Disorders - Centering on the Background Information and Phonological Processes of Errors - (단순 조음장애 환자군에 대한 통계적 연구 -배경정보와 조음 오류 양상을 중심으로-)

  • Pyo Hwa Young
    • MALSORI / no.39 / pp.53-71 / 2000
  • A statistical study was performed on 130 patients who were diagnosed with functional articulation disorders with no physical problems, to investigate their background information and the phonological processes of their errors. The results are as follows: (1) Males showed higher prevalence than females, and 5-year-old patients showed the highest prevalence by age. (2) Most patients showed errors in 2 to 5 phonemes. (3) The most frequent errors were found in plosives and alveolar sounds, and the most frequent phonological processes of errors in terms of manner and place of articulation were stop assimilation and alveolar assimilation, respectively.

  • PDF

Modeling Cross-morpheme Pronunciation Variations for Korean Large Vocabulary Continuous Speech Recognition (한국어 연속음성인식 시스템 구현을 위한 형태소 단위의 발음 변화 모델링)

  • Chung Minhwa;Lee Kyong-Nim
    • MALSORI / no.49 / pp.107-121 / 2004
  • In this paper, we describe a cross-morpheme pronunciation variation model that is especially useful for constructing a morpheme-based pronunciation lexicon to improve the performance of a Korean LVCSR system. Many pronunciation variations occur at morpheme boundaries in continuous speech. Since phonemic context, together with morphological category and morpheme boundary information, affects Korean pronunciation variations, we have distinguished the phonological rules that apply to phonemes within a morpheme from those that apply across morphemes. The results of 33K-morpheme Korean CSR experiments show that an absolute reduction of 1.45% in WER from the baseline performance of 18.42% WER was achieved by modeling the proposed pronunciation variations with a multiple context-dependent pronunciation lexicon.

  • PDF
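
The WER figures quoted in the abstract above can be restated with a short calculation. The resulting WER and the relative reduction are derived here from the two reported numbers and are not stated in the abstract itself; the function name `wer_after_reduction` is just for illustration.

```python
def wer_after_reduction(baseline_wer, absolute_reduction):
    """Return (new WER, relative reduction), both in percent."""
    new_wer = baseline_wer - absolute_reduction
    relative_reduction = absolute_reduction / baseline_wer * 100.0
    return new_wer, relative_reduction


if __name__ == "__main__":
    # Figures reported in the abstract: 18.42% baseline, 1.45% absolute reduction.
    new_wer, relative = wer_after_reduction(18.42, 1.45)
    print(f"WER after modeling: {new_wer:.2f}%")      # 16.97%
    print(f"Relative reduction: {relative:.2f}%")     # 7.87%
```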

On Detecting the Transition Regions of Phonemes by Using the Asymmetrical Rate of Speech Waveforms (음성파형의 비대칭율을 이용한 음소의 전이구간 검출)

  • Bae, Myung-Jin;Lee, Eul-jae;Ann, Sou-Guil
    • The Journal of the Acoustical Society of Korea / v.9 no.4 / pp.55-65 / 1990
  • To recognize continuous speech, it is necessary to segment the connected acoustic signal into phonetic units. In this paper, we propose a new asymmetrical rate as a parameter for detecting transition regions in continuous speech. The suggested rate represents the rate of change in the magnitude of the speech signal. By comparing this rate with the rates in adjacent frames, the state of a frame can be classified as either steady state or transient state.

  • PDF

The Voice Dialing System Using Dynamic Hidden Markov Models and Lexical Analysis (DHMM과 어휘해석을 이용한 Voice dialing 시스템)

  • 최성호;이강성;김순협
    • Journal of the Korean Institute of Telematics and Electronics B / v.28B no.7 / pp.548-556 / 1991
  • In this paper, Korean spoken continuous digits are recognized using DHMM (Dynamic Hidden Markov Model) and lexical analysis to provide a basis for developing a voice dialing system. The speech is recognized after segmentation into phoneme units. The system can be divided into the segmentation section, the standard speech design section, the recognition section, and the lexical analysis section. In the segmentation section, speech is segmented using the ZCR, the 0th-order LPC cepstrum, and Ai, a time-varying parameter for voice speech detection. In the standard speech design section, 19 phonemes or syllables are trained by DHMM and used as standard speech. In the recognition section, phoneme streams are recognized by the Viterbi algorithm. In the lexical decoder section, the finally recognized continuous digits are output. The experiment showed a recognition rate of 85.1% on data in which 21 classes of 7 continuous digits, combined to cover all occurrences, were each spoken 7 times by 10 men.

  • PDF
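
Of the segmentation parameters named in the abstract above, the ZCR (zero-crossing rate) is simple enough to sketch. The version below is a common textbook definition, assumed here for illustration; the abstract does not give the exact formula, frame length, or thresholds used in the paper, and the function name is hypothetical.

```python
def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs in `frame` whose signs differ.

    Zero is treated as non-negative, so a sample of exactly 0 does not
    by itself count as a crossing.
    """
    if len(frame) < 2:
        return 0.0
    crossings = sum(
        1
        for prev, cur in zip(frame, frame[1:])
        if (prev >= 0) != (cur >= 0)
    )
    return crossings / (len(frame) - 1)


if __name__ == "__main__":
    # A rapidly alternating frame (noise-like) versus a one-sided frame.
    print(zero_crossing_rate([1, -1, 1, -1]))  # 1.0
    print(zero_crossing_rate([1, 2, 3, 4]))    # 0.0
```

A high rate is typical of noise-like segments such as fricatives, while voiced segments tend to give a low rate, which is why the measure is often used as a cue when placing segment boundaries.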

Ortho-phonic Alphabet Creation by the Musical Theory and its Segmental Algorithm (악리론으로 본 정음창제와 정음소 분절 알고리즘)

  • Chin, Yong-Ohk;Ahn, Cheong-Keung
    • Speech Sciences / v.8 no.2 / pp.49-59 / 2001
  • Phoneme segmentation is a very difficult problem in speech processing because a segmentation algorithm has to deal with many kinds of allophones and coarticulation trees. As a result, systems for speech recognition and voice retrieval have a complex structure. To address this, we discuss the possibility of a new segmentation algorithm based on the "subtract or add one third" tripartitioning (삼분손익) of the twelve temperaments (12 율려), first proposed by Prof. T. S. Han. It is close to both oriental and western musical theory. He has also suggested 3 consonant and 3 vowel phonemes in Hunminjungum (훈민정음), invented by King Sejong in the 15th century. In this paper, we propose naming this unit the ortho-phonic phoneme (OPP/정음소), which carries the meaning of 'absoluteness and independence'. The OPP is also applicable to other languages, for example IPA. Finally, we find that this algorithm is consistently applicable to languages in general and is very useful for building voice recognition and retrieval systems.

  • PDF

An Acoustic and Aerodynamic Study of Korean Fricatives and Affricates (한국어 마찰음과 파찰음의 음향학적 및 공기역학적 특성에 관한 연구)

  • Pyo, H.Y.;Lee, J.H.;Choi, S.H.;Sim, H.S.;Choi, H.S.
    • Speech Sciences / v.6 / pp.145-161 / 1999
  • Twenty-one normal native Korean speakers participated as subjects in an acoustic and aerodynamic study of Korean fricatives and affricates, with the aim of applying the results to patients with articulation problems. Their productions of [sa], [s'a], [ca], [$c^{h}a$], [c'a], [asa], [as'a], [aca], [$ac^{h}a$], and [ac'a] were analyzed with CSL and AP II instruments. The results are as follows: (1) Fricatives showed higher minimum and maximum frequencies and longer duration than affricates. (2) Fricatives showed higher peak flow rate and longer rise time than affricates. (3) When the different phonemes were compared with each other, their differences were usually statistically significant, but CV and VCV syllables did not differ significantly, even though VCV syllables showed higher and longer values than CV syllables. (4) In general, lax fricatives and affricates showed lower frequency, higher peak flow rate, shorter frication duration, and longer rise time.

  • PDF

The Statistical Study on the Patients with Functional Articulation Disorders - Centering on the Background Information and Phonological Processes of Errors - (단순 조음장애 환자군에 대한 통계적 연구 - 배경정보와 조음 오류 양상을 중심으로 -)

  • Pyo Hwa Young
    • Proceedings of the KSPS conference / 2000.03a / pp.141-155 / 2000
  • A statistical study was performed on 130 patients who were diagnosed with functional articulation disorders with no physical problems, to investigate their background information and the phonological processes of their errors. The results are as follows: (1) Males showed higher prevalence than females, and 5-year-old patients showed the highest prevalence by age. (2) Most patients showed errors in 2 to 5 phonemes. (3) The most frequent errors were found in plosives and alveolars, and the most frequent phonological processes of errors in terms of manner and place of articulation were stop assimilation and alveolar assimilation, respectively.

  • PDF