Search | Korea Science

A Continuous Digits Speech Recognition Applied Vowel Sequence and VCCV Unit HMM (모음열과 VCCV단위 HMM을 이용한 연속 숫자 음성인식)

Youn Jeh-Seon;Chung Kwang-Woo;Hong Kwang-Seok
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.25-28
- /
- 2001
본 논문에서는 조음 효과에 대처할 수 있는 반음절, 반음절 + 반음절 단위 HMM과 모음열 정보를 적용하여 연속 숫자 음성인식을 구현하였다. 모음열 정보를 적용하여 기준모델을 모음이 포함된 HMM단위로만 구성한 시스템과 모든 기준모델과 비교하는 시스템과 성능을 비교하였다. 인식실험결과 인식률의 향상으로 제안된 방법이 효율적임을 확인하였다.
PDF

Connected Korean Digit Speech Recognition Using Vowel String and Number of Syllables (음절수와 모음 열을 이용한 한국어 연결 숫자 음성인식)

Youn, Jeh-Seon;Hong, Kwang-Seok
- The KIPS Transactions:PartA
- /
- v.10A no.1
- /
- pp.1-6
- /
- 2003
In this paper, we present a new Korean connected digit recognition based on vowel string and number of syllables. There are two steps to reduce digit candidates. The first one is to determine the number and interval of digit. Once the number and interval of digit are determined, the second is to recognize the vowel string in the digit string. The digit candidates according to vowel string are recognized based on CV (consonant vowel), VCCV and VC unit HMM. The proposed method can cope effectively with the coarticulation effects and recognize the connected digit speech very well.
https://doi.org/10.3745/KIPSTA.2003.10A.1.001 인용 PDF KSCI

A construction of vowel string dictionary for unlimited word speech recognition (무제한 단어 음성인식을 위한 모음열 사전의 구축)

김동환;윤재선;홍광석
- Proceedings of the Korea Institute of Convergence Signal Processing
- /
- 2000.08a
- /
- pp.177-180
- /
- 2000
기존의 제한적 단어 인식과는 달리 무제한 단어 음성인식에 있어서는 방대한 용량의 단어 모델을 참조로 인식이 이루어지게 되어, 참조모델과 입력패턴과의 비교를 위한 탐색시간이 너무 길어지게 된다. 본 논문에서 제한하는 방법은 무제한 단어 음성인식 시스템을 구축하기 위해 선행되어야 하는 모음열 사전을 구축하는 것이다. 음성인식시 입력패턴과 참조모델에 속한 모든 단어와의 비교를 수행하지 않고, 입력패턴의 모음열을 인식한 후, 인식된 모음열 단어들만을 참조모델에서 인식 후보로 두어 인식을 수행하게 하여 시간적인 측면에서의 효율성을 기하는 것이다. 결과적으로 본 연구 방법은 무제한 단어 음성인식에서의 실시간 처리라는 점에 주 목적을 두었다.
PDF

Influence of standard Korean and Gyeongsang regional dialect on the pronunciation of English vowels (표준어와 경상 지역 방언의 한국어 모음 발음에 따른 영어 모음 발음의 영향에 대한 연구)

Jang, Soo-Yeon
- Phonetics and Speech Sciences
- /
- v.13 no.4
- /
- pp.1-7
- /
- 2021
This study aims to enhance English pronunciation education for Korean students by examining the impact of standard Korean and Gyeongsang regional dialect on the articulation of English vowels. Data were obtained through the Korean-Spoken English Corpus (K-SEC). Seven Korean words and ten English mono-syllabic words were uttered by adult, male speakers of standard Korean and Gyeongsang regional dialect, in particular, speakers with little to no experience living abroad were selected. Formant frequencies of the recorded corpus data were measured using spectrograms, provided by the speech analysis program, Praat. The recorded data were analyzed using the articulatory graph for formants. The results show that in comparison with speakers using standard Korean, those using the Gyeongsang regional dialect articulated both Korean and English vowels in the back. Moreover, the contrast between standard Korean and Gyeongsang regional dialect in the pronunciation of Korean vowels (/으/, /어/) affected how the corresponding English vowels (/ə/, /ʊ/) were articulated. Regardless of the use of regional dialect, a general feature of vowel pronunciation among Korean people is that they show more narrow articulatory movements, compared with that of native English speakers. Korean people generally experience difficulties with discriminating tense and lax vowels, whereas native English speakers have clear distinctions in vowel articulation.
https://doi.org/10.13064/KSSS.2021.13.4.001 인용 PDF KSCI

Recognition of Hangeul Character Using Grapheme Segmentation and Pixel Distribution (자소분할과 픽셀분포를 이용한 한글문자인식)

Cho, Young-Guk;Lee, Dong-Wook
- Proceedings of the KIEE Conference
- /
- 2009.07a
- /
- pp.1919_1920
- /
- 2009
한글 문자 인식에 관한 연구는 통계적 방법과 구조적 방법, 신경 회로망 등 다양한 방법론이 제시되어 왔다. 그러나 한글은 영문이나 숫자에 비해 방대한 문자수와 복잡한 구조로 인하여 인식에 많은 어려움을 가지고 있다. 따라서 본 논문에서는 한글을 가장 단순한 구조인 자음과 모음으로 분리한 뒤 각 개체의 픽셀 분포를 파악하고, 한글의 구조적 특징을 이용하여 자소의 행과 열에서의 peak값과 픽셀의 분포를 그룹으로 나누어 한글을 인식하는 방법을 제시한다.
PDF

An Utterance Verification using Vowel String (모음 열을 이용한 발화 검증)

유일수;노용완;홍광석
- Proceedings of the Korea Institute of Convergence Signal Processing
- /
- 2003.06a
- /
- pp.46-49
- /
- 2003
The use of confidence measures for word/utterance verification has become art essential component of any speech input application. Confidence measures have applications to a number of problems such as rejection of incorrect hypotheses, speaker adaptation, or adaptive modification of the hypothesis score during search in continuous speech recognition. In this paper, we present a new utterance verification method using vowel string. Using subword HMMs of VCCV unit, we create anti-models which include vowel string in hypothesis words. The experiment results show that the utterance verification rate of the proposed method is about 79.5%.
PDF

Robust Speech Recognition Using Missing Data Theory (손실 데이터 이론을 이용한 강인한 음성 인식)

김락용;조훈영;오영환
- The Journal of the Acoustical Society of Korea
- /
- v.20 no.3
- /
- pp.56-62
- /
- 2001
In this paper, we adopt a missing data theory to speech recognition. It can be used in order to maintain high performance of speech recognizer when the missing data occurs. In general, hidden Markov model (HMM) is used as a stochastic classifier for speech recognition task. Acoustic events are represented by continuous probability density function in continuous density HMM(CDHMM). The missing data theory has an advantage that can be easily applicable to this CDHMM. A marginalization method is used for processing missing data because it has small complexity and is easy to apply to automatic speech recognition (ASR). Also, a spectral subtraction is used for detecting missing data. If the difference between the energy of speech and that of background noise is below given threshold value, we determine that missing has occurred. We propose a new method that examines the reliability of detected missing data using voicing probability. The voicing probability is used to find voiced frames. It is used to process the missing data in voiced region that has more redundant information than consonants. The experimental results showed that our method improves performance than baseline system that uses spectral subtraction method only. In 452 words isolated word recognition experiment, the proposed method using the voicing probability reduced the average word error rate by 12％ in a typical noise situation.
PDF

Hunminjungum Keypad (훈민정음 글자판)

Kim, Sungwook
- Journal of Internet Computing and Services
- /
- v.22 no.4
- /
- pp.29-49
- /
- 2021
This paper proposes the Hunminjungum Keypad that applied the creation principle of Hunminjungum to the design of keypad. The proposed keypad arranged 28 letters of Hunminjungum to have correlations with each other between consonants, between vowels, and between consonants and vowels. That is, Consonant buttons are arranged by grouping letters of the same sound by sounds of five voices. And the vowel buttons are arranged at the bottom and the right side of the consonant area according to the position where a vowel is attached to the consonant. In the meantime, Hangul keypads have mainly used 12 button keypads in 4 lines and 3 columns. These keypads have structurally disadvantageous in the touch count and moving distance. Recently, keypads with many letter buttons such as QWERTY and single-vowel are also used a lot. If the number of letter buttons provided in the keypad increases, touch count decreases. And If the letter buttons are arranged to have a correlation with each other, the moving distance becomes smaller. The experimental results show that the proposed keypad has high efficiency in all evaluation factors such as touch count, moving distance and input time.
https://doi.org/10.7472/jksii.2021.22.4.29 인용 PDF KSCI HTML

Continuous Digits Speech Recognition using Semisyllable Unit HMM (반음절 단위 HMM을 이용한 연속 숫자 음성인식)

윤재선;홍광석
- The Journal of the Acoustical Society of Korea
- /
- v.17 no.5
- /
- pp.73-78
- /
- 1998
본 논문에서는 조음 효과에 대처할 수 있는 새로운 음성인식 단위로 반음절, 반음절 +반음절 단위 HMM을 제안하여 연속 숫자 음성인식을 하였다. 반음절 단위는 무음과 안정 구간으로, 반음절+반음절 단위는 안정, 천이, 안정구간으로 구성되어 있고, 음성인식 단위 분 할시 비교적 스펙트럼의 변화가 안정한 모음구간에서 분할하므로 분할 위치가 약간 변하여 도 인식성능에는 큰 영향을 주지 않게 된다. 또한, 제안된 반음절, 반음절+반음절 인식단위 는 그 패턴 안에 다음 숫자열의 정보를 포함하고 있기 때문에 모든 HMM 패턴들과 비교하 는 것이 아니라, 다음 숫자열의 정보를 포함한 HMM 패턴들과 비교한다. 인식실험결과 제 안된 방법이 효율적임을 확인하였다.
PDF

A Study on Digit Modeling for Korean Connected Digit Recognition (한국어 연결숫자인식을 위한 숫자 모델링에 관한 연구)

김기성
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1998.08a
- /
- pp.293-297
- /
- 1998
전화망에서의 연결 숫자 인식 시스템의 개발에 대한 내용을 다루며, 이 시스템에서 다양한 숫자 모델링 방법들을 구현하고 비겨하였다. Word 모델의 경우 문맥독립 whole-word 모델을 구현하였으며, sub-word 모델로는 triphone 모델과 불파음화 자음을 모음에 포함시킨 modified triphone 모델을 구현하였다. 그리고 tree-based clustering 방법을 sub-word 모델과 문맥종속 whole-word 모델에 적용하였다. 이와 같은 숫자모델들에 대해 연속 HMM을 이용하여 화자독립 연결숫자 인식 실험을 수행한 결과, 문맥종속 단어 모델이 문맥독립 단어 모델보다 우수한 성능을 나타냈으며, triphone 모델과 modified triphone 모델은 유사한 성능을 나타냈다. 특히 tree-based clustering 방법을 적용한 문맥종속 단어 모델이 4연 숫자열에 대해 99.8%의 단어 dsltlr률 및 99.1%의 숫자열 인식률로서 가장 우수한 성능을 나타내었다.
PDF

Search Result 25, Processing Time 0.037 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)