Search | Korea Science

A Tow-stage Recognition Approach Based on Error Pattern Hypotheses for Connected Digit Recognition

Oh, Wook-Kwon;Un, Chong-Kwan
- The Journal of the Acoustical Society of Korea
- /
- v.15 no.3E
- /
- pp.31-36
- /
- 1996
In this paper, a two-stage recognition approach based on error pattern hypotheses is proposed to reduce errors of a connected digit recognizer. In the approach, a conventional recognizer is first used to produce N-best candidate strings, and then error patterns are hypothesized by examining the candidate strings. For substitution error pattern hypotheses, error-pattern-dependent classifiers having more discriminative power than the first-stage classifier are used ; and for insertion and deletion errors, word duration and energy contour information are exploited are exploited to discriminated confusing pairs. Simulation results showed that the proposed approach achieves 15% decrease in word error rate for speaker-independent Korean connected digit recognition when a hidden Markov model-based recognizer is used for the first-stage classifier.
PDF

Phonetic Contrasts of One-syllable Words and Speech Intelligibility in Adults with Hearing Impairments (청각장애 성인의 일음절 낱말대조 명료도 특성)

Kim Soo-Jin;Do Yeon-Ji
- MALSORI
- /
- no.56
- /
- pp.1-13
- /
- 2005
This study examined the speech intelligibility of one-syllable words with phonetic contrasts and analyzed segmental factors that can predict the overall speech intelligibility in hearing-impaired adults. To identify the speech error characteristics, a Korean word list was audio-recorded by 7 hearing-impaired adults, and 35 listeners selected the heard word out of 5 choices. Based in part on previous studies of speech of the hearing impaired, the word list consisted of monosyllabic consonant-vowel-consonant (CVC) real word pairs. Stimulus words included 77 phonetic contrast pairs. The results showed that the percentage of errors in final position (coda) contrast was higher than in any other position in syllable. And the intelligibility deficit factors of phonetic contrast in the hearing-impaired were analyzed through stepwise regression analysis. The overall intelligibility was predicted by the error rate of manner contrast at coda, voicing contrast (homorganic triplets) at onset and high-low contrast at nucleus.
PDF

The Analysis of Relationship between Error Types of Word Problems and Problem Solving Process in Algebra (대수 문장제의 오류 유형과 문제 해결의 관련성 분석)

Kim, Jin-Ho;Kim, Kyung-Mi;Kwean, Hyuk-Jin
- Communications of Mathematical Education
- /
- v.23 no.3
- /
- pp.599-624
- /
- 2009
The purpose of this study was to investigate the relationship between error types and Polya's problem solving process. For doing this, we selected 106 sophomore students in a middle school and gave them algebra word problem test. With this test, we analyzed the students' error types in solving algebra word problems. First, We analyzed students' errors in solving algebra word problems into the following six error types. The result showed that the rate of student's errors in each type is as follows: "misinterpreted language"(39.7%), "distorted theorem or solution"(38.2%), "technical error"(11.8%), "unverified solution"(7.4%), "misused data"(2.9%) and "logically invalid inference"(0%). Therefore, we found that the most of student's errors occur in "misinterpreted language" and "distorted theorem or solution" types. According to the analysis of the relationship between students' error types and Polya's problem-solving process, we found that students who made errors of "misinterpreted language" and "distorted theorem or solution" types had some problems in the stage of "understanding", "planning" and "looking back". Also those who made errors of "unverified solution" type showed some problems in "planing" and "looking back" steps. Finally, errors of "misused data" and "technical error" types were related in "carrying out" and "looking back" steps, respectively.
PDF

A Joint Statistical Model for Word Spacing and Spelling Error Correction Simultaneously (띄어쓰기 및 철자 오류 동시교정을 위한 통계적 모델)

Noh, Hyung-Jong;Cha, Jeong-Won;Lee, GaryGeun-Bae
- Journal of KIISE:Software and Applications
- /
- v.34 no.2
- /
- pp.131-139
- /
- 2007
In this paper, we present a preprocessor which corrects word spacing errors and spelling correction errors simultaneously. The proposed expands noisy-channel model so that it corrects both errors in colloquial style sentences effectively, while preprocessing algorithms have limitations because they correct each error separately. Using Eojeol transition pattern dictionary and statistical data such as n-gram and Jaso transition probabilities, it minimizes the usage of dictionaries and produces the corrected candidates effectively. In experiments we did not get satisfactory results at current stage, we noticed that the proposed methodology has the utility by analyzing the errors. So we expect that the preprocessor will function as an effective error corrector for general colloquial style sentence by doing more improvements.
PDF KSCI

Performance Comparison of Out-Of-Vocabulary Word Rejection Algorithms in Variable Vocabulary Word Recognition (가변어휘 단어 인식에서의 미등록어 거절 알고리즘 성능 비교)

김기태;문광식;김회린;이영직;정재호
- The Journal of the Acoustical Society of Korea
- /
- v.20 no.2
- /
- pp.27-34
- /
- 2001
Utterance verification is used in variable vocabulary word recognition to reject the word that does not belong to in-vocabulary word or does not belong to correctly recognized word. Utterance verification is an important technology to design a user-friendly speech recognition system. We propose a new utterance verification algorithm for no-training utterance verification system based on the minimum verification error. First, using PBW (Phonetically Balanced Words) DB (445 words), we create no-training anti-phoneme models which include many PLUs(Phoneme Like Units), so anti-phoneme models have the minimum verification error. Then, for OOV (Out-Of-Vocabulary) rejection, the phoneme-based confidence measure which uses the likelihood between phoneme model (null hypothesis) and anti-phoneme model (alternative hypothesis) is normalized by null hypothesis, so the phoneme-based confidence measure tends to be more robust to OOV rejection. And, the word-based confidence measure which uses the phoneme-based confidence measure has been shown to provide improved detection of near-misses in speech recognition as well as better discrimination between in-vocabularys and OOVs. Using our proposed anti-model and confidence measure, we achieve significant performance improvement; CA (Correctly Accept for In-Vocabulary) is about 89％, and CR (Correctly Reject for OOV) is about 90％, improving about 15-21％ in ERR (Error Reduction Rate).
PDF

A Method of Intonation Modeling for Corpus-Based Korean Speech Synthesizer (코퍼스 기반 한국어 합성기의 억양 구현 방안)

Kim, Jin-Young;Park, Sang-Eon;Eom, Ki-Wan;Choi, Seung-Ho
- Speech Sciences
- /
- v.7 no.2
- /
- pp.193-208
- /
- 2000
This paper describes a multi-step method of intonation modeling for corpus-based Korean speech synthesizer. We selected 1833 sentences considering various syntactic structures and built a corresponding speech corpus uttered by a female announcer. We detected the pitch using laryngograph signals and manually marked the prosodic boundaries on recorded speech, and carried out the tagging of part-of-speech and syntactic analysis on the text. The detected pitch was separated into 3 frequency bands of low, mid, high frequency components which correspond to the baseline, the word tone, and the syllable tone. We predicted them using the CART method and the Viterbi search algorithm with a word-tone-dictionary. In the collected spoken sentences, 1500 sentences were trained and 333 sentences were tested. In the layer of word tone modeling, we compared two methods. One is to predict the word tone corresponding to the mid-frequency components directly and the other is to predict it by multiplying the ratio of the word tone to the baseline by the baseline. The former method resulted in a mean error of 12.37 Hz and the latter in one of 12.41 Hz, similar to each other. In the layer of syllable tone modeling, it resulted in a mean error rate less than 8.3% comparing with the mean pitch, 193.56 Hz of the announcer, so its performance was relatively good.
PDF

Word Problem with Figures Solving Ability and Error of Boys and Girls - with middle school 3rd grade students - (남녀학생들의 도형 문장제 해결 오류 및 해결력에 대한 비교 분석 - 중학교 3학년 대상으로 -)

Oh, Jeong-Yoon;Ro, Young-Soon
- Journal of the Korean School Mathematics Society
- /
- v.10 no.3
- /
- pp.353-367
- /
- 2007
The purpose of this study was to examine what errors students made in solving word problems with figures and to compare the problem-solving abilities of boys and girls for each type of word problems with figures. It's basically meant to provide information on effective teaching-learning methods about world problems with figures that were given the greatest weight among different sorts of word problems. The findings of the study were as fellows: First, there was no difference between the boys and girls in the types of error they made. Both groups made the most errors due to a poor understanding of sentences, and they made the least errors of making the wrong expression. And the students who gave no answers outnumbered those who made errors. Second, as for problem-solving ability, the boys outperformed the girls in problem solving except variable problems. There was the greatest gap between the two in solving combining problems. Third, they made the average or higher achievement in solving the types of problems that were included much in the textbooks, and made the least achievement in relation to the types of problems that were handled least often in the textbooks.
PDF

The error character Revision System of the Korean using Semantic relationship of sentence component (문장 성분의 의미 관계를 이용한 한국어 오류 문자 교정 시스템)

Park, Hyun-Jae;Park, Hae-Sun;Kang, One-Il;Sohn, Young-Sun
- Journal of the Korean Institute of Intelligent Systems
- /
- v.14 no.1
- /
- pp.28-32
- /
- 2004
Till now, Korean spelling proofreading system has corrected words of a sentence from the relationship of a collocation or the grammatical information of the sentence. In this paper, we propose a system that corrects a word using the relationship among the sememes in a single sentence and substitutes an apt word for a word of the sentence that has the meaningful mistake by a mistyping. The proposed system makes several sentences that are able to communicate with each sememe. The substantives forms meaning tree according to the meaning of the word and the predicate of a sentence defines the meaningful relationship between a substantives of the subject and the object. After this system compares and analyzes the relationship of meaning, it corrects the mistyping of a word in a single sentence that includes an error. If the system finds out the semantic error by the mistyping, it applies the spelling proofreading method that proposed in this paper.
https://doi.org/10.5391/JKIIS.2004.14.1.028 인용 PDF KSCI

Speech Parameters for the Robust Emotional Speech Recognition (감정에 강인한 음성 인식을 위한 음성 파라메터)

Kim, Weon-Goo
- Journal of Institute of Control, Robotics and Systems
- /
- v.16 no.12
- /
- pp.1137-1142
- /
- 2010
This paper studied the speech parameters less affected by the human emotion for the development of the robust speech recognition system. For this purpose, the effect of emotion on the speech recognition system and robust speech parameters of speech recognition system were studied using speech database containing various emotions. In this study, mel-cepstral coefficient, delta-cepstral coefficient, RASTA mel-cepstral coefficient and frequency warped mel-cepstral coefficient were used as feature parameters. And CMS (Cepstral Mean Subtraction) method were used as a signal bias removal technique. Experimental results showed that the HMM based speaker independent word recognizer using vocal tract length normalized mel-cepstral coefficient, its derivatives and CMS as a signal bias removal showed the best performance of 0.78% word error rate. This corresponds to about a 50% word error reduction as compare to the performance of baseline system using mel-cepstral coefficient, its derivatives and CMS.
https://doi.org/10.5302/J.ICROS.2010.16.12.1137 인용 PDF KSCI

Speech Recognition in the Car Noise Environment (자동차 소음 환경에서 음성 인식)

김완구;차일환;윤대희
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.30B no.2
- /
- pp.51-58
- /
- 1993
This paper describes the development of a speaker-dependent isolated word recognizer as applied to voice dialing in a car noise environment. for this purpose, several methods to improve performance under such condition are evaluated using database collected in a small car moving at 100km/h The main features of the recognizer are as follow: The endpoint detection error can be reduced by using the magnitude of the signal which is inverse filtered by the AR model of the background noise, and it can be compensated by using variants of the DTW algorithm. To remove the noise, an autocorrelation subtraction method is used with the constraint that residual energy obtainable by linear predictive analysis should be positive. By using the noise rubust distance measure, distortion of the feature vector is minimized. The speech recognizer is implemented using the Motorola DSP56001(24-bit general purpose digital signal processor). The recognition database is composed of 50 Korean names spoken by 3 male speakers. The recognition error rate of the system is reduced to 4.3% using a single reference pattern for each word and 1.5% using 2 reference patterns for each word.
PDF

Search Result 339, Processing Time 0.022 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)