• Title/Summary/Keyword: Vocabulary Recognition

Search Result 221, Processing Time 0.024 seconds

A Study on the Recognition of Exterior Image of Hanok Building - Using I.R.I Adjective Image Scale - (한옥건축물의 외관 이미지 인식에 관한 연구 - I.R.I 형용사 이미지 스케일을 활용하여 -)

  • Jang, sung-un;Park, Dae-hyun
    • Journal of the Korean Institute of Rural Architecture
    • /
    • v.25 no.4
    • /
    • pp.1-8
    • /
    • 2023
  • This study is meaningful in figuring out how much the Korean people's awareness of hanok has increased even though interest in hanok has also increased due to the Korean Wave craze. Therefore, with respect to the exterior of hanok, which is visually recognized first, the level of experts and ordinary people is grasped through a semantic discrimination scale, and the degree of visual recognition is to be investigated centering on the color image of hanok buildings. This is the process of thinking about how the Korean image should be reflected in the design, and we want to suggest the direction that modern hanok should go. The study compared and analyzed the difference in visual color based on the elevation of the hanok using a 7-point and 5-point scale method for the general public and experts, and utilized the IRI adjective vocabulary scale and the color matching image scale to construct new hanoks with insufficient differences in appearance and shape. It can be applied to design and image preservation and construction of existing hanok.

An Implementation of Rejection Capabilities in the Isolated Word Recognition System (고립단어 인식 시스템에서의 거절기능 구현)

  • Kim, Dong-Hwa;Kim, Hyung-Soon;Kim, Young-Ho
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.6
    • /
    • pp.106-109
    • /
    • 1997
  • For the practical isolated word recognition system, the ability to reject the out-of -vocabulary(OOV) is required. In this paper, we present a rejection method which uses the clustered phoneme modeling combined with postprocessing by likelihood ratio scoring. Our baseline speech recognition system was based on the whole-word continuous HMM. And 6 clustered phoneme models were generated using statistical method from the 45 context independent phoneme models, which were trained using the phonetically balanced speech database. The test of the rejection performance for speaker independent isolated words recogntion task on the 22 section names shows that our method is superior to the conventional postprocessing method, performing the rejection according to the likelihood difference between the first and second candidates. Furthermore, this clustered phoneme models do not require retraining for the other isolated word recognition system with different vocabulary sets.

  • PDF

Korean Speech Recognition Based on Syllable (음절을 기반으로한 한국어 음성인식)

  • Lee, Young-Ho;Jeong, Hong
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.31B no.1
    • /
    • pp.11-22
    • /
    • 1994
  • For the conventional systme based on word, it is very difficult to enlarge the number of vocabulary. To cope with this problem, we must use more fundamental units of speech. For example, syllables and phonemes are such units, Korean speech consists of initial consonants, middle vowels and final consonants and has characteristic that we can obtain syllables from speech easily. In this paper, we show a speech recognition system with the advantage of the syllable characteristics peculiar to the Korean speech. The algorithm of recognition system is the Time Delay Neural Network. To recognize many recognition units, system consists of initial consonants, middle vowels, and final consonants recognition neural network. At first, our system recognizes initial consonants, middle vowels and final consonants. Then using this results, system recognizes isolated words. Through experiments, we got 85.12% recognition rate for 2735 data of initial consonants, 86.95% recognition rate for 3110 data of middle vowels, and 90.58% recognition rate for 1615 data of final consonants. And we got 71.2% recognition rate for 250 data of isolated words.

  • PDF

Design of Linguistic Contents of Speech Copora for Speech Recognition and Synthesis for Common Use (공동 이용을 위한 음성 인식 및 합성용 음성코퍼스의 발성 목록 설계)

  • Kim Yoen-Whoa;Kim Hyoung-Ju;Kim Bong-Wan;Lee Yong-Ju
    • MALSORI
    • /
    • no.43
    • /
    • pp.89-99
    • /
    • 2002
  • Recently, researches into ways of improving large vocabulary continuous speech recognition and speech synthesis are being carried out intensively as the field of speech information technology is progressing rapidly. In the field of speech recognition, developments of stochastic methods such as HMM require large amount of speech data for training, and also in the field of speech synthesis, recent practices show that synthesis of better quality can be produced by selecting and connecting only the variable size of speech data from the large amount of speech data. In this paper we design and discuss linguistic contents for speech copora for speech recognition and synthesis to be shared in common.

  • PDF

Malay Syllables Speech Recognition Using Hybrid Neural Network

  • Ahmad, Abdul Manan;Eng, Goh Kia
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2005.06a
    • /
    • pp.287-289
    • /
    • 2005
  • This paper presents a hybrid neural network system which used a Self-Organizing Map and Multilayer Perceptron for the problem of Malay syllables speech recognition. The novel idea in this system is the usage of a two-dimension Self-organizing feature map as a sequential mapping function which transform the phonetic similarities or acoustic vector sequences of the speech frame into trajectories in a square matrix where elements take on binary values. This property simplifies the classification task. An MLP is then used to classify the trajectories that each syllable in the vocabulary corresponds to. The system performance was evaluated for recognition of 15 Malay common syllables. The overall performance of the recognizer showed to be 91.8%.

  • PDF

The Effect of the Number of Training Data on Speech Recognition

  • Lee, Chang-Young
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.2E
    • /
    • pp.66-71
    • /
    • 2009
  • In practical applications of speech recognition, one of the fundamental questions might be on the number of training data that should be provided for a specific task. Though plenty of training data would undoubtedly enhance the system performance, we are then faced with the problem of heavy cost. Therefore, it is of crucial importance to determine the least number of training data that will afford a certain level of accuracy. For this purpose, we investigate the effect of the number of training data on the speaker-independent speech recognition of isolated words by using FVQ/HMM. The result showed that the error rate is roughly inversely proportional to the number of training data and grows linearly with the vocabulary size.

Isolated Word Recognition By HMM using Multisection MSVQ (MSVQ를 이용한 HMM에 의한 단독어 인식)

  • 안태옥;변용규;김순협
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.27 no.9
    • /
    • pp.1468-1475
    • /
    • 1990
  • In this paper, isolated words are recognized using multisection VQ and HMM. As recognition vocabuaries, 20 area-name which is uttered 5 times by 3 speakers is selected. In generating codebook, we devide recognition vocabulary into equal length, section, and make standard VQ codebook to each section and calculate observation by section and than recognize isolated words by HMM training. Multisection VQ codebook has time information and as observation is calculated by eacy section, computation is lesser and recongnition rate is higher than by whole codword. As a result, it is proved that recognition rate is higher in case of HMM using multisection VQ codebook.

  • PDF

Speech Recognition Error Compensation using MFCC and LPC Feature Extraction Method (MFCC와 LPC 특징 추출 방법을 이용한 음성 인식 오류 보정)

  • Oh, Sang-Yeob
    • Journal of Digital Convergence
    • /
    • v.11 no.6
    • /
    • pp.137-142
    • /
    • 2013
  • Speech recognition system is input of inaccurate vocabulary by feature extraction case of recognition by appear result of unrecognized or similar phoneme recognized. Therefore, in this paper, we propose a speech recognition error correction method using phoneme similarity rate and reliability measures based on the characteristics of the phonemes. Phonemes similarity rate was phoneme of learning model obtained used MFCC and LPC feature extraction method, measured with reliability rate. Minimize the error to be unrecognized by measuring the rate of similar phonemes and reliability. Turned out to error speech in the process of speech recognition was error compensation performed. In this paper, the result of applying the proposed system showed a recognition rate of 98.3%, error compensation rate 95.5% in the speech recognition.

Performance Evaluation of the Variable Vocabulary Speech Recognition System in the Noisy and Vocabulary-Independent Environments (잡음환경 및 어휘독립 환경에서의 가변어휘 음성인식기의 성능 분석)

  • 이승훈
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1998.08a
    • /
    • pp.56-59
    • /
    • 1998
  • POW 3848 DB 및 SNR 이 크게 다른 2 종류의 PC168 DB를 대상으로 가변어휘 음성인식 시스템을 이용하여 훈련 및 성능 평가 실험을 수행한 내용에 대해서 기술하고 있다. 실험의 목적은 위의 3종류의 DB를 조합하여 얻은 DB 환경하에서 인식기를 훈련시키면서, DB 의 조합 및 훈련방법에 따른 인식기의 성능과의 상관관계를 도출하고자 하였다. DB 의 조합은 POW DB 와 SNR 이 높은 PC DB , 및 3종류의 DB 모두로 구성하였다. 인식기는 40개의 음소로 구성된 문맥 독립형 SCHMM 모델이며, 각 음소당 3개의 상태로 이루어져 있다. 실험 결과, 대부분의 경우에서 ITERATION이 1.0인 경우에 최고 인식률을 나타내고 있으며, INTERATION 이 3.0 이상인 경우에는 항상 CASE 3의 실험방법이 우세한 결과를 나타내었다. 또한 CASE 1으로 훈련한 경우가 CASE 2 보다는 각각의 실험 DB 에 대해서 대체적으로 좋은 결과를 보였다.

  • PDF

Utterance Verification Using Search Confusion Rate and Its N-Best Approach

  • Kim, Kyu-Hong;Kim, Hoi-Rin;Hahn, Min-Soo
    • ETRI Journal
    • /
    • v.27 no.4
    • /
    • pp.461-464
    • /
    • 2005
  • Recently, a variety of confidence measures for utterance verification has been studied to improve speech recognition performance by rejecting out-of-vocabulary inputs. Most of the conventional confidence measures for utterance verification are based primarily on hypothesis testing or an approximated posterior probability, and their performances depend on the robustness of an alternative hypothesis or the prior probability. We introduce a novel confidence measure called a search confusion rate (SCR), which does not require an alternative hypothesis or the approximation of posterior probability. Our confusion-based approach shows better performance in additive noise-corrupted speech as well as in clean speech.

  • PDF