Search | Korea Science

An Experiment of a Spoken Digits-Recognition System (숫자음성 자동 인식에 관한 일실험)

;安居院猛
- Journal of the Korean Institute of Telematics and Electronics
- /
- v.15 no.6
- /
- pp.23-28
- /
- 1978
This paper describes a speech recognition system for ten isolated spoken digits. In this system, acoustic parameters such as zero crossing rate, log energy and three formant frequencies estimated by linear prediction method were extracted for classification and/or recognition purpose(s). The former two parameters were used for the classification of unvoiced consonants and the latter one for the recognition of vowels and voiced consonants. Promising recognition results were obtained in this experiment for ten digit utterances spoken by a male speaker.
PDF

Emotion Recognition Method using Physiological Signals and Gesture (생체 신호와 몸짓을 이용한 감성인식 방법)

Kim, Ho-Deok;Yang, Hyeon-Chang;Park, Chang-Hyeon;Sim, Gwi-Bo
- Proceedings of the Korean Institute of Intelligent Systems Conference
- /
- 2007.04a
- /
- pp.25-28
- /
- 2007
Electroencephalograhic(EEG)는 심리학의 영역에서 인간 두뇌의 활동을 측정 기록하는데 오래 전부터 사용하였다. 과학의 발달함에 따라 점차적으로 인간의 두뇌에서 감정을 조절하는 기본적인 영역들이 밝혀지고 있다. 그래서 인간의 감정을 조절하는 인간의 두뇌 활동 영역들을 EEG를 이용하여 측정하였다. 본 논문에서는 EEG의 신호들과 몸짓을 이용해서 감정을 인식하였다. 특히, 기존에 생체신호나 몸짓 중 한 가지만을 이용하여 각각 실험해서 감성을 인식하였지만, 본 논문에서는 EEG 신호와 몸짓을 동시에 이용해서 피 실험자의 감성을 인식하는 실험을 하였다. 실험결과 기존의 생체신호나 몸짓 한 가지만을 가지고 실험했을 때의 인식률 보다 더 높은 인식률을 보임을 알 수 있었다. 그리고 생체신호와 몸짓들의 특징 신호들은 강화학습의 개념을 이용한 IFS(Interactive Feature Selection)를 이용하여 특징 선택을 하였다.
PDF

Implementation of Speech Recognizer using Relevance Vector Machine (RVM을 이용한 음성인식기의 구현)

Kim, Chang-Keun;Koh, Si-Young;Hur, Kang-In;Lee, Kwang-Seok
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.11 no.8
- /
- pp.1596-1603
- /
- 2007
In this paper, we experimented by three kind of method for feature parameter, training method and recognition algorithm of most suitable for speech recognition system and considered. We decided speech recognition system of most suitable through two kind of experiment after we make speech recognizer. First, we did an experiment about three kind of feature parameter to evaluate recognition performance of it in speech recognizer using existent MFCC and MFCC new feature parameter that change characteristic space using PCA and ICA. Second, we experimented recognition performance or HMM, SVM and RVM by studying data number. By an experiment until now, feature parameter by ICA showed performance improvement of average 1.5% than MFCC by high linear discrimination from characteristic space. RVM showed performance improvement of maximum 3.25% than HMM in an experiment by decrease of studying data. As such result, effective method for speech recognition system to propose in this paper derives feature parameters using ICA and un recognition using RVM.
https://doi.org/10.6109/jkiice.2007.11.8.1596 인용 PDF KSCI

Synthesis and Classification of Active Sonar Target Signal Using Highlight Model (하이라이트 모델을 이용한 능동소나 표적신호의 합성 및 인식)

Kim, Tae-Hwan;Park, Jeong-Hyun;Nam, Jong-Geun;Lee, Su-Hyung;Bae, Keun-Sung
- The Journal of the Acoustical Society of Korea
- /
- v.28 no.2
- /
- pp.135-140
- /
- 2009
In this paper, we synthesized active sonar target signals based on highlights model, and then carried out target classification using the synthesized signals. If the target aspect angle is changed, the different signals are synthesized. To know the result, two different experiments are done. First, The classification results with respect to each aspect angle are shown. Second, the results in two group in aspect angle are acquired. Time domain feature extraction is done using matched filter and envelope detection. It shows the pattern of each highlights. Artificial neural networks and multi-class SVM are used for classifying target signals.
https://doi.org/10.7776/ASK.2009.28.2.135 인용 PDF KSCI

Word Recognition Using K-L Dynamic Coefficients (K-L 동적 계수를 이용한 단어 인식)

김주곤
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1998.06c
- /
- pp.103-106
- /
- 1998
본 논문에서는 음성인식 시스템의 인식 정도의 향상을 위해서 동적 특징으로서 K-L(Karhanen-Loeve)계수를 이용하여 음소모델을 구성하는 방법을 제안하고, 음소, 단어, 숫자음 인식 실험을 통하여 그 유효성을 검토하였다. 인식 실험을 위한 음성자료는 한국 전자통신 연구소에서 채록한 445단어와 국어정보공학연구소에서 채록한 4연속 숫자음을 사용하였으며, K-L계수 동적 특징의 유효성을 확인하기 위해 정적 특징으로서 멜-켑스트럼과 동적 특징으로서 K-L계수 및 회귀계수를 추출한 후 음소, 단어, 숫자음 인식 실험을 수행하였다. 인식의 기본 단위로는 48개의 유사음소단위(Phoneme Likely Unite ; PLUs)를 음소모델로 사용하였으며, 단어와 숫자음 인식을 위해서는 유한상태 오토마타(Finite State Automata; FSA)에 의한 구문제어를 통한 OPDP(One Pass Dynamic Programming)법을 이용하였다. 인식 실험 결과, 음소인식에 있어서는 정적특징인 멜-켑스트럼을 사용한 경우 39.8%, K-L 동적 계수를 사용한 경우가 52.4%로 12.6%의 향상된 인식률을 얻었다. 또한, 멜-켑스트럼과 회수계수를 사용한 경우 60.1%, K-L계수와 회귀계수를 결합한 경우에 있어서도 60.4%로 높은 인식률은 얻었다. 이 결과를 단어인식에 확장하여 인식 실험을 수행한 결과, 기존의 멜-켑스트럼 계수를 사용한 경우 65.5%, K-L계수를 사용한 경우 75.8%로 10.3% 향상된 인식률을 얻었으며, 멜-켑스트럼과 회귀계수를 결합한 경우 91.2%, K-L계수와 회귀계수를 결합한 경우 91.4%의 높은 인식률을 보였다. 도한, 4연속 숫자음에 적용한 경우에 있어서도 멜-켑스트럼을 사용한 경우 67.5%, K-L계수를 사용한 경우 75.3%로 7.8%의 향상된 인식률을 보였으며 K-L계수와 회귀계수를 결합한 경우에서도 비교적 높은 인식률을 보여 숫자음에 대해서도 K-L계수의 유효성을 확인할 수 있었다.
PDF

School of Electronic and Electrical Engineering, Hong Ik University (균일분포 신경회로망을 이용한 얼굴인식 시스템)

조성원;박준하
- Proceedings of the Korean Institute of Intelligent Systems Conference
- /
- 1997.11a
- /
- pp.171-175
- /
- 1997
본 논문에서는 LVQ(Learning Vector Quentization) 신경회로망의 새로운 가중치 초기화법을 제안하고 이를 얼굴인식 시스템에 적용하였다. 제안한 방법은 초기가중치를 패턴 결정 경계면 주변에 설정함으로써 인식율을 높이는 방법이다. 얼굴인식의 특징 추출 방법으로서는 주성분 분석, 모멘트, 푸리에 기술자, 모멘트+주성분 분석 및 푸리에 기술자+주성분 분석 등을 사용하여 실험하였으며, 인식부의 LVQ 신경회로망에 제안된 방법을 적용하여 기존의 방법과 비교 실험하였다. 실험 결과 초기가중치를 최초 패턴으로 가지는 경우, 평균값을 취하는 경우, 랜덤하게 사용하는 경우 등에 비해서 우수한 인식율을 보임을 알 수 있었다.
PDF

Speaker-dependent Speech Recognition Algorithm for Male and Female Classification (남녀성별 분류를 위한 화자종속 음성인식 알고리즘)

Choi, Jae-Seung
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.17 no.4
- /
- pp.775-780
- /
- 2013
This paper proposes a speaker-dependent speech recognition algorithm which can classify the gender for male and female speakers in white noise and car noise, using a neural network. The proposed speech recognition algorithm is trained by the neural network to recognize the gender for male and female speakers, using LPC (Linear Predictive Coding) cepstrum coefficients. In the experiment results, the maximal improvement of total speech recognition rate is 96% for white noise and 88% for car noise, respectively, after trained a total of six neural networks. Finally, the proposed speech recognition algorithm is compared with the results of a conventional speech recognition algorithm in the background noisy environment.
https://doi.org/10.6109/jkiice.2013.17.4.775 인용 PDF KSCI

A Study on the Korean Syllable As Recognition Unit (인식 단위로서의 한국어 음절에 대한 연구)

Kim, Yu-Jin;Kim, Hoi-Rin;Chung, Jae-Ho
- The Journal of the Acoustical Society of Korea
- /
- v.16 no.3
- /
- pp.64-72
- /
- 1997
In this paper, study and experiments are performed for finding recognition unit fit which can be used in large vocabulary recognition system. Specifically, a phoneme that is currently used as recognition unit and a syllable in which Korean is well characterized are selected. From comparisons of recognition experiments, the study is performed whether a syllable can be considered as recognition unit of Korean recognition system. For report of an objective result of the comparison experiment, we collected speech data of a male speaker and processed them by hand-segmentation for phoneme boundary and labeling to construct speech database. And for training and recognition based on HMM, we used HTK (HMM Tool Kit) 2.0 of commercial tool from Entropic Co. to experiment in same condition. We applied two HMM model topologies, 3 emitting state of 5 state and 6 emitting state of 8 state, in Continuous HMM on training of each recognition unit. We also used 3 sets of PBW (Phonetically Balanced Words) and 1 set of POW(Phonetically Optimized Words) for training and another 1 set of PBW for recognition, that is "Speaker Dependent Medium Vocabulary Size Recognition." Experiments result reports that recognition rate is 95.65% in phoneme unit, 94.41% in syllable unit and decoding time of recognition in syllable unit is faster by 25% than in phoneme.
PDF

배경잡음 하에서의 신경회로망에 의한 남성화자 및 여성화자의 성별인식 알고리즘

Choe, Jae-Seung
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2013.05a
- /
- pp.515-517
- /
- 2013
본 논문에서는 잡음 환경 하에서 남녀 성별인식이 가능한 신경회로망에 의한 화자종속 음성인식 알고리즘을 제안한다. 본 논문에서 제안한 음성인식 알고리즘은 남성화자 및 여성화자를 인식하기 위하여 LPC 켑스트럼 계수를 사용하여 신경회로망에 의하여 학습된다. 본 실험에서는 백색잡음 및 자동차잡음에 대하여 신경회로망의 네크워크에 대한 인식결과를 나타낸다. 인식실험의 결과로부터 백색잡음에 대해서는 최대 96% 이상의 인식률, 자동차잡음에 대해서는 최대 88% 이상의 인식률을 구하였다.
PDF

Connected Digit Recognition Using Phonetical Features (음성학적 특징을 이용한 연속 숫자음인식)

김민정
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1998.06d
- /
- pp.72-75
- /
- 1998
본 논문에서는 숫자음 인식시스템의 인식률 향상을 위한 연구로서 4연속 숫자음을 대상으로 연음 현상 및 경음화 현상등과 같은 음성학적 특징을 고려하여 숫자음에 강건한 모델을 작성하는 방법을 제안하고 인식실험을 통하여 그 유효성을 확인하고자 한다. 이를 위하여 음성자료로서는 국어공학센터(KLE)에서 채록한 4연속 숫자음을 사용하며 인식의 기본단위로서 음향학적 특징을 고려한 19개의 연속분포 HMM을 유사음소 단위(Phoneme Like Units ; PLUS) 로 사용한다. 또한 , 인식실험에 있어서는 기존의 방법으로 모델을 작성한 경우와 연음 현상과 경음화 현상 등과 같은 음성학적 특징을 고려하여 모델을 작성한 경우에 대해서 유한상태 오토마타(finite State Automata ; FSA)에 의한 구문제어를 통한 OPDP(One Pass Dynamic Programming)법으로 인식실험을 수행하여 그 결과를 비교 검토하였다. 그 결과, 기존이 방법의 경우 64.6%, 음성학적 특징을 고려한 경우 68.6%의 인식률을 보여, 음성학적 특징을 고려한 경우가 4.0% 향상된 인식률을 얻어 제안한 방법의 유효성을 확인하였다.
PDF

Search Result 6,452, Processing Time 0.037 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)