Search | Korea Science

Modified Weighting Model Rank Method for Improving the Performance of Real-Time Text-Independent Speaker Recognition System (실시간 문맥독립 화자인식 시스템의 성능향상을 위한 수정된 가중모델순위 결정방법)

Kim Min-Joung;Oh Se-Jin;Suk Su-Young;Chung Ho-Youl;Chung Hyun-Yeol
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.107-110
- /
- 2002
현재까지 개발된 화자식별 시스템 중 가중모델순위(Weighting Model Rank; WMR)방법을 이용한 화자인식 시스템이 비교적 높은 인식성능을 나타내고 있다. WMR 방법은 각 화자에 대한 프레임 유사도의 순위에 따라 지수함수 가중치로 대치시키는 방법을 사용하고 있으나, 이 방법은 유사도 본래의 변별력이 전체 계산에서 고려되지 않는 문제가 있었다. 이를 해결하기 위해 본 논문에서는 각 화자의 프레임 유사도와 지수함수를 이용한 가중치를 곱한 값을 이용하여 전체 스코어를 계산하도록 하는 수정된 가중모델 순위방법(Modified Weighting Model Rank; MWMR)을 제안한다. 제안한 방법의 유효성을 확인하기 위하여 316명의 화자를 대상으로 하여 인식실험을 실시한 결과, 학습 프레임이 10,000일 경우, MWMR 방법에서 $98.1\%$의 화자 인식률을 얻어 WMR 방법에 비해 약 $2.0\%$의 향상된 인식결과를 보여 제안한 방법의 유효성을 확인할 수 있었다.
PDF

Status of Korean Idiom Understanding for Chinese Learners of Korean according to Tasks (과제 유형에 따른 중국인 한국어 학습자의 관용어 이해 실태 양상)

Lee, Mi-Kyung;Kang, An-Young;Kim, Youn-Joo
- The Journal of the Korea Contents Association
- /
- v.15 no.10
- /
- pp.658-668
- /
- 2015
The purpose of present study tested the effects of context, transparency, familiarity and related variables on comprehension of 32 idioms in 87 Chinese learners of Korean who were attending the S university in Jeonnam providence. In the first assessment, idiomatic phrases were presented out of context. In another assessment, idiomatic phrases were embedded in supportive story contexts. To examine the difference based on task types, paired t-test or one-way ANOVA was used to test differences on related variables such as TOPIK, years of residence in Korea, major and etc. on idiom comprehension. The results of this study are summarized as follows. First, task type, familiarity and transparency were found to have no significant effect on idiom comprehension for Chinese learners of Korean. Second, the related variables such as TOPIK, and major had a significant effect on idiom comprehension. Third, percentage of context related interpretation error in context task was the highest. Literal interpretation errors were followed by it. It means they have a tendency to use contextual cues and semantic analysis of the phrase to comprehend Korean idioms. The results of study will be used to make a plan for teaching Chinese learners of Korean.
https://doi.org/10.5392/JKCA.2015.15.10.658 인용 PDF KSCI

Design and Implementation of a Virtual Education System on the Web Environment (웹 환경에서의 가상교육 시스템 설계 및 구현)

노진순;이용배;맹성현
- Proceedings of the Korean Information Science Society Conference
- /
- 2001.10b
- /
- pp.595-597
- /
- 2001
World-Wide Web으로 인하여 인터넷 상의 다양하고 고품질의 자료들을 교육 자료에 손쉽게 활용할 수 있는 시대가 도래하였다. 그러나 이러한 자료들은 교육적 효과를 극대화시키기 위해서 좀 더 정제되고, 교육과정에 맞는 흐름을 가질 필요가 있다. 이러한 과정의 흐름을 제공하기 위해서는 웹 상에서 분산되어 독립적으로 존재하는 디지털 문서들을 교육 목적에 맞게 새로운 순서, 즉 문맥화된 순서를 가진 자료로 재구성할 수 있어야 하며, 문서간의 부드러운 내용 전개를 위해서 부가적인 설명이나 기존 문서에 빠져 있는 내용들을 보완할 수 있어야 한다. 본 논문의 연구과정에서 개발된 가상교육 시스템은 교사가 교육용 지식문서를 작성하여 면대면(face to face) 교육에서는 직접 학생들을 교육할 수 있는 교육 자료로 사용될 수 있을 뿐만 아니라 웹을 통해서는 학생 스스로가 부족한 부분을 원하는 시간에 학습할 수 있는 능동적인 교육 환경을 제공할 수 있다. 또한, 가상교육 시스템에 가상문서 개념을 도입함으로써 인터넷 상의 수많은 리소스들을 인용하는 것에 대한 부하를 막을 수 있다. 본 논문에서는 인터넷 상의 디지털 컨텐츠를 전문적인 지식을 가진 교사가 교육과정에 맞게 쉽게 재구성해 줄 수 있도록 가상교육 시스템을 설계 및 구현한 내용에 대해 기술한다.
PDF

Semantic Integration of Databases Based on the Multi-Aspect Semantic Model (다중 측면 의미 모델에 기반한 데이터베이스의 의미 통합)

이정욱;김중일;이종혁;백두권
- Proceedings of the Korean Information Science Society Conference
- /
- 1998.10b
- /
- pp.283-285
- /
- 1998
현재의 멀티데이터베이스 시스템에서 고려해야 할 중요한 문제중의 하나는 의미 이질성(semantic heterogeneity)을 식별하고 해결하는 것이다. 본 논문에서는 이를 위하여, 다중 측면 의미 모델(Multi-Aspect Semantic Model:MASM)을 제시하고 이에 기반한 의미 통합 방법을 제시한다. MASM은 의미 특징(semantic feature), 스키마 측면(schematic aspect), 명칭(name), 기능적 측면(functional aspect), 문맥(context) 등의 여러 요소들을 고려한 모델이며, 모든 요소 데이터베이스간에 공유되어야 하는 표준화된 지식 없이 객체간의 의미 유사성을 판단한다. 정보 통합에 필요한 모든 지식은 각 요소 데이터베이스에서 다른 요소 데이터베이스에 독립적으로 구축되며, 이를 통하여 융통성과 확장성을 갖는 멀티데이터베이스 시스템을 구축하는 토대를 마련한다.

Chinese Unsupervised Word Sense Disambiguation using WordNet (어휘의미망을 이용한 중국어 비감독 어의 중의성 해소)

Lian, Guang-Zhe;Kim, Minho;Kwon, Hyuk-Chul
- Proceedings of the Korea Information Processing Society Conference
- /
- 2012.04a
- /
- pp.365-368
- /
- 2012
어의 중의성 해소는 자연어처리에서 중요한 역할을 한다. 감독 중의성 해소 방법은 비감독 중의성 해소 방법보다 높은 성능을 나타내지만, 구축비용이 큰 대규모 의미부착 말뭉치가 필요하다. 본 논문에서는 중국어 어휘의미망(HowNet)과 의미 미부착 말뭉치를 이용한 중국어 비감독 어의 중의성 해소 방법을 제안한다. 의미 미부착 말뭉치에서 통계정보를 추출하고, 중국어 어휘 의미망에서 중의성 어휘의 의미별 형제어를 추출하여 중의성 어휘의 주변 문맥에 나타나는 어휘와 카이제곱검정(${\chi}^2$-test)에 의한 독립성 검정을 통해 어휘 간 연관성을 판단하고 중의성 해소를 한다. 본 논문에서 제안한 중의성 해소방법의 성능을 SemEval-2007 평가데이터에서 측정한 결과 명사와 동사에서 각각 64.7%, 49.4%를 나타냈다. 이는 SemEval-2007 중국어 비감독 중의성 해소에서 가장 높은 성능을 나타낸 시스템보다 13.1%, 13.9% 높은 성능이다.
https://doi.org/10.3745/PKIPS.y2012m04a.365 인용 PDF

Noise Robust Text-Independent Speaker Identification for Ubiquitous Robot Companion (지능형 서비스 로봇을 위한 잡음에 강인한 문맥독립 화자식별 시스템)

Kim, Sung-Tak;Ji, Mi-Kyoung;Kim, Hoi-Rin;Kim, Hye-Jin;Yoon, Ho-Sub
- 한국HCI학회:학술대회논문집
- /
- 2008.02a
- /
- pp.190-194
- /
- 2008
This paper presents a speaker identification technique which is one of the basic techniques of the ubiquitous robot companion. Though the conventional mel-frequency cepstral coefficients guarantee high performance of speaker identification in clean condition, the performance is degraded dramatically in noise condition. To overcome this problem, we employed the relative autocorrelation sequence mel-frequency cepstral coefficient which is one of the noise robust features. However, there are two problems in relative autocorrelation sequence mel-frequency cepstral coefficient: 1) the limited information problem. 2) the residual noise problem. In this paper, to deal with these drawbacks, we propose a multi-streaming method for the limited information problem and a hybrid method for the residual noise problem. To evaluate proposed methods, noisy speech is used in which air conditioner noise, classic music, and vacuum noise are artificially added. Through experiments, proposed methods provide better performance of speaker identification than the conventional methods.
PDF

Improvement of Keyword Spotting Performance Using Normalized Confidence Measure (정규화 신뢰도를 이용한 핵심어 검출 성능향상)

Kim, Cheol;Lee, Kyoung-Rok;Kim, Jin-Young;Choi, Seung-Ho;Choi, Seung-Ho
- The Journal of the Acoustical Society of Korea
- /
- v.21 no.4
- /
- pp.380-386
- /
- 2002
Conventional post-processing as like confidence measure (CM) proposed by Rahim calculates phones' CM using the likelihood between phoneme model and anti-model, and then word's CM is obtained by averaging phone-level CMs[1]. In conventional method, CMs of some specific keywords are tory low and they are usually rejected. The reason is that statistics of phone-level CMs are not consistent. In other words, phone-level CMs have different probability density functions (pdf) for each phone, especially sri-phone. To overcome this problem, in this paper, we propose normalized confidence measure. Our approach is to transform CM pdf of each tri-phone to the same pdf under the assumption that CM pdfs are Gaussian. For evaluating our method we use common keyword spotting system. In that system context-dependent HMM models are used for modeling keyword utterance and contort-independent HMM models are applied to non-keyword utterance. The experiment results show that the proposed NCM reduced FAR (false alarm rate) from 0.44 to 0.33 FA/KW/HR (false alarm/keyword/hour) when MDR is about 8%. It achieves 25% improvement of FAR.
PDF KSCI

A Variable Parameter Model based on SSMS for an On-line Speech and Character Combined Recognition System (음성 문자 공용인식기를 위한 SSMS 기반 가변 파라미터 모델)

석수영;정호열;정현열
- The Journal of the Acoustical Society of Korea
- /
- v.22 no.7
- /
- pp.528-538
- /
- 2003
A SCCRS (Speech and Character Combined Recognition System) is developed for working on mobile devices such as PDA (Personal Digital Assistants). In SCCRS, the feature extraction is separately carried out for speech and for hand-written character, but the recognition is performed in a common engine. The recognition engine employs essentially CHMM (Continuous Hidden Markov Model), which consists of variable parameter topology in order to minimize the number of model parameters and to reduce recognition time. For generating contort independent variable parameter model, we propose the SSMS(Successive State and Mixture Splitting), which gives appropriate numbers of mixture and of states through splitting in mixture domain and in time domain. The recognition results show that the proposed SSMS method can reduce the total number of GOPDD (Gaussian Output Probability Density Distribution) up to 40.0% compared to the conventional method with fixed parameter model, at the same recognition performance in speech recognition system.
PDF KSCI

Speech Recognition Using MSVQ/TDRNN (MSVQ/TDRNN을 이용한 음성인식)

Kim, Sung-Suk
- The Journal of the Acoustical Society of Korea
- /
- v.33 no.4
- /
- pp.268-272
- /
- 2014
This paper presents a method for speech recognition using multi-section vector-quantization (MSVQ) and time-delay recurrent neural network (TDTNN). The MSVQ generates the codebook with normalized uniform sections of voice signal, and the TDRNN performs the speech recognition using the MSVQ codebook. The TDRNN is a time-delay recurrent neural network classifier with two different representations of dynamic context: the time-delayed input nodes represent local dynamic context, while the recursive nodes are able to represent long-term dynamic context of voice signal. The cepstral PLP coefficients were used as speech features. In the speech recognition experiments, the MSVQ/TDRNN speech recognizer shows 97.9 % word recognition rate for speaker independent recognition.
https://doi.org/10.7776/ASK.2014.33.4.268 인용 PDF KSCI

Query Expansion based on Word Graph using Term Proximity (질의 어휘와의 근접도를 반영한 단어 그래프 기반 질의 확장)

Jang, Kye-Hun;Lee, Kyung-Soon
- The KIPS Transactions:PartB
- /
- v.19B no.1
- /
- pp.37-42
- /
- 2012
The pseudo relevance feedback suggests that frequent words at the top documents are related to initial query. However, the main drawback associated with the term frequency method is the fact that it relies on feature independence, and disregards any dependencies that may exist between words in the text. In this paper, we propose query expansion based on word graph using term proximity. It supplements term frequency method. On TREC WT10g test collection, experimental results in MAP(Mean Average Precision) show that the proposed method achieved 6.4% improvement over language model.
https://doi.org/10.3745/KIPSTB.2012.19B.1.037 인용 PDF KSCI

Search Result 64, Processing Time 0.053 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)