Search | Korea Science

Two-Path Language Modeling Considering Word Order Structure of Korean (한국어의 어순 구조를 고려한 Two-Path 언어모델링)

Shin, Joong-Hwi;Park, Jae-Hyun;Lee, Jung-Tae;Rim, Hae-Chang
- The Journal of the Acoustical Society of Korea
- /
- v.27 no.8
- /
- pp.435-442
- /
- 2008
The n-gram model is appropriate for languages, such as English, in which the word-order is grammatically rigid. However, it is not suitable for Korean in which the word-order is relatively free. Previous work proposed a twoply HMM that reflected the characteristics of Korean but failed to reflect word-order structures among words. In this paper, we define a new segment unit which combines two words in order to reflect the characteristic of word-order among adjacent words that appear in verbal morphemes. Moreover, we propose a two-path language model that estimates probabilities depending on the context based on the proposed segment unit. Experimental results show that the proposed two-path language model yields 25.68% perplexity improvement compared to the previous Korean language models and reduces 94.03% perplexity for the prediction of verbal morphemes where words are combined.
https://doi.org/10.7776/ASK.2008.27.8.435 인용 PDF KSCI

Development of Speech Recognition System based on User Context Information in Smart Home Environment (스마트 홈 환경에서 사용자 상황정보 기반의 음성 인식 시스템 개발)

Kim, Jong-Hun;Sim, Jae-Ho;Song, Chang-Woo;Lee, Jung-Hyun
- The Journal of the Korea Contents Association
- /
- v.8 no.1
- /
- pp.328-338
- /
- 2008
Most speech recognition systems that have a large capacity and high recognition rates are isolated word speech recognition systems. In order to extend the scope of recognition, it is necessary to increase the number of words that are to be searched. However, it shows a problem that exhibits a decrease in the system performance according to the increase in the number of words. This paper defines the context information that affects speech recognition in a ubiquitous environment to solve such a problem and develops user localization method using inertial sensor and RFID. Also, we develop a new speech recognition system that demonstrates better performances than the existing system by establishing a word model domain of a speech recognition system by context information. This system shows operation without decrease of recognition rate in smart home environment.
https://doi.org/10.5392/JKCA.2008.8.1.328 인용 PDF

A Study on the Death Consciousness Among Health Care Personnels (죽음의식에 관한 연구 -의.간호계 종사자 및 학생을 중심으로-)

권혜진
- Journal of Korean Academy of Nursing
- /
- v.10 no.2
- /
- pp.21-40
- /
- 1980
In order to take cue of the dying persons and their survivors in a more positive and affirmative atti-tube. and to understand the valuable meaning of and dying. a survey was performed to 550 cases of health care personnels including 116 nursing students. 238 medical students. 137 nurses. and 59 doctors. Samplings were made through census Procedure from the entire group of medical and nursing students in College of Medicine. Chung-Ang University. and of licenced nurses and doctors in Chung-Ang University Hospital. and in Han-Gang Sacred Heart Hospital from the first to the end of march. 1980. These collected data were computerized at KIST by SPSS programming and were statistically analyzed by chi-square test. Through content analysis of the word associated with death and descriptive analysis of the death-re-lated variables. the following conclusion in is reached. First. Total numbers of death-word percieved by health care personnels were 198 kinds. Among them, 40 kinds of words associated with death were responded from than 1% of the total. As to the 10 death related word responded by free word association method. it was revealed that individual average number of death related word was 7.70 word. which came from higher number of words in the senior students (8.96 word) or the graduates (8.10 word) compared with the freshman (6.84 word). Second. In Content specific analysis of the death related word. more frequently perceived types summarized as the following order; the affective context of death. the diseases. the disasters. the religion, the funeral ceremonies. the separation, the drakness. and the life. Third. The most prevalent 10 words associated with death which the the respondents gave response to the the first recalling word. were as following o order； the dieases. the sadness, the vanity. the darkness, the frustration. the suicide. the incurable dieases, the graves. the dead. and the catastrophes. By sex, the diease is outstanding in females, but the vanity is in males. By occupation. the vanity and the dead was frequently observed in student group including senior students. while the incurable dieases presented by doctors. Fourth. In health care personnels. the first perceived ages of death were 11.47 $\pm$3.33 years (8.14- 15.80 years). Among them. senior students were inclined to percept death at the earliest age of life (11.28years). while doctors and nurses perceived death later in their life (12.98 years). Fifth, It is revealed in this survey that the most frequently responded death perceiving motives by health care personnels ar“psychological conflict”and“death of those around them”. Death perceiving motives can be classified in two factors; personality and life circumstances. Sixth It is of interest that only 11.3% health care personnels was found to feel death as inevitable or acceptable event. whereas 58.3% deny or reject it.
PDF

Coordinative movement of articulators in bilabial stop /p/^∗

Son, Minjung
- Phonetics and Speech Sciences
- /
- v.10 no.4
- /
- pp.77-89
- /
- 2018
Speech articulators are coordinated for the purpose of segmental constriction in terms of a task. In particular, vertical jaw movements repeatedly contribute to consonantal as well as vocalic constriction. The current study explores vertical jaw movements in conjunction with bilabial constriction in bilabial stop /p/ in the context /a/-to-/a/. Revisiting kinematic data of /p/ collected using the electromagenetic midsagittal articulometer (EMMA) method from seven (four female and three male) speakers of Seoul Korean, we examined maximum vertical jaw position, its relative timing with respect to the upper and lower lips, and lip aperture minima. The results of those dependent variables are recapitulated in terms of linguistic (different word boundaries) and paralinguistic (different speech rates) factors as follows. Firstly, maximum jaw height was lower in the across-word boundary condition (across-word < within-word), but it did not differ as a function of different speech rates (comfortable = fast). Secondly, more reduction in the lip aperture (LA) gesture occurred in fast rate, while word-boundary effects were absent. Thirdly, jaw raising was still in progress after the lips' positional extrema were achieved in the within-word condition, while the former was completed before the latter in the across-word condition. Lastly, relative temporal lags between the jaw and the lips (UL and LL) were more synchronous in fast rate, compared to comfortable rate. When these results are considered together, it is possible to posit that speakers are not tolerant of lenition to the extent that it is potentially realized as a labial approximant in either word-boundary condition while jaw height still manifested lower jaw position in the across-word boundary condition. Early termination of vertical jaw maxima before vertical lower lip maxima across-word condition may be partly responsible for the spatial reduction of jaw raising movements. This may come about as a consequence of an excessive number of factors (e.g., upper lip height (UH), lower lip height (LH), jaw angle (JA)) for the representation of a vector with two degrees of freedom (x, y) engaged in a gesture-based task (e.g., lip aperture (LA)). In the task-dynamic application toolkit, the jaw angle parameter can be assigned numerical values for greater weight in the across-word boundary condition, which in turn gives rise to lower jaw position. Speech rate-dependent spatial reduction in lip aperture may be able to be resolved by means of manipulating activation time of an active tract variable in the gestural score level.
https://doi.org/10.13064/KSSS.2018.10.4.077 인용 PDF KSCI

Acoustic and Pronunciation Model Adaptation Based on Context dependency for Korean-English Speech Recognition (한국인의 영어 인식을 위한 문맥 종속성 기반 음향모델/발음모델 적응)

Oh, Yoo-Rhee;Kim, Hong-Kook;Lee, Yeon-Woo;Lee, Seong-Ro
- MALSORI
- /
- v.68
- /
- pp.33-47
- /
- 2008
In this paper, we propose a hybrid acoustic and pronunciation model adaptation method based on context dependency for Korean-English speech recognition. The proposed method is performed as follows. First, in order to derive pronunciation variant rules, an n-best phoneme sequence is obtained by phone recognition. Second, we decompose each rule into a context independent (CI) or a context dependent (CD) one. To this end, it is assumed that a different phoneme structure between Korean and English makes CI pronunciation variabilities while coarticulation effects are related to CD pronunciation variabilities. Finally, we perform an acoustic model adaptation and a pronunciation model adaptation for CI and CD pronunciation variabilities, respectively. It is shown from the Korean-English speech recognition experiments that the average word error rate (WER) is decreased by 36.0% when compared to the baseline that does not include any adaptation. In addition, the proposed method has a lower average WER than either the acoustic model adaptation or the pronunciation model adaptation.
PDF

A Study on Korean 4-connected Digit Recognition Using Demi-syllable Context-dependent Models (반음절 문맥종속 모델을 이용한 한국어 4 연숫자음 인식에 관한 연구)

이기영;최성호;이호영;배명진
- The Journal of the Acoustical Society of Korea
- /
- v.22 no.3
- /
- pp.175-181
- /
- 2003
Because a word of Korean digits is a syllable and deeply coarticulatied in connected digits, some recognition models based on demisyllables have been proposed by researchers. However, they could not show an excellent recognition results yet. This paper proposes a recognition model based on extended and context-dependent demisyllables, such as a tri-demisyllable like a tri-phone, for the Korean 4-connected digits recognition. For experiments, we use a toolkit of HTK 3.0 for building this model of continuous HMMs using training Korean connected digits from SiTEC database and for recognizing unknown ones. The results show that the recognition rate is 92% and this model has an ability to improve the recognition performance of Korean connected digits.
PDF KSCI

Design and Implementation of Context-aware Application on Smartphone Using Speech Recognizer

Kim, Kyuseok
- Journal of Advanced Information Technology and Convergence
- /
- v.10 no.2
- /
- pp.49-59
- /
- 2020
As technologies have been developing, our lives are getting easier. Today we are surrounded by the new technologies such as AI and IoT. Moreover, the word, "smart" is a very broad one because we are trying to change our daily environment into smart one by using those technologies. For example, the traditional workplaces have changed into smart offices. Since the 3rd industrial revolution, we have used the touch interface to operate the machines. In the 4th industrial revolution, however, we are trying adding the speech recognition module to the machines to operate them by giving voice commands. Today many of the things are communicated with human by voice commands. Many of them are called AI things and they do tasks which users request and do tasks more than what users request. In the 4th industrial revolution, we use smartphones all the time every day from the morning to the night. For this reason, the privacy using phone is not guaranteed sometimes. For example, the caller's voice can be heard through the phone speaker when accepting a call. So, it is needed to protect privacy on smartphone and it should work automatically according to the user context. In this aspect, this paper proposes a method to adjust the voice volume for call to protect privacy on smartphone according to the user context.
https://doi.org/10.14801/JAITC.2020.10.2.49 인용

Improvements of an English Pronunciation Dictionary Generator Using DP-based Lexicon Pre-processing and Context-dependent Grapheme-to-phoneme MLP (DP 알고리즘에 의한 발음사전 전처리와 문맥종속 자소별 MLP를 이용한 영어 발음사전 생성기의 개선)

김회린;문광식;이영직;정재호
- The Journal of the Acoustical Society of Korea
- /
- v.18 no.5
- /
- pp.21-27
- /
- 1999
In this paper, we propose an improved MLP-based English pronunciation dictionary generator to apply to the variable vocabulary word recognizer. The variable vocabulary word recognizer can process any words specified in Korean word lexicon dynamically determined according to the current recognition task. To extend the ability of the system to task for English words, it is necessary to build a pronunciation dictionary generator to be able to process words not included in a predefined lexicon, such as proper nouns. In order to build the English pronunciation dictionary generator, we use context-dependent grapheme-to-phoneme multi-layer perceptron(MLP) architecture for each grapheme. To train each MLP, it is necessary to obtain grapheme-to-phoneme training data from general pronunciation dictionary. To automate the process, we use dynamic programming(DP) algorithm with some distance metrics. For training and testing the grapheme-to-phoneme MLPs, we use general English pronunciation dictionary with about 110 thousand words. With 26 MLPs each having 30 to 50 hidden nodes and the exception grapheme lexicon, we obtained the word accuracy of 72.8% for the 110 thousand words superior to rule-based method showing the word accuracy of 24.0%.
PDF

The Effect of Emotional Content and Context on Memory Encoding: ERP Studies (자극과 맥락의 정서성이 기억 부호화에 미치는 영향: ERP 연구)

Park, Sun-Hee;Park, Tae-Jin
- Korean Journal of Cognitive Science
- /
- v.21 no.2
- /
- pp.387-408
- /
- 2010
This study examined the effects of emotional content on the encoding process of emotional stimuli and the effects of emotional context on those of neutral stimuli. It was examined whether the superior memory of emotional stimuli is due to attentional resource allocation. This study were performed an emotional picture and a neutral word were presented in succession at every trials. The results of recognition judgement showed superior memory of emotional pictures than neutral pictures, but showed poorer memory of neutral words in emotional context than those in neutral context. LPC(Late Positive Complex) of ERP results showed the similar pattern: higher amplitude by emotional pictures than neutral pictures, and lower amplitude by neutral words in emotional context than those in neutral context. This result is considered to support attention allocation hypothesis.
PDF

Automatic Error Correction System for Erroneous SMS Strings (SMS 변형된 문자열의 자동 오류 교정 시스템)

Kang, Seung-Shik;Chang, Du-Seong
- Journal of KIISE:Software and Applications
- /
- v.35 no.6
- /
- pp.386-391
- /
- 2008
Some spoken word errors that violate grammatical or writing rules occurs frequently in communication environments like mobile phone and messenger. These unexpected errors cause a problem in a language processing system for many applications like speech recognition, text-to-speech translation, and so on. In this paper, we proposed and implemented an automatic correction system of ill-formed words and word spacing errors in SMS sentences that has been the major errors of poor accuracy. We experimented three methods of constructing the word correction dictionary and evaluated the results of those methods. They are (1) manual construction of error words from the vocabulary list of ill-formed communication languages, (2) automatic construction of error dictionary from the manually constructed corpus, and (3) context-dependent method of automatic construction of error dictionary.
PDF KSCI

Search Result 353, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)