• Title/Summary/Keyword: Syllable-Unit Processing

A Study on Korean Connected Digit Recognizer Based on Semi-syllable and Post-processing (반음절기반의 한국어 연속숫자음인식과 그 후처리에 대한 연구)

  • Jeong, Jae-Boo;Chung, Hoon;Chung, Ik-Joo
    • Speech Sciences / v.8 no.4 / pp.1-15 / 2001
  • This paper describes the effect of a new recognition unit based on the semi-syllable, together with a post-processing method for it. A semi-syllable recognition unit captures the coarticulation effects in Korean connected digits. The existing semi-syllable method constrains the next models according to the currently recognized models in order to build a complete connected digit sequence. This paper instead uses a new post-processing method: it recognizes isolated digit words that contain digit sequences, drawn from the digit combinations that can arise from the currently recognized semi-syllable sequence. This method gives a higher accuracy rate than the existing method. The new post-processing provides two advantages: 1) it corrects mis-recognized semi-syllable units; 2) speakers can utter each digit without regard to its duration.

The Processing Unit in Korean Words (한글 낱말의 처리 단위)

  • 이준석;김경린
    • Korean Journal of Cognitive Science / v.1 no.2 / pp.221-239 / 1989
  • The purpose of this study was to explore the processing unit in Korean words. Three experiments were conducted to examine this question. A preliminary experiment and Experiment 1 were run to delineate the processing unit in single-syllable words, and Experiment 2 did the same for words of two or more syllables. The major finding of the preliminary experiment was that the effect of consonant type was not significant but that of letter position was: reaction time increased as the position of the letter increased. The difference in reaction time between the first and second positions was not significant; however, the difference between the second and third was. In Experiment 1, the effect of the number of letters was significant: reaction time increased as the number of letters increased. The size of the position effect was comparable in the preliminary experiment and Experiment 1. In Experiment 2, regardless of the presence of final consonant(s), reaction time increased linearly as the number of syllables increased from two to four. The findings of the present study suggest that (1) the processing unit in single-syllable Korean words is a syllable without the final consonant(s), but (2) in words of two or more syllables, the unit is likely to be a syllable with the final consonant(s).

Variables affecting Korean word recognition: focusing on syllable shape (한글 단어 재인에 영향을 미치는 변인: 음절 형태를 중심으로)

  • Min, Suyoung;Lee, Chang H.
    • Korean Journal of Cognitive Science / v.29 no.4 / pp.193-220 / 2018
  • Recent studies have demonstrated that word frequency, word length, neighborhood, and word shape may play a role in visual word recognition. Shape information may affect word processing differently in Korean, because the Korean writing system works differently from that of English. The purpose of this study was to apply the Gestalt principle of continuity to the Korean alphabetic script (hangul), to investigate the processing unit of hangul, and to verify whether syllable shape affects word recognition in hangul. In Experiment 1, three-syllable words were used and two variables were manipulated: 1) syllable type (horizontal syllable shape, e.g., "가", or vertical syllable shape, e.g., "고") and 2) presentation direction (horizontal or vertical). Whereas "가" meets the criteria of the continuity principle, "고" does not. Based on lexical decision times, the horizontal syllable shape type showed significantly better performance than the vertical syllable shape type, regardless of presentation direction. In Experiment 2, syllable type (horizontal or vertical syllable shape) and the visual relationship between prime and target (identical, similar, different) were manipulated using masked priming. Performance differed significantly across the visual relationships between prime and target, and thus the effect of syllable shape was verified.

A Reranking Model for Korean Morphological Analysis Based on Sequence-to-Sequence Model (Sequence-to-Sequence 모델 기반으로 한 한국어 형태소 분석의 재순위화 모델)

  • Choi, Yong-Seok;Lee, Kong Joo
    • KIPS Transactions on Software and Data Engineering / v.7 no.4 / pp.121-128 / 2018
  • A Korean morphological analyzer can adopt a sequence-to-sequence (seq2seq) model, which can generate an output sequence whose length differs from that of the input. In general, a seq2seq-based Korean morphological analyzer takes a syllable-unit sequence as input and outputs a syllable-unit sequence. Syllable-based morphological analysis has the advantage that unknown words are easily handled, but the disadvantage that morpheme-based information is ignored. In this paper, we propose a reranking model, used as a post-processor of the seq2seq model, that can improve the accuracy of morphological analysis. The seq2seq-based morphological analyzer can generate K results by using beam search. The reranking model exploits morpheme-unit embedding information as well as morpheme n-grams in order to reorder the K results. The experimental results show that the reranking model improves the F1 score by 1.17% compared with the original seq2seq model.
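
The reranking idea in the abstract above can be illustrated with a minimal sketch. It is not the authors' model: the paper also uses morpheme-unit embeddings, whereas this sketch only combines a candidate's seq2seq score with an add-one smoothed morpheme bigram score. The function names, the interpolation `weight`, and the toy morpheme tags are all assumptions made for illustration.

```python
import math
from collections import Counter


def count_ngrams(corpus):
    """Count morpheme unigrams and bigrams, with a start symbol per sentence."""
    unigrams, bigrams = Counter(), Counter()
    for sentence in corpus:
        tokens = ["<s>"] + list(sentence)
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))
    return unigrams, bigrams


def bigram_logprob(morphemes, unigrams, bigrams):
    """Add-one smoothed bigram log-probability of a morpheme sequence."""
    vocab_size = len(unigrams)
    tokens = ["<s>"] + list(morphemes)
    total = 0.0
    for prev, cur in zip(tokens, tokens[1:]):
        total += math.log((bigrams[(prev, cur)] + 1) / (unigrams[prev] + vocab_size))
    return total


def rerank(candidates, unigrams, bigrams, weight=0.5):
    """Reorder (morphemes, seq2seq_log_score) pairs by an interpolated score.

    weight=0 keeps the seq2seq order; weight=1 uses only the morpheme bigram score.
    """
    def combined(candidate):
        morphemes, seq2seq_score = candidate
        lm_score = bigram_logprob(morphemes, unigrams, bigrams)
        return (1 - weight) * seq2seq_score + weight * lm_score

    return sorted(candidates, key=combined, reverse=True)


# Toy example with hypothetical romanized morpheme/tag tokens.
corpus = [["na/NP", "neun/JX"], ["na/NP", "neun/JX"]]
unigrams, bigrams = count_ngrams(corpus)
k_best = [(["na/NP", "neul/XX"], -1.0), (["na/NP", "neun/JX"], -1.1)]
best = rerank(k_best, unigrams, bigrams)[0][0]
```

In this toy case the beam search ranks the unseen analysis first, and the morpheme bigram score moves the analysis seen in the corpus to the top.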

Development of a Lipsync Algorithm Based on Audio-visual Corpus (시청각 코퍼스 기반의 립싱크 알고리듬 개발)

  • 김진영;하영민;이화숙
    • The Journal of the Acoustical Society of Korea / v.20 no.3 / pp.63-69 / 2001
  • A corpus-based lip sync algorithm for synthesizing natural face animation is proposed in this paper. To obtain the lip parameters, marks were attached to the speaker's face, and the positions of the marks were extracted with image processing methods. The spoken utterances were also labeled with HTK, and prosodic information (duration, pitch, and intensity) was analyzed. An audio-visual corpus was constructed by combining the speech and image information. The basic unit used in our approach is the syllable. Based on this audio-visual corpus, lip information represented by the mark positions was synthesized. That is, the best syllable units are selected from the audio-visual corpus, and the visual information of the selected syllable units is concatenated. There are two processes for obtaining the best units. One is to select the N-best candidates for each syllable. The other is to select the smoothest unit sequence, which is done with the Viterbi decoding algorithm. For these processes, two distance measures between syllable units are proposed: a phonetic environment distance measure and a prosody distance measure. Computer simulation results showed that the proposed algorithm performed well. In particular, pitch and intensity information was shown to be as important as duration information in lip sync.
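
The two-stage selection in the abstract above (N-best candidates per syllable, then a smooth sequence chosen by Viterbi decoding) can be sketched as follows. This is a generic dynamic-programming illustration, not the paper's implementation: `target_costs` stands in for per-candidate distances and `join_cost` for a smoothness penalty between adjacent units, and both names and the toy numbers are assumptions.

```python
def viterbi_select(target_costs, join_cost):
    """Pick one candidate per position so that total cost is minimal.

    target_costs[t][j] is the cost of candidate j at position t.
    join_cost(t, i, j) is the cost of joining candidate i at position t-1
    with candidate j at position t.
    Returns (total_cost, path), where path[t] is the chosen candidate index.
    """
    best = list(target_costs[0])  # best cost of any path ending in each candidate
    backpointers = []
    for t in range(1, len(target_costs)):
        new_best, pointers = [], []
        for j, cost in enumerate(target_costs[t]):
            # Cheapest predecessor for candidate j at position t.
            i_best = min(range(len(best)), key=lambda i: best[i] + join_cost(t, i, j))
            new_best.append(best[i_best] + join_cost(t, i_best, j) + cost)
            pointers.append(i_best)
        best = new_best
        backpointers.append(pointers)

    # Trace back from the cheapest final candidate.
    j = min(range(len(best)), key=best.__getitem__)
    total = best[j]
    path = [j]
    for pointers in reversed(backpointers):
        j = pointers[j]
        path.append(j)
    path.reverse()
    return total, path


# Toy example: three syllables, two candidates each; switching candidates costs 1.
costs = [[0.5, 0.0], [0.0, 2.0], [1.0, 0.5]]
total, path = viterbi_select(costs, lambda t, i, j: 0.0 if i == j else 1.0)
```

With a nonzero join cost the decoder prefers a consistent sequence over the locally cheapest candidate at each position, which is the point of the second selection stage.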

A Study on a Generation of a Syllable Restoration Candidate Set and a Candidate Decrease (음절 복원 후보 집합의 생성과 후보 감소에 관한 연구)

  • 김규식;김경징;이상범
    • Journal of the Korea Computer Industry Society / v.3 no.12 / pp.1679-1690 / 2002
  • This paper describes the generation of syllable restoration rules for the post-processing stage of speech recognition, and the reduction of restoration candidates. To improve the performance of continuous dialogue speech recognition, syllable restoration rules were created that, in the post-processing of a system recognizing syllable-unit phonetic values, generate the restoration candidates that are pronounced as the recognized phonetic values. A method is also presented for reducing the number of candidates in the restoration set by removing rules that generate spellings not used in practice. A restoration candidate set generator was designed and implemented to show that the syllable restoration rules produce a proper restoration candidate set. Tests on standard pronunciation examples and on words extracted at random from a pronunciation dictionary confirmed that the generated candidate set included the spelling of the original utterance.

Fake News Detection Using Deep Learning

  • Lee, Dong-Ho;Kim, Yu-Ri;Kim, Hyeong-Jun;Park, Seung-Myun;Yang, Yu-Jun
    • Journal of Information Processing Systems / v.15 no.5 / pp.1119-1130 / 2019
  • With the widespread use of Social Network Services (SNS), fake news, which is a way of disguising false information as legitimate media, has become a big social issue. This paper proposes a deep learning architecture for detecting fake news written in Korean. Previous works proposed suitable fake news detection models for English, but Korean has two issues that prevent existing models from being applied: (1) Korean can express the same meaning in shorter sentences than English, so it is difficult to operate a deep neural network because of feature scarcity; (2) semantic analysis is difficult because of morpheme ambiguity. We worked to resolve these issues by implementing a system using various convolutional neural network-based deep learning architectures and "Fasttext", a word-embedding model trained on syllable units. After training and testing the implementation, we achieved meaningful accuracy for classifying discrepancies between body and context, but the accuracy was low for classifying discrepancies between headline and body.

The exploration of the effects of word frequency and word length on Korean word recognition (한국어 단어재인에 있어서 빈도와 길이 효과 탐색)

  • Lee, Changhwan;Lee, Yoonhyoung;Kim, Tae Hoon
    • Journal of the Korea Academia-Industrial cooperation Society / v.17 no.1 / pp.54-61 / 2016
  • Because the word is the basic unit of language processing, studies of word recognition and of the variables that contribute to it are very important. Word frequency and word length are recognized as important factors in word recognition. This study examined the effects of those two variables on Korean word recognition. In Experiment 1, two types of Hangul words, pure Hangul words and Hangul words with Hanja counterparts, were used to explore frequency effects. A frequency effect was not observed for Hangul words with Hanja counterparts. In Experiment 2, word length was manipulated to determine whether a word length effect appears in Hangul words. Contrary to expectation, one-syllable words were processed more slowly than two-syllable words. Possible explanations for these results and future research directions are discussed.

Implementation of an Effective Rule Base System for the Change of Korean Vocal Sound (한국어 음운 변동 처리를 위한 효율적인 Rule Base System의 구성)

  • 이규영;이상범
    • Journal of the Korean Institute of Telematics and Electronics B / v.28B no.12 / pp.9-18 / 1991
  • In this paper, a rule-based method for handling Korean sound change is proposed. This method can be used to bridge the gap between the written form (Hangul) and the spoken form (Korean) in the study of Korean speech processing. The sound-change rules were rearranged for the rule base around final consonants, in accordance with the standard pronunciation rules. The implementation simplifies the proposed rule base system for sound change. It is also useful for creating a database of phonetic values for syllable-unit Korean speech processing.

A Study on Recognition Units and Methods to Align Training Data for Korean Speech Recognition (한국어 인식을 위한 인식 단위와 학습 데이터 분류 방법에 대한 연구)

  • 황영수
    • Journal of the Institute of Convergence Signal Processing / v.4 no.2 / pp.40-45 / 2003
  • This is a study of recognition units and the segmentation of phonemes. When building a large-vocabulary speech recognition system, it is better to use the segment than the syllable or the word as the recognition unit. In this paper, we study the proper recognition units and the segmentation of phonemes for Korean speech recognition. For the experiments, we use the speech toolkit of OGI in the U.S.A. The results show that the recognition rate when a diphthong is treated as a single unit is superior to that when a diphthong is treated as two units, i.e., a glide plus a vowel. In addition, a recognizer using manually aligned training data is slightly superior to one using automatically aligned training data. The recognition rate is also better when the biphone is used as the recognition unit than when the monophoneme is used.
