• Title/Summary/Keyword: vocabulary data

A Study on the Korean Syllable As Recognition Unit (인식 단위로서의 한국어 음절에 대한 연구)

  • Kim, Yu-Jin;Kim, Hoi-Rin;Chung, Jae-Ho
    • The Journal of the Acoustical Society of Korea / v.16 no.3 / pp.64-72 / 1997
  • In this paper, we study and experimentally evaluate which recognition unit is suitable for a large-vocabulary recognition system. Specifically, we compare the phoneme, which is currently the usual recognition unit, with the syllable, which characterizes Korean well. Through comparative recognition experiments, we examine whether the syllable can serve as the recognition unit of a Korean recognition system. To report objective results, we collected speech data from one male speaker and hand-segmented and labeled the phoneme boundaries to construct a speech database. For HMM-based training and recognition, we used HTK (HMM Tool Kit) 2.0, a commercial tool from Entropic Co., so that all experiments ran under the same conditions. We applied two HMM topologies, a 5-state model with 3 emitting states and an 8-state model with 6 emitting states, as continuous HMMs when training each recognition unit. We used 3 sets of PBW (Phonetically Balanced Words) and 1 set of POW (Phonetically Optimized Words) for training and another set of PBW for recognition, that is, a speaker-dependent, medium-vocabulary recognition task. The experiments show a recognition rate of 95.65% for the phoneme unit and 94.41% for the syllable unit, and decoding in syllable units is 25% faster than in phoneme units.
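The two topologies named in this abstract (3 emitting states out of 5, and 6 out of 8) match the HTK convention in which the first and last states are non-emitting entry and exit states. A minimal sketch of such a left-to-right transition matrix follows; the function name, the absence of skip transitions, and the initial self-loop probability of 0.6 are illustrative assumptions, not values taken from the paper.

```python
def left_to_right_transitions(num_states, self_loop=0.6):
    """Build an HTK-style left-to-right transition matrix.

    State 0 (entry) and state num_states - 1 (exit) are non-emitting.
    The self_loop value is an assumed starting point for training.
    """
    if num_states < 3:
        raise ValueError("need an entry state, an exit state, and one emitting state")
    matrix = [[0.0] * num_states for _ in range(num_states)]
    matrix[0][1] = 1.0  # the entry state always moves to the first emitting state
    for state in range(1, num_states - 1):
        matrix[state][state] = self_loop            # stay in the same state
        matrix[state][state + 1] = 1.0 - self_loop  # advance to the next state
    # The exit row stays all zeros, as in HTK prototype definitions.
    return matrix


five_state = left_to_right_transitions(5)   # 3 emitting states
eight_state = left_to_right_transitions(8)  # 6 emitting states
```

Training would then re-estimate the non-zero entries; the zero entries fix the topology.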

The Role of Language Development in the Relation from Home Environment to Peer Competence of Young Children (유아의 가정환경과 또래유능성의 관계에서 언어발달의 역할)

  • Chang, Young Eun;Sung, Mi Young
    • Korean Journal of Childcare and Education / v.11 no.6 / pp.1-18 / 2015
  • The purpose of this study was to investigate the effects of young children's home environment on their language development and peer competence. The study hypothesized that when the home environment is desirable, young children are more likely to develop better language skills, which in turn predict greater peer competence and lower levels of aggression and withdrawal in interactions with peers in child care settings. The study used data from 1,802 families who have participated in the Korean Child Panel Study since 2008. The results revealed that both a positive home environment and better language skills were significantly related to more positive play interaction and to reduced play disruption and play disconnection. Home environment significantly predicted better expressive language development, and in turn, higher scores on expressive vocabulary tests predicted greater peer competence and fewer negative play behaviors as rated by child care providers. Statistical tests confirmed that the mediating effect of language skills between home environment and toddlers' peer relationships was significant. The results emphasize the importance of language development in children's expanding social settings and the supporting role of rich and stimulating home environments in children's development.

Speech Synthesis for the Korean Large Vocabulary Through the Waveform Analysis in Time Domains and Evaluation of Synthesized Speech Quality (시간영역에서의 파형분석에 의한 무제한 어휘 합성 및 음절 유형별 규칙합성음 음질평가)

  • Kang, Chan-Hee;Chin, Yong-Ohk
    • The Journal of the Acoustical Society of Korea / v.13 no.1 / pp.71-83 / 1994
  • This paper deals with improving the quality and naturalness of synthesized speech in a Korean TTS (Text-to-Speech) system. We extracted parameters (Table 2) such as the amplitude, duration, and pitch period within a syllable by analyzing speech waveforms (Table 1) in the time domain, and synthesized syllables from them. Based on frequencies in a large-vocabulary Korean pronunciation dictionary, we selected and synthesized 229 syllables: 19 of type V, 80 of type CV, 30 of type VC, and 100 of type CVC. For each of the 4 Korean syllable types in the data format dictionary (Table 3), we tested 15 syllables with the objective MOS (Mean Opinion Score) evaluation method on 4 items, namely intelligibility, clearness, loudness, and naturalness, using a randomly selected group with no prior knowledge of the syllables. The experiments show that the synthesized syllables are very clear and that prosodic elements such as duration, accent, and pitch period can be controlled (Figs. 9, 10, 11, 12).

Automatic Generation of Concatenate Morphemes for Korean LVCSR (대어휘 연속음성 인식을 위한 결합형태소 자동생성)

  • 박영희;정민화
    • The Journal of the Acoustical Society of Korea / v.21 no.4 / pp.407-414 / 2002
  • In this paper, we present a method that automatically generates language models based on concatenated morphemes to improve the performance of Korean large vocabulary continuous speech recognition. The focus is on reducing recognition errors for monosyllabic morphemes, which make up 54% of the training text corpus and are misrecognized more frequently. A knowledge-based method using POS patterns has disadvantages: the rules are difficult to write, and it produces many low-frequency concatenated morphemes. The proposed method automatically selects morpheme pairs from the training text based on measures such as frequency, mutual information, and unigram log likelihood. Experiments were performed using a 7M-morpheme text corpus and a 20K-morpheme lexicon. The frequency measure, with a constraint on the number of morphemes used for concatenation, produced the best result, reducing monosyllables from 54% to 30%, bigram perplexity from 117.9 to 97.3, and MER from 21.3% to 17.6%.
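Two of the selection measures named in this abstract, frequency and mutual information, can be illustrated on a toy corpus. The sketch below ranks adjacent morpheme pairs by pointwise mutual information after a frequency cutoff; the function name, the `min_count` parameter, and the choice to combine the two measures this way are assumptions for illustration, not the paper's actual procedure.

```python
import math
from collections import Counter


def rank_morpheme_pairs(sentences, min_count=2):
    """Rank adjacent morpheme pairs by pointwise mutual information (PMI).

    sentences: list of morpheme lists. Pairs seen fewer than min_count
    times are dropped first (a simple frequency constraint).
    """
    unigrams = Counter()
    bigrams = Counter()
    for morphemes in sentences:
        unigrams.update(morphemes)
        bigrams.update(zip(morphemes, morphemes[1:]))
    total_unigrams = sum(unigrams.values())
    total_bigrams = sum(bigrams.values())

    scored = []
    for (left, right), count in bigrams.items():
        if count < min_count:
            continue
        p_pair = count / total_bigrams
        p_left = unigrams[left] / total_unigrams
        p_right = unigrams[right] / total_unigrams
        pmi = math.log2(p_pair / (p_left * p_right))
        scored.append(((left, right), count, pmi))
    # Highest PMI first: these pairs co-occur far more often than chance.
    scored.sort(key=lambda item: item[2], reverse=True)
    return scored


# Toy data with placeholder morphemes; a real run would use a tagged corpus.
toy_corpus = [["a", "b", "c"], ["a", "b", "d"], ["c", "d"]]
candidates = rank_morpheme_pairs(toy_corpus)
```

High-ranking pairs would then be candidates for merging into a single lexicon entry before the language model is retrained.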

A Comparative Study on Expressive Methods of Finishing Materials for Space Image and Emotional Vocabulary (공간이미지와 감성어휘에 따른 마감재 표현방법 비교 연구)

  • Seo, Ji-Eun;Lee, Gok-Sook
    • Korean Institute of Interior Design Journal / v.21 no.3 / pp.111-118 / 2012
  • The purpose of this study is to focus on living rooms, the most preferred place for changing the space image, and to find how finishing materials are expressed, by selecting spaces that mix and match several images. The study methods are as follows. First, we identify trends in the expression of space images through precedent studies and magazines and examine their relationship with finishing materials. Second, we select space images based on these findings and extract adjectives that represent each space image through an expert survey. Third, we find cases where the space images are expressed based on the extracted words and analyze how finishing materials are expressed. The results are as follows. First, recent space images are actively expressed through finishing materials. Second, the space images selected from trend-related data were classified as modern+natural, modern+traditional, modern+retro, classic+natural, classic+humor, and futurism+natural, and 4 adjectives were extracted for each space image. Third, the expressive elements of finishing materials were extracted as 'material', 'texture', 'color', and 'pattern' through the precedent studies. Fourth, expressive methods of finishing materials for each space image could be suggested by analyzing examples of mix & match based on the extracted content. Lastly, building on this study, further work that evaluates residents' responses and changes in visual perception according to the expression of finishing materials is expected to yield various methods for translating space images into finishing materials.

Language Model based on VCCV and Test of Smoothing Techniques for Sentence Speech Recognition (문장음성인식을 위한 VCCV 기반의 언어모델과 Smoothing 기법 평가)

  • Park, Seon-Hee;Roh, Yong-Wan;Hong, Kwang-Seok
    • The KIPS Transactions:PartB / v.11B no.2 / pp.241-246 / 2004
  • In this paper, we propose VCCV units as the processing unit of a language model and compare them with clauses and morphemes, the existing processing units. Clauses and morphemes have large vocabularies and high perplexity, whereas VCCV units have low perplexity because the lexicon is small and the vocabulary is limited. Constructing a language model also raises the issue of smoothing. Smoothing techniques are used to better estimate probabilities when there is insufficient data to estimate them accurately. We built language models of morphemes, clauses, and VCCV units and calculated their perplexity. The perplexity of VCCV units is lower than that of morpheme and clause units. We constructed N-grams of the low-perplexity VCCV units and tested the language model using Katz, absolute, and modified Kneser-Ney smoothing, among others. In the experimental results, modified Kneser-Ney smoothing proved to be a suitable smoothing technique for VCCV units.
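To make the link between smoothing and perplexity concrete, here is a minimal sketch of a bigram model with interpolated absolute discounting, the simplest of the techniques named in this abstract (it is not modified Kneser-Ney). The discount of 0.75, the add-one unigram fallback, the sentence boundary markers, and the toy units are all assumptions for illustration.

```python
import math
from collections import Counter, defaultdict


def train_bigram(sentences, discount=0.75):
    """Return prob(prev, cur) for a bigram model with absolute discounting."""
    unigrams = Counter()
    bigrams = defaultdict(Counter)
    for units in sentences:
        padded = ["<s>"] + list(units) + ["</s>"]
        unigrams.update(padded[1:])  # every predicted unit, including </s>
        for prev, cur in zip(padded, padded[1:]):
            bigrams[prev][cur] += 1
    total = sum(unigrams.values())
    vocab_size = len(unigrams) + 1  # one extra slot for unseen units

    def unigram_prob(unit):
        # Add-one smoothing keeps unseen units at a non-zero probability.
        return (unigrams[unit] + 1) / (total + vocab_size)

    def prob(prev, cur):
        followers = bigrams.get(prev)
        if not followers:
            return unigram_prob(cur)  # unseen history: use the unigram model
        history_count = sum(followers.values())
        discounted = max(followers[cur] - discount, 0.0) / history_count
        # Mass removed by discounting is redistributed via the unigram model.
        backoff_weight = discount * len(followers) / history_count
        return discounted + backoff_weight * unigram_prob(cur)

    return prob


def perplexity(prob, sentences):
    """Perplexity of the model on held-out sentences (lower is better)."""
    log_sum = 0.0
    count = 0
    for units in sentences:
        padded = ["<s>"] + list(units) + ["</s>"]
        for prev, cur in zip(padded, padded[1:]):
            log_sum += math.log2(prob(prev, cur))
            count += 1
    return 2 ** (-log_sum / count)


model = train_bigram([["a", "b"], ["a", "c"]])
held_out_perplexity = perplexity(model, [["a", "b"]])
```

Swapping the unit inventory (morphemes, clauses, or VCCV units) changes only the tokenized input, so the same two functions can compare units on equal terms.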

L2 Reading Difficulties Faced by Malaysian Students in a Korean University (말레이시아 학생들의 L2 읽기 문제: 한국 대학의 사례를 중심으로)

  • Kim, Kyung-Rahn
    • Journal of Digital Convergence / v.19 no.2 / pp.21-32 / 2021
  • The current study investigates how Malaysian ESL learners' L2 (English) speaking fluency is reflected in advanced L2 reading and what difficulties they encounter in reading comprehension. Nine Malaysian students attending a Korean university participated in qualitative research using in-depth, semi-structured interviews. The data revealed that the L2 was a very familiar language to them and that their L2 speaking fluency reduced their anxiety about L2 reading in general. However, fluency did not play a significant role in reading at an advanced level. Their reading difficulties were mainly due to a lack of vocabulary knowledge, although insufficient background knowledge and interest also hindered their reading tasks. These factors lowered their reading comprehension, causing inaccurate interpretations or discouraging their efforts to find the message of a given text. These findings should therefore be carefully addressed in reading classes for Korean L2 learners as well as international students.

An Augmented Reality-Based Digital App as an Educational Tool for Foreign Language Learning and the Evaluation of Its Learning Effect: Towards an Examination of Learning Motivation, Learning Satisfaction, and Learning Engagement (증강현실(Augmented Reality) 기술 기반의 글자교구재 디지털 앱 개발 사례와 교육효과 평가: 학습동기, 학습만족도, 학습몰입도를 중심으로)

  • Sae Roan Kim;Eun Jin Won;Hyung Gi Kim;Pil Jung Yun
    • Journal of Information Technology Services / v.22 no.4 / pp.141-157 / 2023
  • The present work presents the development of 'Funt', an augmented reality-based digital app designed as an educational tool for foreign language learning. It further evaluates the learning efficacy of the tool on three dependent measures: learning motivation, learning satisfaction, and learning involvement. With the 'Funt' learning app, students can use AR to access recognition-based or location-based experiences in which objects, artifacts, or media appear in the app. Students can then interact with the digital content by manipulating it to learn more about it. Students' engagement should also increase when they create their own AR experience to demonstrate their understanding of a particular concept or word. Learning effects were evaluated on survey data collected from one hundred respondents aged six to nine. A one-group pre-test and post-test design was used to examine differences in learning efficacy by comparing the non-'Funt' group and the 'Funt' group scores. A pairwise t-test was performed to compare the two learning groups. The results indicate that the 'Funt' group scored significantly higher than the non-'Funt' group on learning motivation, learning satisfaction, and learning involvement. Overall, the results suggest that 'Funt' attracted the students' attention, provided them with a fun context for learning English vocabulary, and helped them develop positive motivation and satisfaction towards vocabulary learning through AR technology.
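For a one-group pre-test and post-test design, the usual comparison is a paired t-test on each respondent's score difference. The sketch below computes that statistic, assuming this is what the abstract's pairwise t-test refers to; the function name and the scores are made up for illustration and are not data from the study.

```python
import math
from statistics import mean, stdev


def paired_t_statistic(pre_scores, post_scores):
    """Return (t, degrees_of_freedom) for paired pre/post measurements."""
    if len(pre_scores) != len(post_scores):
        raise ValueError("each respondent needs one pre and one post score")
    differences = [post - pre for pre, post in zip(pre_scores, post_scores)]
    n = len(differences)
    if n < 2:
        raise ValueError("at least two pairs are required")
    spread = stdev(differences)  # sample standard deviation of the differences
    if spread == 0:
        raise ValueError("all differences are identical; t is undefined")
    t = mean(differences) / (spread / math.sqrt(n))
    return t, n - 1


# Hypothetical motivation scores for four respondents.
t_value, dof = paired_t_statistic([1, 2, 3, 4], [2, 4, 5, 8])
```

The resulting t value is then compared against the t distribution with `dof` degrees of freedom to obtain a p-value.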

Korean Speech Recognition Based on Syllable (음절을 기반으로한 한국어 음성인식)

  • Lee, Young-Ho;Jeong, Hong
    • Journal of the Korean Institute of Telematics and Electronics B / v.31B no.1 / pp.11-22 / 1994
  • In a conventional word-based system, it is very difficult to enlarge the vocabulary. To cope with this problem, we must use more fundamental units of speech, such as syllables and phonemes. Korean speech consists of initial consonants, middle vowels, and final consonants, and has the characteristic that syllables can be obtained from speech easily. In this paper, we present a speech recognition system that takes advantage of this syllable structure, which is peculiar to Korean speech. The recognition algorithm is the Time Delay Neural Network. To handle many recognition units, the system consists of separate neural networks for recognizing initial consonants, middle vowels, and final consonants. The system first recognizes initial consonants, middle vowels, and final consonants, and then uses these results to recognize isolated words. In experiments, we obtained recognition rates of 85.12% on 2735 initial consonant data, 86.95% on 3110 middle vowel data, and 90.58% on 1615 final consonant data, and a recognition rate of 71.2% on 250 isolated word data.
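The three-part syllable structure this abstract relies on is also visible in how Unicode encodes precomposed Hangul syllables: 19 initial consonants, 21 medial vowels, and 28 final positions (including "no final") map arithmetically onto code points starting at U+AC00. The sketch below shows only that arithmetic as an illustration of the structure; it is not the paper's recognizer, and the function names are hypothetical.

```python
HANGUL_BASE = 0xAC00  # code point of the first precomposed syllable
NUM_MEDIALS = 21      # medial vowels
NUM_FINALS = 28       # final consonants, where index 0 means "no final"
NUM_INITIALS = 19     # initial consonants


def compose_syllable(initial, medial, final=0):
    """Combine initial/medial/final indices into one Hangul syllable."""
    if not (0 <= initial < NUM_INITIALS and 0 <= medial < NUM_MEDIALS
            and 0 <= final < NUM_FINALS):
        raise ValueError("index out of range")
    return chr(HANGUL_BASE + (initial * NUM_MEDIALS + medial) * NUM_FINALS + final)


def decompose_syllable(syllable):
    """Split a precomposed Hangul syllable into (initial, medial, final)."""
    offset = ord(syllable) - HANGUL_BASE
    if not 0 <= offset < NUM_INITIALS * NUM_MEDIALS * NUM_FINALS:
        raise ValueError("not a precomposed Hangul syllable")
    initial, remainder = divmod(offset, NUM_MEDIALS * NUM_FINALS)
    medial, final = divmod(remainder, NUM_FINALS)
    return initial, medial, final


parts = decompose_syllable("한")  # indices of the three components
```

Three small classifiers over 19, 21, and 28 classes cover all 11,172 precomposed syllables, which is the kind of saving a component-wise recognizer aims for.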

Rapid Speaker Adaptation Based on Eigenvoice Using Weight Distribution Characteristics (가중치 분포 특성을 이용한 Eigenvoice 기반 고속화자적응)

  • 박종세;김형순;송화전
    • The Journal of the Acoustical Society of Korea / v.22 no.5 / pp.403-407 / 2003
  • Recently, the eigenvoice approach has been widely used for rapid speaker adaptation. However, even with this approach, the performance improvement from a very small amount of adaptation data is relatively small compared with that from a somewhat larger amount, because the eigenvoice weights are difficult to estimate reliably. In this paper, we propose a rapid speaker adaptation method based on eigenvoices that uses the characteristics of the weight distribution to improve performance with a small amount of adaptation data. In experiments on a vocabulary-independent word recognition task (using the PBW 452 database), the weight threshold method alleviates the problem of relatively low performance with very little adaptation data. When a single adaptation word is used, the word error rate is reduced by about 9-18% with the weight threshold method.
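In the eigenvoice approach, an adapted speaker model is the mean voice plus a weighted sum of eigenvoice vectors. The sketch below shows that combination; the optional clipping of each weight to a range is only one plausible reading of a "weight threshold" and is an assumption, since the abstract does not specify the method. All names and numbers are hypothetical.

```python
def adapt_supervector(mean_voice, eigenvoices, weights, weight_limits=None):
    """Return mean_voice + sum(weights[k] * eigenvoices[k]).

    weight_limits: optional list of (low, high) pairs. Clipping weights to
    these ranges is an assumed stand-in for the paper's threshold method.
    """
    if len(eigenvoices) != len(weights):
        raise ValueError("one weight is required per eigenvoice")
    if weight_limits is not None:
        if len(weight_limits) != len(weights):
            raise ValueError("one (low, high) pair is required per weight")
        # Keep unreliable weight estimates inside a plausible range.
        weights = [max(low, min(high, w))
                   for w, (low, high) in zip(weights, weight_limits)]
    adapted = list(mean_voice)
    for weight, voice in zip(weights, eigenvoices):
        for index, component in enumerate(voice):
            adapted[index] += weight * component
    return adapted


# Toy 2-dimensional example with two eigenvoices.
adapted = adapt_supervector(
    mean_voice=[1.0, 2.0],
    eigenvoices=[[1.0, 0.0], [0.0, 1.0]],
    weights=[5.0, -0.5],
    weight_limits=[(-1.0, 1.0), (-1.0, 1.0)],
)
```

In practice the limits would come from the weight distribution observed over training speakers, which is the kind of prior the abstract alludes to.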