• Title/Summary/Keyword: Language Adaptation

Search Result 181, Processing Time 0.022 seconds

N- gram Adaptation Using Information Retrieval and Dynamic Interpolation Coefficient (정보검색 기법과 동적 보간 계수를 이용한 N-gram 언어모델의 적응)

  • Choi Joon Ki;Oh Yung-Hwan
    • MALSORI
    • /
    • no.56
    • /
    • pp.207-223
    • /
    • 2005
  • The goal of language model adaptation is to improve the background language model with a relatively small adaptation corpus. This study presents a language model adaptation technique where additional text data for the adaptation do not exist. We propose the information retrieval (IR) technique with N-gram language modeling to collect the adaptation corpus from baseline text data. We also propose to use a dynamic language model interpolation coefficient to combine the background language model and the adapted language model. The interpolation coefficient is estimated from the word hypotheses obtained by segmenting the input speech data reserved for held-out validation data. This allows the final adapted model to improve the performance of the background model consistently The proposed approach reduces the word error rate by $13.6\%$ relative to baseline 4-gram for two-hour broadcast news speech recognition.

  • PDF

Language Model Adaptation Based on Topic Probability of Latent Dirichlet Allocation

  • Jeon, Hyung-Bae;Lee, Soo-Young
    • ETRI Journal
    • /
    • v.38 no.3
    • /
    • pp.487-493
    • /
    • 2016
  • Two new methods are proposed for an unsupervised adaptation of a language model (LM) with a single sentence for automatic transcription tasks. At the training phase, training documents are clustered by a method known as Latent Dirichlet allocation (LDA), and then a domain-specific LM is trained for each cluster. At the test phase, an adapted LM is presented as a linear mixture of the now trained domain-specific LMs. Unlike previous adaptation methods, the proposed methods fully utilize a trained LDA model for the estimation of weight values, which are then to be assigned to the now trained domain-specific LMs; therefore, the clustering and weight-estimation algorithms of the trained LDA model are reliable. For the continuous speech recognition benchmark tests, the proposed methods outperform other unsupervised LM adaptation methods based on latent semantic analysis, non-negative matrix factorization, and LDA with n-gram counting.

Style-Specific Language Model Adaptation using TF*IDF Similarity for Korean Conversational Speech Recognition

  • Park, Young-Hee;Chung, Min-Hwa
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.2E
    • /
    • pp.51-55
    • /
    • 2004
  • In this paper, we propose a style-specific language model adaptation scheme using n-gram based tf*idf similarity for Korean spontaneous speech recognition. Korean spontaneous speech shows especially different style-specific characteristics such as filled pauses, word omission, and contraction, which are related to function words and depend on preceding or following words. To reflect these style-specific characteristics and overcome insufficient data for training language model, we estimate in-domain dependent n-gram model by relevance weighting of out-of-domain text data according to their n-. gram based tf*idf similarity, in which in-domain language model include disfluency model. Recognition results show that n-gram based tf*idf similarity weighting effectively reflects style difference.

Korean Children's Perception of English Language Acquisition and Cultural Adaptation in Australia

  • Park, Joo-Kyung
    • English Language & Literature Teaching
    • /
    • v.13 no.4
    • /
    • pp.127-152
    • /
    • 2007
  • Recently, the number of students to choose to study in Australia has been increasing significantly. The purpose of this study is to examine how Korean primary school children perceive their own English language learning and cultural adaptation in Australia. A questionnaire survey was conducted with 34 Korean children aged 8-13 who were attending primary schools in Brisbane, Queensland. The study results show that they made diverse efforts to learn English language and culture in Australia, such as making English-speaking friends, watching TV/video/DVD, reading English books, and studying with a foreign tutor. Their English listening and writing abilities were thought to be improved most, followed by speaking, reading and cultural understanding after studying in Australia. The subjects were mostly satisfied with their study and life in Australia but they had difficulties with communicating in English, homesickness, foods, weather, insects, and discrimination. In particular, they had problems with understanding classes conducted all in English and participating in the classroom activities due to their low level of English ability and understanding of Australian classroom culture. The findings of this study have pedagogical implications for educators both in Australia and Korea.

  • PDF

Improvement of Korean Sign Language Recognition System by User Adaptation (사용자 적응을 통한 한국 수화 인식 시스템의 개선)

  • Jung, Seong-Hoon;Park, Kwang-Hyun;Bien, Zeung-Nam
    • Proceedings of the KIEE Conference
    • /
    • 2007.04a
    • /
    • pp.301-303
    • /
    • 2007
  • This paper presents user adaptation methods to overcome limitations of a user-independent model and a user-dependent model in a Korean sign language recognition system. To adapt model parameters for unobserved states in hidden Markov models, we introduce new methods based on motion similarity and prediction from adaptation history so that we can achieve faster adaption and higher recognition rates comparing with previous methods.

  • PDF

Probing Sentence Embeddings in L2 Learners' LSTM Neural Language Models Using Adaptation Learning

  • Kim, Euhee
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.3
    • /
    • pp.13-23
    • /
    • 2022
  • In this study we leveraged a probing method to evaluate how a pre-trained L2 LSTM language model represents sentences with relative and coordinate clauses. The probing experiment employed adapted models based on the pre-trained L2 language models to trace the syntactic properties of sentence embedding vector representations. The dataset for probing was automatically generated using several templates related to different sentence structures. To classify the syntactic properties of sentences for each probing task, we measured the adaptation effects of the language models using syntactic priming. We performed linear mixed-effects model analyses to analyze the relation between adaptation effects in a complex statistical manner and reveal how the L2 language models represent syntactic features for English sentences. When the L2 language models were compared with the baseline L1 Gulordava language models, the analogous results were found for each probing task. In addition, it was confirmed that the L2 language models contain syntactic features of relative and coordinate clauses hierarchically in the sentence embedding representations.

A Prediction Model on Adaptation to University Life among Chinese International Students in Korea (중국 유학생의 한국 대학생활 적응 예측모형)

  • Lin, Qin Lan;Kim, Hee-Kyung
    • The Journal of Korean Academic Society of Nursing Education
    • /
    • v.17 no.3
    • /
    • pp.501-513
    • /
    • 2011
  • Purpose: On the basis of the theoretical framework of a combination of Roy's adaptation theory and Lazarus & Folkman's theory of stress - appraise coping, the purpose of this study was to predict effect factors of adaptation to university life of Chinese international students in Korea. After this, a model of adaptation to university life of Chinese international students in Korea was constructed. Methods: A questionnaire was used to survey 369 Chinese international students from one university in Korea, which was analyzed by using PASW Statistics 18.0 and LISREL 8.7. Results: This theoretical model explained adaptation to university life of Chinese international students at 75.0% in Korea. Physical symptoms, loneliness, acculturation stress and self-efficacy directly affected the adaptation to university life. Korean language proficiency indirectly affected adaptation to university life through self-efficacy. Conclusion: Results of this study provided theoretical basis for the future health care of university- centered health centers. For improving adaptation to university life of Chinese international students in Korea, education and nursing measures for reducing physical symptoms, loneliness and acculturation stress, and improving Korean language proficiency and self-efficacy are proposed for further research and development.

Development of Revised Korean Version of ICF (ICF 한글개정판 개발)

  • Lee, Haejung;Song, Jumin
    • The Journal of Korean Physical Therapy
    • /
    • v.26 no.5
    • /
    • pp.344-350
    • /
    • 2014
  • Purpose: The purpose of this study was to translate and culturally adapt the International Classification of Functioning, Disability and Health (ICF) into the Korean language. Methods: The process of translation and adaptation of the ICF used here followed the translation guidelines of WHO. Implementation of this procedure comprised of four steps; forward translation, expert panel back-translation, pre-testing and cognitive interviewing, and final adaptation. The translators included health professionals with knowledge of ICF and non-health professionals blinded to the ICF. Clinical academics with significant experience in the use of disability survey, medical doctors, special educators, related policy makers, clinicians, architecture professionals, and international experts in ICF were invited to integrate all versions of the ICF for testing; 151 clinicians volunteered from 19 medical institutes across the country. Four different core-sets and a questionnaire were used for testing its practical usability and adaptation. Results: All translations were reviewed and a consensus was reached on any discrepancy from the earlier versions. Over 90% of the newly translated version of K-ICF was found to be different from the 2004 K-ICF version in the ICF language. Understanding of K-ICF language was responded difficult and very difficult by 50% of participants, whereas its practical use was responded 'useful' by more than 50% of subjects. Conclusion: It can be suggested that the new version of K-ICF should be widely used for final adaptation in the field of areas. Future studies will be required for implementation of K-ICF.

Performance of Vocabulary-Independent Speech Recognizers with Speaker Adaptation

  • Kwon, Oh Wook;Un, Chong Kwan;Kim, Hoi Rin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.1E
    • /
    • pp.57-63
    • /
    • 1997
  • In this paper, we investigated performance of a vocabulary-independent speech recognizer with speaker adaptation. The vocabulary-independent speech recognizer does not require task-oriented speech databases to estimate HMM parameters, but adapts the parameters recursively by using input speech and recognition results. The recognizer has the advantage that it relieves efforts to record the speech databases and can be easily adapted to a new task and a new speaker with different recognition vocabulary without losing recognition accuracies. Experimental results showed that the vocabulary-independent speech recognizer with supervised offline speaker adaptation reduced 40% of recognition errors when 80 words from the same vocabulary as test data were used as adaptation data. The recognizer with unsupervised online speaker adaptation reduced abut 43% of recognition errors. This performance is comparable to that of a speaker-independent speech recognizer trained by a task-oriented speech database.

  • PDF

Language Model Adaptation for Conversational Speech Recognition (대화체 연속음성 인식을 위한 언어모델 적응)

  • Park Young-Hee;Chung Minhwa
    • Proceedings of the KSPS conference
    • /
    • 2003.05a
    • /
    • pp.83-86
    • /
    • 2003
  • This paper presents our style-based language model adaptation for Korean conversational speech recognition. Korean conversational speech is observed various characteristics of content and style such as filled pauses, word omission, and contraction as compared with the written text corpora. For style-based language model adaptation, we report two approaches. Our approaches focus on improving the estimation of domain-dependent n-gram models by relevance weighting out-of-domain text data, where style is represented by n-gram based tf*idf similarity. In addition to relevance weighting, we use disfluencies as predictor to the neighboring words. The best result reduces 6.5% word error rate absolutely and shows that n-gram based relevance weighting reflects style difference greatly and disfluencies are good predictor.

  • PDF