• Title/Summary/Keyword: Korean learners' corpus

Search Result 40, Processing Time 0.023 seconds

Studying the frequencies of sentence pattern for a entence patterns dictionary (문형 사전을 위한 문형 빈도 조사)

  • Kim Yu-Mi
    • Korean Journal of Cognitive Science
    • /
    • v.16 no.2
    • /
    • pp.123-140
    • /
    • 2005
  • The purpose of this paper is to examine the frequency and usage of sentence patterns appearing in electronic dictionaries used in Korean language education in order to design an automatic sentence patterns checking. First, the concept of sentence patterns is defined and it is classified into sentence structure patterns and sentencial expression patterns. Sentence structure patterns and sentencial expression patterns are analyzed how they are expressed in the Korean Learner's Corpus. learner's Corpus is built into the Standard Corpus, which all Korean Learners must learn, and the Errors Corpus made by learners. From these research, we will find out how frequently the Sentential Patterns are being used in the Standard Corpus which has been made of Korean Texts and how the Sentential Pattern are being used in the Errors Corpus which were constructed from Korean learner's writings. Finally, having described the Sentential Patterns on the Sentential Electric Dictionary, we determine the optimum speed in the search for the Sentential Pattern.

  • PDF

A Study to Rethink the Components of Teaching Korean Genitive Particle '의': Based on the Errors in Korean Learners' Corpus (한국어 학습자 대상 관형격 조사 '의'의 교육 내용 재고: 학습자 말뭉치에 나타난 오류를 바탕으로)

  • Soo-Hyun Lee;Ji-Young Sim
    • Journal of the Korean Society of Industry Convergence
    • /
    • v.26 no.3
    • /
    • pp.443-454
    • /
    • 2023
  • The purpose of this study is to reveal the Korean learners' usage pattern of '의', the genitive particle, according to semantic classification, so that it can be referred to in determining the contents and methods of related education. The method of this study adopts a quantitative analysis using learners corpus established by National Institute of Korean Language. As a result of the analysis, as proficiency increases, the overall frequency of '의' increases and the number of meaning senses used increases. However, the frequency of errors also increases with it. As for the usage pattern of each sense, the meaning of 'ownership, belonging' is the most frequent, and followed by 'acting entity', 'kinship, social relations', and 'relationship(area)'. In conclusion, the meanings of 'acting subjects' and 'relationships(area) need to be supplemented with explicit education. Other meanings need to be discussed, and decisions should be made in consideration of learning purpose and proficiency.

Analysis of Phonemic Errors of Korean Learners According to Language and Proficiency (언어권과 숙달도에 따른 한국어 학습자의 발음 오류 분석 - 음소 오류를 중심으로 -)

  • 유소영;강현화
    • Language Facts and Perspectives
    • /
    • v.44
    • /
    • pp.357-397
    • /
    • 2018
  • The purpose of this paper is to investigate the phonemic errors in Korean learner's spoken corpus. Through this, we tried to investigate the common errors and the errors in certain languages. The results of the analysis were as follows. First, Errors that distinguish three phonemes(plain sound, tense sound, aspiration sound) were high in all languages. In the middle phonemes, the most common errors in pronouncing 'ㅓ' in all languages. Second, the errors of each language are different. Comparing the ratios by position, Chinese characters had the most common errors with 50% in final phoneme, and the Japanese language showed equal errors in initial, middle, and end. In English, initial phoneme errors accounted for 58%. Vietnamese Learners showed intensive errors in the initial and final phoneme. Third, in addition to the phoneme errors, we also examined the allophone errors and foreign language pronunciation errors. The allophone errors are mainly concentrated in 'ㄹ', ​​and the pronunciation of the foreign language is mainly used in the source language or the native language of the learners. This paper analyzes the phoneme errors in the Learner's spoken language through the spoken corpus data with representative and annotation consistency. Through this study, we could compare the difference of phoneme errors of Main Korean learners.

Using the Deep Learning Techniques for Understanding the nativelikeness of Korean EFL Learners (한국인 영어학습자의 영어 문장은 얼마나 원어민스러운가: 딥러닝 기반 분석)

  • 박권식;유석훈;송상헌
    • Language Facts and Perspectives
    • /
    • v.48
    • /
    • pp.195-227
    • /
    • 2019
  • Building upon the state-of-the-art deep learning techniques, the present study classifies the texts written by Korean EFL learners and English native speakers and thereby demonstrates how the two types of texts differ from each other. To this end, the current work makes use of the Yonsei English Learner Corpus (YELC) and Gacheon Learner Corpus (GLC) as the L2 data, and Corpus of Contemporary American English (COCA) as the L1 data. Utilizing the sentence classification methods, the current work implements a system to differentiate the two types of texts, the accuracy of which is about 94%. This indicates that the deep leaning-based system is capable of identifying the well-formedness and felicities of the texts written by Korean EFL learners. Nonetheless, the system-based judgments do not overlap with human judgments largely because the deep learning model exclusively focuses on sequence of words. The present study provides a further analysis to see how the two types of judgments differ with respect to grammatical errors (e.g., word order, voice, etc.) and felicity errors (e.g., semantic prosody, the position of adverbs, etc.).

Acquisition and Development of particles of Beginner Level Korean Language Learners (초급 한국어 학습자의 조사 습득 및 발달 연구)

  • 이승연;이유경;최은지;이선영
    • Language Facts and Perspectives
    • /
    • v.48
    • /
    • pp.505-541
    • /
    • 2019
  • This research aims to analyze Korean language learners' spoken corpus to reveal their acquisition order and development patterns of particles. To this end, we collected free conversation data of beginning level Korean language learners over five months and constructed a corpus. It was confirmed that particle acquisition takes place over four stages based on the frequency of particle use and its accuracy. The stages of development were first 'ey, un/nun, i/ka(nominative), ul/lul', second 'eyse, hako(conjunction), to, hako(adverbial)', third '(u)lo, pota, man, eykey, kkaci, puthe, kkeyse, ui', and fourth 'hanthey, (i)na(conjunction), wa/kwa(conjunction), kkey, (i)lang(adverbial), eykeyse, mata, wa/kwa(adverbial), (i)na(auxiliary particle), pakkey, (i)lang(conjunction)'. Based on these findings, the characteristics shown in the particle use of beginning level learners are as the following. First, case markers start to develop foremost. Second, the accuracy of each particle use tends to decrease slightly over time. Third, the frequency of some particles was observed to suddenly increase and then decrease again at a certain period. Fourth, the order of most, but not all particles' appearance seemed to be related to the order of being introduced in textbooks. It is important that this research provides implications for grammar education when establishing Korean language education curriculum or developing grammar syllabus.

Is Compared to Different from Compared with? A Discussion of Prepositions that Are Particularly Difficult for EFL Learners

  • Lee, Seung-Ah
    • Journal of English Language & Literature
    • /
    • v.55 no.6
    • /
    • pp.1057-1085
    • /
    • 2009
  • This paper addresses the question of why prepositions are particularly difficult forEFL learners. The first reason for such difficulty lies in the distinction between seemingly equivalent prepositions such as to and with, as in compared to and compared with. Most monolingual learners' dictionaries regard these two phrases as virtually synonymous. Yet, the results of the corpus analysis conducted in this study indicate that there are differences between the two. A second reason why EFL learners have problems with prepositions is that there are often variations in the inputdata. For example, although from generally follows different, in American English different than is also used. On the other hand, in British English, different to is the second most commonly used construction. This type of regional variation, confirmed in the corpus findings of the present paper, causes confusion in students of English. A learner who is not accustomed to British English may be puzzled by the expression different to. Finally, L1 negative transfer is responsible for the incorrect use of expressions such as discuss about. An error of this sort is the result of interference from the learner's mother tongue. The English verb discuss is not subcategorized for a preposition, whereas the equivalent Korean verb, for example, requires a noun phrase combined with the postposition.

A Comparative Study on Oral Fluency Between Korean Native Speakers and L2 Korean Learners in Speech Discourse - With Focus on Speech Rate, Pause, and Discourse Markers (발표 담화에서의 한국어 모어 화자와 한국어 학습자의 말하기 유창성 비교 연구 -발화 속도, 휴지, 담화표지를 중심으로-)

  • Lee, Jin;Jung, Jinkyung
    • Journal of Korean language education
    • /
    • v.29 no.4
    • /
    • pp.137-168
    • /
    • 2018
  • The purpose of this study is to prepare the basis for a more objective evaluation of oral fluency by comparing speech patterns of Korean native speakers and L2 Korean learners. For this purpose, the current study focused on the analysis of speech materials of the 21st century Sejong spoken corpus and Korean learner corpus. We compared the oral fluency of Korean native speakers and Korean learners based on speech rate, pause, and discourse markers. The results show that the pattern of Korean learners is different to that of Korean native speakers in all aspects of speech rate, pause, and discourse markers; even though proficiency of Korean leaners show increase, they could not reach the oral fluency level of Korean native speakers. At last, based on these results of the analysis, we added suggestions for setting the evaluation criteria of oral fluency of Korean learners.

Automatic pronunciation assessment of English produced by Korean learners using articulatory features (조음자질을 이용한 한국인 학습자의 영어 발화 자동 발음 평가)

  • Ryu, Hyuksu;Chung, Minhwa
    • Phonetics and Speech Sciences
    • /
    • v.8 no.4
    • /
    • pp.103-113
    • /
    • 2016
  • This paper aims to propose articulatory features as novel predictors for automatic pronunciation assessment of English produced by Korean learners. Based on the distinctive feature theory, where phonemes are represented as a set of articulatory/phonetic properties, we propose articulatory Goodness-Of-Pronunciation(aGOP) features in terms of the corresponding articulatory attributes, such as nasal, sonorant, anterior, etc. An English speech corpus spoken by Korean learners is used in the assessment modeling. In our system, learners' speech is forced aligned and recognized by using the acoustic and pronunciation models derived from the WSJ corpus (native North American speech) and the CMU pronouncing dictionary, respectively. In order to compute aGOP features, articulatory models are trained for the corresponding articulatory attributes. In addition to the proposed features, various features which are divided into four categories such as RATE, SEGMENT, SILENCE, and GOP are applied as a baseline. In order to enhance the assessment modeling performance and investigate the weights of the salient features, relevant features are extracted by using Best Subset Selection(BSS). The results show that the proposed model using aGOP features outperform the baseline. In addition, analysis of relevant features extracted by BSS reveals that the selected aGOP features represent the salient variations of Korean learners of English. The results are expected to be effective for automatic pronunciation error detection, as well.

This study revises Lee Hyo-seok's The Buckwheat Season, utilizing Novel Corpus, intermediate learners' level (소설텍스트의 난이도 조정 방안 연구 -이효석의 「메밀꽃 필 무렵」을 중심으로-)

  • Hwang, Hye ran
    • Journal of Korean language education
    • /
    • v.29 no.4
    • /
    • pp.255-294
    • /
    • 2018
  • The Buckwheat Season, evaluated as the best of Lee Hyo-seok's literature, is one of the short stories that represent Korean literature. However, vivid literary expressions such as lyrical and beautiful depictions, figurative expressions and dialects, which show the Korean beauty, rather make learners have difficulty and become a factor that fails in reading comprehension. Thus, it is necessary to revise and present the text modified for the learners' language level. The methods of revising a literary text include the revision of linguistic elements such as cryptic vocabulary or sentence structure and the revision of the composition of the text, e.g. suggestion of characters or plot, or insertion of illustration. The methods of revising the language of the text can be divided into methods of simplification and detailing. However, in the process of revising the text, many depend on the adapter's subjective perception, not revising it with objective criteria. This paper revised the text, utilizing by the Academy of Korean Studies, , and the by the National Institute of Korean Language to secure objectivity in revising the text.

A Corpus-Based Study on Korean EFL Learners' Use of English Logical Connectors

  • Ha, Myung-Jeong
    • International Journal of Contents
    • /
    • v.10 no.4
    • /
    • pp.48-52
    • /
    • 2014
  • The purpose of this study was to examine 30 logical connectors in the essay writing of Korean university students for comparison with the use in similar types of native English writing. The main questions addressed were as follows: Do Korean EFL students tend to over- or underuse logical connectors? What types of connectors differentiate Korean learners from native use? To answer these questions, EFL learner data were compared with data from native speakers using computerized corpora and linguistic software tools to speed up the initial stage of the linguistic analysis. The analysis revealed that Korean EFL learners tend to overuse logical connectors in the initial position of the sentence, and that they tend to overuse additive connectors such as 'moreover', 'besides', and 'furthermore', whereas they underuse contrastive connectors such as 'yet' and 'instead'. On the basis of the results of this study, some pedagogical implications are made concerning the need for teaching of the semantic, stylistic, and syntactic behavior of logical connectors.