• Title/Summary/Keyword: 단어 유사도 분석

Search Result 232, Processing Time 0.033 seconds

An English Essay Scoring System Based on Grammaticality and Lexical Cohesion (문법성과 어휘 응집성 기반의 영어 작문 평가 시스템)

  • Kim, Dong-Sung;Kim, Sang-Chul;Chae, Hee-Rahk
    • Korean Journal of Cognitive Science
    • /
    • v.19 no.3
    • /
    • pp.223-255
    • /
    • 2008
  • In this paper, we introduce an automatic system of scoring English essays. The system is comprised of three main components: a spelling checker, a grammar checker and a lexical cohesion checker. We have used such resources as WordNet, Link Grammar/parser and Roget's thesaurus for these components. The usefulness of an automatic scoring system depends on its reliability. To measure reliability, we compared the results of automatic scoring with those of manual scoring, on the basis of the Kappa statistics and the Multi-facet Rasch Model. The statistical data obtained from the comparison showed that the scoring system is as reliable as professional human graders. This system deals with textual units rather than sentential units and checks not only formal properties of a text but also its contents.

  • PDF

Perception and Production of English Geminate Graphemes by Korean Students (한국 학생들의 영어 겹자음 철자 인지와 발화)

  • Cho, Mi-Hui
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2009.05a
    • /
    • pp.1092-1096
    • /
    • 2009
  • While Korean allows the same consonants at the coda of the preceding syllable and at the onset of the following syllable, English does not allow the geminate consonant in the same position. Due to this difference between Korean and English, Korean learners of English tend to incorrectly produce geminate consonants for English geminate graphemes as in summer. Based on this observation, a pilot study was designed to investigate how Korean learners of English perceive and produce English doubleton graphemes and singleton graphemes. Twenty Korean college students were asked to perform a forced-choice perception test as well as a production test for the 36 real word stimuli which consist of near minimal pairs of singleton and doubleton graphemes. The result showed that the accuracy rates for the word with singleton graphemes were relatively high both in perception and production (78.6% and 76.1%, respectively), while those for the word with doubleton graphemes were low both in perception and production (55.3% and 61.7%, respectively). Also, spectrographic analyses were provided where more production errors were witnessed in doubleton grapheme words than singleton grapheme words.

  • PDF

A Study on the Property Values of News Articles and Copyright Infringement (보도기사의 재산권적 가치와 무단전재를 통한 저작권 침해에 관한 연구)

  • Kim, Gyong-Ho
    • Korean journal of communication and information
    • /
    • v.39
    • /
    • pp.324-354
    • /
    • 2007
  • Facts, which constitute news, are as free as air. When they are transformed into news via labor and capital investment of a news organization, the news is deemed to have property values, and the media can claim exclusive rights over the news. The copyright law protects the originality of a work, the uniqueness of reporter's analysis, the selection of words, the arrangement of materials, and the emphasis given on particular points. The name of the game of copyright infringement lies in the infringement of the similarity of the method of expression, not the infringement of the subject. Even though news articles convey information by specifying factual elements of an event or accident, they still have some originality. The judgement that news articles lack of originality is inconsistent with the purpose of the copyright law. Therefore, the law should be amended to articulate that the unauthorized use of news articles without a proper citation shall be the subject of legal action, and courts should decide related cases accordingly.

  • PDF

Categorization of Korean News Articles Based on Convolutional Neural Network Using Doc2Vec and Word2Vec (Doc2Vec과 Word2Vec을 활용한 Convolutional Neural Network 기반 한국어 신문 기사 분류)

  • Kim, Dowoo;Koo, Myoung-Wan
    • Journal of KIISE
    • /
    • v.44 no.7
    • /
    • pp.742-747
    • /
    • 2017
  • In this paper, we propose a novel approach to improve the performance of the Convolutional Neural Network(CNN) word embedding model on top of word2vec with the result of performing like doc2vec in conducting a document classification task. The Word Piece Model(WPM) is empirically proven to outperform other tokenization methods such as the phrase unit, a part-of-speech tagger with substantial experimental evidence (classification rate: 79.5%). Further, we conducted an experiment to classify ten categories of news articles written in Korean by feeding words and document vectors generated by an application of WPM to the baseline and the proposed model. From the results of the experiment, we report the model we proposed showed a higher classification rate (89.88%) than its counterpart model (86.89%), achieving a 22.80% improvement. Throughout this research, it is demonstrated that applying doc2vec in the document classification task yields more effective results because doc2vec generates similar document vector representation for documents belonging to the same category.

A Watermarking for Text Document Images using Edge Direction Histograms (에지 방향 히스토그램을 이용한 텍스트 문서 영상의 워터마킹)

  • 김영원;오일석
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.2
    • /
    • pp.203-212
    • /
    • 2004
  • The watermarking is a method to achieve the copyright protection of multimedia contents. Among several media, the left documents show very peculiar properties: block/line/word patterning, clear separation between foreground and background areas. So algorithms specific to the text documents are required that meet those properties. This paper proposes a novel watermarking algorithm for the grayscale text document images. The algorithm inserts the watermark signals through the edge direction histograms. A concept of sub-image consistency is developed that the sub-images have similar shapes in terms of edge direction histograms. Using Korean, Chinese, and English document images, the concept is evaluated and proven to be valid over a wide range of document images. To insert watermark signals, the edge direction histogram is modified slightly. The experiments were performed on various document images and the algorithm was evaluated in terms of imperceptibility and robustness.

Performance of Korean spontaneous speech recognizers based on an extended phone set derived from acoustic data (음향 데이터로부터 얻은 확장된 음소 단위를 이용한 한국어 자유발화 음성인식기의 성능)

  • Bang, Jeong-Uk;Kim, Sang-Hun;Kwon, Oh-Wook
    • Phonetics and Speech Sciences
    • /
    • v.11 no.3
    • /
    • pp.39-47
    • /
    • 2019
  • We propose a method to improve the performance of spontaneous speech recognizers by extending their phone set using speech data. In the proposed method, we first extract variable-length phoneme-level segments from broadcast speech signals, and convert them to fixed-length latent vectors using an long short-term memory (LSTM) classifier. We then cluster acoustically similar latent vectors and build a new phone set by choosing the number of clusters with the lowest Davies-Bouldin index. We also update the lexicon of the speech recognizer by choosing the pronunciation sequence of each word with the highest conditional probability. In order to analyze the acoustic characteristics of the new phone set, we visualize its spectral patterns and segment duration. Through speech recognition experiments using a larger training data set than our own previous work, we confirm that the new phone set yields better performance than the conventional phoneme-based and grapheme-based units in both spontaneous speech recognition and read speech recognition.

An analysis of the signaling effect of FOMC statements (미 연준 통화정책방향 의결문의 시그널링 효과 분석)

  • Woo, Shinwook;Chang, Youngjae
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.3
    • /
    • pp.321-334
    • /
    • 2020
  • The US Federal Reserve (Fed) has decided to cut interest rates. When we look at the expression of the FOMC statements at the time of policy change period we can understand that Fed has been communicating with markets through a change of word selection. However, there is a criticism that the method of analyzing the expression of the decision sentence through the context can be subjective and limited in qualitative analysis. In this paper, we evaluate the signaling effect of FOMC statements based on previous research. We analyze decision making characteristics from the viewpoint of text mining and try to predict future policy trend changes by capturing changes in expressions between statements. For this purpose, a decision tree and neural network models are used. As a result of the analysis, it can be judged that the discrepancy indicators between statements could be used to predict the policy change in the future and that the US Federal Reserve has systematically implemented policy signaling through the policy statements.

Monetary policy synchronization of Korea and United States reflected in the statements (통화정책 결정문에 나타난 한미 통화정책 동조화 현상 분석)

  • Chang, Youngjae
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.1
    • /
    • pp.115-126
    • /
    • 2021
  • Central banks communicate with the market through a statement on the direction of monetary policy while implementing monetary policy. The rapid contraction of the global economy due to the recent Covid-19 pandemic could be compared to the crisis situation during the 2008 global financial crisis. In this paper, we analyzed the text data from the monetary policy statements of the Bank of Korea and Fed reflecting monetary policy directions focusing on how they were affected in the face of a global crisis. For analysis, we collected the text data of the two countries' monetary policy direction reports published from October 1999 to September 2020. We examined the semantic features using word cloud and word embedding, and analyzed the trend of the similarity between two countries' documents through a piecewise regression tree model. The visualization result shows that both the Bank of Korea and the US Fed have published the statements with refined words of clear meaning for transparent and effective communication with the market. The analysis of the dissimilarity trend of documents in both countries also shows that there exists a sense of synchronization between them as the rapid changes in the global economic environment affect monetary policy.

무순 추출물의 생리활성 효과

  • 한진희;문혜경;김종국;김귀영;강우원
    • Proceedings of the Korean Society of Postharvest Science and Technology of Agricultural Products Conference
    • /
    • 2003.04a
    • /
    • pp.98-98
    • /
    • 2003
  • 무순에는 비타민 C가 많이 들어 있어 겨울철 비타민 공급원뿐만 아니라 디아스타제라는 효소가 들어 있어 소화를 촉진시키는 역할을 한다. 그 외에도 거담제 및 건위제 작용을 하고 음주로 인한 토혈해소, 천식에도 좋아 약용하기도 한다. 본 연구에서는 이용가치는 적지만 농가 소득증대에 기여 할 수 있으며 소화를 촉진시키는 무순, 또는 무싹기름이라고 일컬어지는 무순을 추출용매에 따라 생리활성 효과 분석하고 영양학적 가치가 가장 높은 시기의 무순을 선택함으로써 올바른 섭취의 기초자료를 마련하고 그 기능성을 확인하여 기능성 식품소재 및 기능성 화장품 소재로써의 활용을 검토하고자 하고자 한다. 무순을 4일, 8일, 12일에 따라 incubator에 배양하여 시기별로 채취하여 동결건조 한 후 70% Ethanol, 80% Methanol, 75% acetone, 열수로 환류 추출한 후 시료로 사용하였다. 각 용매 추출물에 대해 DPPH free radical 소거능 실험에서는 acetone 추출물에서 89.18%로 가장 높은 전자공여능을 나타냈으며 각각의 추출용매에서 성장 4일과 12일의 무순에서 높은 전자공여능을 보였다. 아질산염 소거능에서는 pH 1.2의 조건에서 가장 높은 아질산염 소거능을 보였고, 열수 추출물에서 89.70%로 가장 높은 소거능을 보였다. pH 4.2조건에서는 열수추출물의 소거능이 가장 좋았고, pH 6.0 조건에서는 가장 낮은 소거능을 보였으며, Ethanol 과 Methanol 추출물에서 23.55∼37.41%의 소거능을 보였다. SOD유사활성은 성장 8일에서 모두 낮은 활성을 보였으며, 성장 4일과 성장 12일의 무순에서는 큰 차이를 보이지 않았지만, Methanol 추출물중 성장 12일에서 27.41%의 SOD유사활성을 보였다.ic acid는 28.8∼51.7 mg%, 미강에서 321.4∼438.4 mg% 범위로 나타났다. 현미, 백미 및 미강에 함유된 총 폴리페놀의 함량을 표준 페놀화합물로 카테친을 사용하고 비색법에 의하여 측정하였을 때 오대 현미의 폴리페놀 함량은 78.4 mg%, 남평 현미 88.8 mg% 였다. 도정한 백미 중의 총 폴리페놀 함량은 30.3∼56.9 mg%, 미강이 541.5∼472.6 mg%의 범위였다. 이상과 같이 쌀에는 phenolic acid 및 총 폴리페놀이 상당량 함유되어 있으며 특히 배유보다는 강층에 많이 존재하므로 이들 성분의 효율적인 이용을 위한 쌀의 섭취방안이 필요한 것으로 나타났다. 유의적인 상관관계를 나타내고 있어 백편의 조직감은 Compression force 와 Work ratio로 대치할 수 있을 것이라고 사료된다. 수분함량은 기계적 검사보다 관능검사와 더욱 높은 상관관계를 나타냈다.내었다. 항균활성이 우수한 생약재를 농도별로 활성을 조사한 결과, 물 추출물과 10% Ethanol 추출물 모두 낮은 농도에서도 우수한 항균활성을 나타내었다.취와 함께 점질성 갈변물질이 생성되었다. 이와 같은 결과로 볼 때, BAAG의 처리는 BAAC의 경우보다 가격은 저렴하면서도 항균력은 우수한 천연 항균복합제재로써 농산물 식품원료에 적용하여 선도유지 기간을 연장할 수 있는 효과를 기대할 수 있었다. 과일 등의 포장제로서 이용할 가능성을 확인하였다.로 [-wh] 겹의문사는 복수 의미를 지닐 수 없 다. 그러면 단수 의미는 어떻게 생성되는가\ulcorner 본 논문에서는 표면적 형태에도 불구하고 [-wh]의미의 겹의문사는 병렬적 관계의 합성어가 아니라 내부구조를 지니지 않은 단순한 단어(minimal $X^{0}$ elem

  • PDF

International Comparison Study on Essential Concepts of Science Curriculum: Focus on the United States, Canada, Australia and England (과학과 교육과정의 핵심 개념 국제 비교 -미국, 캐나다, 호주, 영국을 중심으로-)

  • Kim, Jihyeon;Chung, Are Jun
    • Journal of The Korean Association For Science Education
    • /
    • v.37 no.1
    • /
    • pp.215-223
    • /
    • 2017
  • This study aims to find an effective way to present essential science concepts in national science curriculum through international comparisons. Next Generation Science Standard (US), Ontario Science Curriculum (Canada), Australia Science Curriculum, and British/English Science Curriculum were selected for comparison. In science curriculum documents, these countries used terms such as 'Key ideas,' 'Big ideas,' 'Key concepts,' 'Disciplinary core ideas.' and 'Fundamental concepts' to present essential concepts of science. This study reviewed the characteristics of the meaning, the status, and the role of essential concepts country by country. The result shows essential concepts have been used with different meanings and statutes in each case. Furthermore, various roles were performed through essential concepts in order to organize their science curriculum. From these foreign nation's cases, this study proposes several ways to present essential science concepts based on results. First, interdisciplinary integrated concepts were needed to organize an integrated science curriculum. In science curriculum documents of the United States, Canada, Australia and England, two types of terms were used in order to structuralize an integrated science curriculum. Second, essential concepts should include concepts related with function and value as well as scientific knowledge. Third, essential concepts need to be presented in such a way as to show specific contexts. Therefore, selecting appropriate contents and structure are needed to be able to improve the way to present essential concepts in Korea's educational environment.