• Title/Summary/Keyword: Korean Morphological Analysis

MADE: Morphological Analyzer Development Environment (MADE : 형태소 분석기 개발환경)

  • Shim, Kwang-Seob
    • Journal of Internet Computing and Services / v.8 no.4 / pp.159-171 / 2007
  • This paper proposes MADE, a software tool for developing practical Korean morphological analyzers. Morphological analysis is performed using adjacency conditions provided by a morphological dictionary, so developing a morphological analyzer reduces to constructing a morphological dictionary. No programming skill is required in this process, and MADE provides functions that facilitate dictionary construction. Once a dictionary is constructed, the morphological analysis engine embedded in MADE may be used as a stand-alone morphological analyzer or integrated into application software that requires a Korean morphological analysis module.

Linear-Time Korean Morphological Analysis Using an Action-based Local Monotonic Attention Mechanism

  • Hwang, Hyunsun;Lee, Changki
    • ETRI Journal / v.42 no.1 / pp.101-107 / 2020
  • For Korean language processing, morphological analysis is a critical component that requires extensive work. With a sequence-to-sequence model, this analysis can be conducted end to end without complicated feature design. However, a sequence-to-sequence model that uses an attention mechanism for high performance has a time complexity of O(n^2) for an input of length n. In this study, we propose a linear-time Korean morphological analysis model using a local monotonic attention mechanism that relies on monotonic alignment, which is a characteristic of Korean morphological analysis. The proposed model shows an extreme improvement in a single-threaded environment and a high morphometric F1-measure, even as a hard attention model that eliminates the attention mechanism formula.
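
The gap between O(n^2) and linear time can be made concrete by counting attention score computations. The sketch below is not the paper's model: it assumes one output step per input position, a pointer that advances exactly one position per step, and a hypothetical window of three positions.

```python
def global_attention_scores(n: int) -> int:
    """Global attention: each of n output steps scores all n input positions."""
    return n * n


def local_monotonic_scores(n: int, window: int = 3) -> int:
    """Local monotonic attention (simplified): each output step scores only a
    fixed-size window around a pointer that never moves backward."""
    half = window // 2
    total = 0
    for pointer in range(n):  # assumption: pointer advances one position per step
        lo = max(0, pointer - half)
        hi = min(n - 1, pointer + half)
        total += hi - lo + 1  # positions actually scored at this step
    return total


if __name__ == "__main__":
    for n in (10, 100, 1000):
        print(n, global_attention_scores(n), local_monotonic_scores(n))
```

With a fixed window the count grows in proportion to n, whereas the global count grows with n squared.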

A Morphological Study on the Modern Urbanization and Transformation Type of Urban Tissues in Kunsan (군산의 근대도시발달과정과 도시조직의 변화 유형에 관한 형태학적 연구)

  • Lee, Kyung-Chan;Huh, Joon
    • Journal of the Korean Institute of Landscape Architecture / v.32 no.6 s.107 / pp.36-51 / 2005
  • The purpose of this thesis is to analyse the modern urbanization process and the morphological transformation of the urban tissues in Kunsan between the years 1899 and 2001. The method of this study is to investigate the transformation process of morphological elements such as plot structure, building layout, building facades, land use, and exterior space structure and use, drawing on field surveys, analysis of the 1912 land registration maps, and various topological maps. The morphological analysis of modern Kunsan proceeds in three steps: typo-morphological analysis of urban tissue in the old-town area, interpretation of the morphological process, and analysis of the transformation process of the morphological structure in the Japanese concession in terms of the plot system. As a result, it is found that there is a cyclical relationship among the morphological transformation processes of morphological elements, plots, buildings, land uses, and access space to buildings. From the viewpoint of town plan change, the period of war damage restoration in the 1950s and the compressed growth period in the 1960s are important in the morphological process of the old-town area. In particular, the first building plan and layout type, together with plot form and structure, act as the main factor determining the subsequent plot transformation system, exterior space system, and the particular streetscape in Kunsan.

A Rule-Based Analysis from Raw Korean Text to Morphologically Annotated Corpora

  • Lee, Ki-Yong;Markus Schulze
    • Language and Information / v.6 no.2 / pp.105-128 / 2002
  • Morphologically annotated corpora are the basis for many tasks of computational linguistics. Most current approaches use statistically driven methods of morphological analysis, which provide just POS tags. While this is sufficient for some applications, many others need a rule-based full morphological analysis that also yields lemmatization and segmentation. This work thus aims at (1) introducing a rule-based Korean morphological analyzer called Kormoran, based on the principle of linearity that prohibits any combination of left-to-right or right-to-left analysis or backtracking, and then at (2) showing how it can be used as a POS tagger by adopting an ordinary preprocessing technique and by filtering out irrelevant morpho-syntactic information in analyzed feature structures. It is shown that, besides providing a basis for subsequent syntactic or semantic processing, full morphological analyzers like Kormoran have greater power to resolve ambiguities than simple POS taggers. The focus of our present analysis is on Korean text.

Transformer-based reranking for improving Korean morphological analysis systems

  • Jihee Ryu;Soojong Lim;Oh-Woog Kwon;Seung-Hoon Na
    • ETRI Journal / v.46 no.1 / pp.137-153 / 2024
  • This study introduces a new approach in Korean morphological analysis combining dictionary-based techniques with Transformer-based deep learning models. The key innovation is the use of a BERT-based reranking system, significantly enhancing the accuracy of traditional morphological analysis. The method generates multiple suboptimal paths, then employs BERT models for reranking, leveraging their advanced language comprehension. Results show remarkable performance improvements, with the first-stage reranking achieving over 20% improvement in error reduction rate compared with existing models. The second stage, using another BERT variant, further increases this improvement to over 30%. This indicates a significant leap in accuracy, validating the effectiveness of merging dictionary-based analysis with contemporary deep learning. The study suggests future exploration in refined integrations of dictionary and deep learning methods as well as using probabilistic models for enhanced morphological analysis. This hybrid approach sets a new benchmark in the field and offers insights for similar challenges in language processing applications.
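
The generate-then-rerank idea and the error reduction rate can be sketched as follows. This is a minimal illustration, not the paper's system: the linear interpolation of scores, the `weight` parameter, the stand-in `toy_scorer` (used here in place of a BERT model), and all numbers are hypothetical.

```python
from typing import Callable, List, Tuple

# (analysis string, first-stage score from a dictionary-based analyzer)
Candidate = Tuple[str, float]


def rerank(candidates: List[Candidate],
           scorer: Callable[[str], float],
           weight: float = 0.5) -> List[Candidate]:
    """Reorder n-best analyses by mixing the first-stage score with a second
    model's score (interpolation is an assumption, not the paper's formula)."""
    def combined(c: Candidate) -> float:
        return (1.0 - weight) * c[1] + weight * scorer(c[0])
    return sorted(candidates, key=combined, reverse=True)


def error_reduction_rate(base_error: float, new_error: float) -> float:
    """Fraction of the baseline's errors removed by the new system."""
    return (base_error - new_error) / base_error


def toy_scorer(analysis: str) -> float:
    """Stand-in for a neural scorer: prefers the verb reading."""
    return 1.0 if "/VV" in analysis else 0.0


if __name__ == "__main__":
    # Two readings of the eojeol "나는": pronoun + particle vs. verb + ending.
    n_best = [("나/NP+는/JX", 0.6), ("날/VV+는/ETM", 0.4)]
    print(rerank(n_best, toy_scorer)[0][0])
    print(error_reduction_rate(base_error=0.04, new_error=0.03))
```

Under this definition, a 20% error reduction rate means one in five of the baseline's errors is removed, not that accuracy rises by 20 percentage points.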

Semi-Automatic Construction of Morphological Pattern Dictionary using the Method of Morphological Synthesis (형태소 합성 기법을 이용한 형태소 패턴 사전의 반자동 구축)

  • Park, In-Cheol
    • Journal of the Korea Academia-Industrial cooperation Society / v.12 no.11 / pp.5278-5283 / 2011
  • One approach to very high-speed Korean morphological analysis is to store pre-built morphological results in a dictionary. Building this morphological pattern dictionary manually is costly, and the resulting dictionary may contain errors. This paper proposes a method to generate morphological patterns automatically using Korean morphological synthesis. The experiment shows that the method automatically generates 86% of the morphological patterns needed to analyze Korean sentences. A morphological analysis system using these patterns takes 52.68 seconds to analyze a 403 MB Korean corpus on a 2.8 GHz Windows system.

Syllable-based Probabilistic Models for Korean Morphological Analysis (한국어 형태소 분석을 위한 음절 단위 확률 모델)

  • Shim, Kwangseob
    • Journal of KIISE / v.41 no.9 / pp.642-651 / 2014
  • This paper proposes three probabilistic models for syllable-based Korean morphological analysis and presents their performance. Probabilities for the models are acquired from a POS-tagged corpus. The results of 10-fold cross-validation experiments show that a 98.3% answer inclusion rate is achieved when the models are trained on the Sejong POS-tagged corpus of 10 million eojeols. In our models, POS tags are assigned to each syllable before spelling recovery and morpheme generation, which enables more efficient morphological analysis than previous probabilistic models, where spelling recovery is performed in the first stage. This efficiency speeds up morphological analysis. Experiments show that morphological analysis is performed at a rate of 147K eojeols per second, which is almost 174 times faster than previous probabilistic models for Korean morphology.
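
The two speed figures in this abstract imply a rate for the earlier models. The arithmetic below is a rough check that reads "147K" as 147,000 and treats "almost 174 times" as exactly 174; both readings are assumptions.

```python
reported_rate = 147_000      # eojeols per second, reading "147K" as 147,000
speedup = 174                # "almost 174 times faster", taken as exactly 174
corpus_size = 10_000_000     # eojeols in the training corpus cited in the abstract

# Rate implied for the previous probabilistic models.
implied_previous_rate = reported_rate / speedup

# Time to process a corpus of that size at the reported rate.
seconds_for_corpus = corpus_size / reported_rate

print(f"{implied_previous_rate:.0f} eojeols/s")   # about 845
print(f"{seconds_for_corpus:.1f} s")              # about 68.0
```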

Morphological Analysis among Populations of Purpulish Washington Clam, Saxidomus purpuratus on the Korean Waters

  • Kim, Yeong-Hye;Ryu, Dong-Ki;Lee, Dong-Woo;Chang, Dae-Soo;Kim, Jong-Bin;Kim, Seong-Tae;Kwon, Dae-Hyeon
    • The Korean Journal of Malacology / v.22 no.1 s.35 / pp.23-26 / 2006
  • Morphological differences among three populations of Saxidomus purpuratus in Korean waters were studied using analysis of variance between various partial lengths and shell length. The relative growth equations SH-SL, SW-SL, and TW-SL of S. purpuratus were estimated by sex. The analysis of variance of four morphological characters showed that each population has no sexual differences (p>0.01), but the three populations differ significantly in morphological characters (p<0.01).

High Speed Korean Morphological Analysis based on Adjacency Condition Check (인접 조건 검사에 의한 초고속 한국어 형태소 분석)

  • 심광섭;양재형
    • Journal of KIISE:Software and Applications / v.31 no.1 / pp.89-99 / 2004
  • This paper proposes a method that performs morphological analysis by checking conditions between two adjacent morphemes. These conditions are supplied by a dictionary. The method eliminates the code conversion module and the application of transformational rules for candidate generation. Very high-speed morphological analysis is attainable because the adjacency condition check uses only simple bit operations. MACH, an implementation of the proposed method, is a supersonic Korean morphological analyzer that can analyze a 1 GB document in 5 minutes on a PC with a 1.13 GHz Pentium III CPU. The analysis accuracy of MACH is 99.2%.
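
An adjacency condition check by bit operations can be sketched as below. This is not MACH's implementation: the class bits, the toy `LEXICON`, the romanized surface forms, and the `segment` helper are all hypothetical, chosen only to show that each check is a single bitwise AND against dictionary data.

```python
from typing import List

# Hypothetical connection classes, one bit each (not taken from MACH).
NOUN, VERB_STEM, PARTICLE, ENDING, END = 1, 2, 4, 8, 16

# Toy dictionary: romanized form -> (own class bit, mask of what may follow).
# The END bit in a mask means the morpheme may close the word.
LEXICON = {
    "hakgyo": (NOUN, PARTICLE | END),  # "school"
    "e": (PARTICLE, END),              # locative particle
    "ga": (VERB_STEM, ENDING),         # "go": needs an ending
    "nda": (ENDING, END),              # declarative ending
}


def can_follow(left: str, right: str) -> bool:
    """Adjacency check: one AND between left's mask and right's class bit."""
    return (LEXICON[left][1] & LEXICON[right][0]) != 0


def segment(word: str) -> List[List[str]]:
    """Return all splits of `word` into dictionary morphemes in which every
    adjacent pair passes the check and the last morpheme may end the word."""
    results: List[List[str]] = []

    def extend(pos: int, path: List[str]) -> None:
        if pos == len(word):
            if path and LEXICON[path[-1]][1] & END:
                results.append(path)
            return
        for end in range(pos + 1, len(word) + 1):
            piece = word[pos:end]
            if piece not in LEXICON:
                continue
            if path and not can_follow(path[-1], piece):
                continue
            extend(end, path + [piece])

    extend(0, [])
    return results


if __name__ == "__main__":
    print(segment("hakgyoe"))    # noun + particle
    print(segment("hakgyonda"))  # noun + verbal ending is rejected
```

Because the conditions live in the dictionary entries, changing the grammar in this sketch means editing masks rather than code, which mirrors the dictionary-driven design the abstract describes.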

Population Analysis of the Common Squid, Todarodes pacificus Steenstrup in Korean Waters 2. Morphological analysis (한국해역에 분포하는 오징어의 계군분석 2. 형태학적 분석)

  • KIM Yeong-hye;KANG Yong-joo;BAIK Chul-in
    • Korean Journal of Fisheries and Aquatic Sciences / v.30 no.5 / pp.903-905 / 1997
  • Morphological differences among cohorts of the common squid, Todarodes pacificus, in Korean waters were studied using analysis of covariance between various partial lengths and mantle length. Analysis of seven morphological characters showed no sexual differences within each cohort, except in the summer cohort, which shows significant sexual differences in the relative growth between mantle length and body weight. The three cohorts differ significantly in morphological characters.
