• Title/Summary/Keyword: syntactic structures

Search Result 92, Processing Time 0.023 seconds

A Study on Keyword Extraction and Expansion for Web Text Retrieval (웹 문서 검색을 위한 검색어 추출과 확장에 관한 연구)

  • Yoon, Sung-Hee
    • Journal of the Korea Computer Industry Society
    • /
    • v.5 no.9
    • /
    • pp.1111-1118
    • /
    • 2004
  • Natural language query is the best user interface for the users of web text retrieval systems. This paper proposes a retrieval system with expanded keyword from syntactically-analyzed structures of user's natural language query based on natural language processing technique. Through the steps combining or splitting the compound nouns based on syntactic tree traversal, and expanding the other-formed or shorten-formed keyword into multiple keyword, it shows that precision and correctness of the retrieval system was enhanced.

  • PDF

The Semantic Structure and Argument Realization of Korean Passive Verbs (한국어 피동동사의 의미구조와 논항실현)

  • 김윤신;이정민;강범모;남승호
    • Korean Journal of Cognitive Science
    • /
    • v.11 no.1
    • /
    • pp.25-32
    • /
    • 2000
  • Korean passive verbs are derived from their corresponding active verbs by suffixation or by adding endings and auxiliaries to their stems. Therefore. we assume p passive verbs share some lexical informations with their active counterparts. This paper extending the Generative Lexicon theory of Pustejovsky (995). aims to characterize the argument realization patterns of Korean passive verbs focusing on the case alternation a and to propose their lexical semantic structures which account for the syntactic behavior.

  • PDF

Efficient Analysis of Korean Dependency Structures Using Beam Search Algorithms (Beam Search 알고리즘을 이용한 효율적인 한국어 의존 구조 분석)

  • Kim, Hark-Soo;Seo, Jung-Yun
    • Annual Conference on Human and Language Technology
    • /
    • 1998.10c
    • /
    • pp.281-286
    • /
    • 1998
  • 구문분석(syntactic analysis)은 형태소 분석된 결과를 입력으로 받아 구문단위간의 관계를 결정해 주는 자연어 처리의 한 과정이다. 그러나 구문분석된 결과는 많은 중의성(ambiguity)을 갖게 되며, 이러한 중의성은 이후의 자연어 처리 수행과정에서 많은 복잡성(complexity)를 유발하게 된다. 지금까지 이러한 문제를 해결하기 위한 여러 가지 연구들이 있었으며, 그 중 하나가 대량의 데이터로부터 추출된 통계치를 이용한 방법이다. 그러나, 생성된 모든 구문 트리(parse tree)에 통계치를 부여하고, 그것들을 순위화하는 것은 굉장히 시간 소모적인 일(time-consuming job)이다. 그러므로, 생성 가능한 트리의 수를 효과적으로 줄이는 방법이 필요하다. 본 논문에서는 이러한 문제를 해결하기 위해 개선된 beam search 알고리즘을 제안하고, 기존의 방법과 비교한다. 본 논문에서 제안된 beam search 알고리즘을 사용한 구문분석기는 beam search를 사용하지 않은 구문분석기가 생성하는 트리 수의 1/3정도만으로도 같은 구문 구조 정확률을 보였다.

  • PDF

사동화에 의한 논항구조와 사건구조와 변화

  • 김윤신
    • Proceedings of the Korean Society for Language and Information Conference
    • /
    • 2001.06a
    • /
    • pp.25-58
    • /
    • 2001
  • This study explores the lexical-semantic structure of derived causative verbs in Korean based on Pustejovsky(1995)'s Generative Lexicon Theory (GL), Mor-phological causative verbs are derived from their root stems by affixing ‘-i, -hi, -li, -gi’ in Korean and the meanings of derived predicates are closely related to the meanings of their root verbs. In particular, the change of the ARGUMENT STRUCTURE by morphological derivation leads to the change of the EVENT STRUCTURE. In this study, causation is defined as the cause-effect relation having a causer. The ARGUMENT STRUCTURES of derived causative verbs includes a causer argument, which is added to the ARGUMENT STRUCTURE of their root verbs. Their EVENT STRUCTURE has a headed process related to a causer and their result is the event which their root verbs represent. This approach can also suggest that the (in)directness of causative is determined by which verb is its root and explain the difference between the morphological causativization and the syntactic causativization in Korean.

  • PDF

Ambiguity Types of the Homonymic & Heterographic Units for Improving Korean Voice Recognition System - a Preliminary Research (한국어 음성인식 시스템 향상을 위한 동음이철 단위의 중의성 유형 분류)

  • Yoon, Ae-Sun;Kang, Mi-Young
    • Speech Sciences
    • /
    • v.15 no.4
    • /
    • pp.67-81
    • /
    • 2008
  • The accuracy rate of P2G (Phoneme-to-Grapheme) is one of the important factors determining the quality of unlimited voice recognition (VR) systems. Few studies were, however, conducted to reduce ambiguities of a phoneme string which can be segmented into a variety of different linguistic units (i.e. morphemes, words, eo-jeols), thus be transformed into more than one grapheme string. This paper is a preliminary research for building a large knowledge base of those homonymic & heterographic units(HHUs), which will provide unlimited Korean VR systems with more accurate P2G information. This paper analyzes 2 main factors generating HHUs: (1) boundary determination of the prosodic unit; (2) its segmentation into linguistic units. In this paper, linguistic characteristics determining variable boundaries of a prosodic unit are investigated, and the ambiguity types of HHUs are classified in accordance with their morphological and syntactic structures as well as with the phonological rules governing them.

  • PDF

A model of listening comprehension process and the teaching of spoken English (청취이해과정의 모형과 영어의 구어교육)

  • Kim, Dae-Won
    • Speech Sciences
    • /
    • v.8 no.4
    • /
    • pp.185-191
    • /
    • 2001
  • This study was designed to determine what components of spoken language have been relatively neglected in the teaching of listening comprehension in Korea and to suggest a model of listening process. Two types of tests were undertaken using spoken and written forms of English with secondary school teachers of English and college students. Findings: Hearing power has been generally neglected in the teaching of listening comprehension. Hearing power which can be thought as an active process is defined as an ability to transfer the sequence of discrete phonetic segments without word boundary into the sequence of words in phonemic representations by using both nonlinguistic factors and linguistic factors including perception rules based on phonetics and phonology. Vocabularies, hearing-speaking power, syntactic structures and idiomatic expressions are to be taught for spoken English. A model of listening process was suggested and discussed.

  • PDF

A Comparative Study on French Intonation between French and Korean Learners (불어 원어민과 한국인 불어 학습자의 억양 비교 연구)

  • Kim, Hyun-gi
    • Speech Sciences
    • /
    • v.1
    • /
    • pp.27-38
    • /
    • 1997
  • The differences in French Intonation between French and Korean learners can be applied to French intonation education. One native French speaker and three native Korean speakers who learned French language at High school were selected for this study. The subjects spoke test phrases based on the different syntactic structures. High-Speed speech Analysis system(RILP) was used for this experiment. The different intonation curves were showed at the end of phrase and at the beginning of phrase between French and Korean learners. At the end of phrases, French intonation appeared to have increasing and decending pitch contours in the case of wh-question, exclamation and finality. However, Korean learner's intonation showed only increasing pitch contours. At the beginning of phrase, French intonation shows decending pitch contours in the case of minor continuation and command. In contrast, Korean learner's intonation appeared to have increasing pitch contours. The new intonation training system using PC can have great effect on education of French as a second language.

  • PDF

Construction of Korean Linguistic Information for the Korean Generation on KANT (Kant 시스템에서의 한국어 생성을 위한 언어 정보의 구축)

  • Yoon, Deok-Ho
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.12
    • /
    • pp.3539-3547
    • /
    • 1999
  • Korean linguistic information for the generation modulo of KANT(Knowledge-based Accurate Natural language Translation) system was constructed. As KANT has a language-independent generation engine, the construction of Korean linguistic information means the development of the Korean generation module. Constructed information includes concept-based mapping rules, category-based mapping rules, syntactic lexicon, template rules, grammar rules based on the unification grammar, lexical rules and rewriting rules for Korean. With these information in sentences were successfully and completely generated from the interlingua functional structures among the 118 test set prepared by the developers of KANT system.

  • PDF

Building a Rule-Based Goal-Model from the IEC 62304 Standard for Medical Device Software

  • Kim, DongYeop;Lee, Byungjeong;Lee, Jung-Won
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.8
    • /
    • pp.4174-4190
    • /
    • 2019
  • IEC 62304 is a standard for the medical device software lifecycle. Developers must develop software that complies with all specifications in the standard for licensing. However, because the standard contains not only a large number of specifications, but also domain-specific information and association relationships between specifications, it requires considerable effort and time for developers to understand and interpret the standard. To support developers, this paper presents a method for extracting the contents of the IEC 62304 standard as a goal model, which is the core methodologies of requirements engineering. The proposed method analyzes the grammar of the standard to robustly extract complex structures and various information from standard specifications and define rules that extract goals and links from syntactic element units. We validated the actual extraction process for the standard document experimentally. Based on the extracted goal model, developers can intuitively and efficiently comply with the standard and track specific information within the medical software and standard domains.

A Comparative Study on the Intransitive Verb Alternation of English and Korean in the Aspectual Event Syntax

  • Khym, Han-Gyoo
    • International journal of advanced smart convergence
    • /
    • v.6 no.4
    • /
    • pp.41-49
    • /
    • 2017
  • In this paper I applies Borer (1993)'s way of classifying English intransitive action verbs such as 'run', walk, among many others, to the corresponding Korean intransitive action verbs such as 'tali-ta' and 'keət-ta', and show how they are different from - or similar with - each other in terms of syntactic structures and verb classification. Unlike the English verb 'run' which can be classified into an unaccusative verb as well as an unergative verb in Borer's theory, the corresponding Korean verbs 'tali-ta' or 't'wi-ta' can behave not only as an unergative and unauucsative verb, but also it can behave as a transitive verb. Though Borer's perspective on classification of verb types may be thought of as somewhat radical mostly due to its heavy dependency on aspectual representation of a whole sentence which a verb is just part of, it is clearly suggesting a new and great insight into the controversial topic of classification of verb types. So it is worth adopting this insightful perspective for the analysis of corresponding Korean verbs and seeing if it also works for the Korean ones.