• 제목/요약/키워드: Linguistic processing

검색결과 167건 처리시간 0.025초

Development of a Traceability Analysis Method Based on Case Grammar for NPP Requirement Documents Written in Korean Language

  • Yoo Yeong Jae;Seong Poong Hyun;Kim Man Cheol
    • Nuclear Engineering and Technology
    • /
    • 제36권4호
    • /
    • pp.295-303
    • /
    • 2004
  • Software inspection is widely believed to be an effective method for software verification and validation (V&V). However, software inspection is labor-intensive and, since it uses little technology, software inspection is viewed upon as unsuitable for a more technology-oriented development environment. Nevertheless, software inspection is gaining in popularity. KAIST Nuclear I&C and Information Engineering Laboratory (NICIEL) has developed software management and inspection support tools, collectively named "SIS-RT. "SIS-RT is designed to partially automate the software inspection processes. SIS-RT supports the analyses of traceability between a given set of specification documents. To make SIS-RT compatible for documents written in Korean, certain techniques in natural language processing have been studied [9]. Among the techniques considered, case grammar is most suitable for analyses of the Korean language [3]. In this paper, we propose a methodology that uses a case grammar approach to analyze the traceability between documents written in Korean. A discussion regarding some examples of such an analysis will follow.

On a Novel Way of Processing Data that Uses Fuzzy Sets for Later Use in Rule-Based Regression and Pattern Classification

  • Mendel, Jerry M.
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제14권1호
    • /
    • pp.1-7
    • /
    • 2014
  • This paper presents a novel method for simultaneously and automatically choosing the nonlinear structures of regressors or discriminant functions, as well as the number of terms to include in a rule-based regression model or pattern classifier. Variables are first partitioned into subsets each of which has a linguistic term (called a causal condition) associated with it; fuzzy sets are used to model the terms. Candidate interconnections (causal combinations) of either a term or its complement are formed, where the connecting word is AND which is modeled using the minimum operation. The data establishes which of the candidate causal combinations survive. A novel theoretical result leads to an exponential speedup in establishing this.

Recent Progresses in the Linguistic Modeling of Biological Sequences Based on Formal Language Theory

  • Park, Hyun-Seok;Galbadrakh, Bulgan;Kim, Young-Mi
    • Genomics & Informatics
    • /
    • 제9권1호
    • /
    • pp.5-11
    • /
    • 2011
  • Treating genomes just as languages raises the possibility of producing concise generalizations about information in biological sequences. Grammars used in this way would constitute a model of underlying biological processes or structures, and that grammars may, in fact, serve as an appropriate tool for theory formation. The increasing number of biological sequences that have been yielded further highlights a growing need for developing grammatical systems in bioinformatics. The intent of this review is therefore to list some bibliographic references regarding the recent progresses in the field of grammatical modeling of biological sequences. This review will also contain some sections to briefly introduce basic knowledge about formal language theory, such as the Chomsky hierarchy, for non-experts in computational linguistics, and to provide some helpful pointers to start a deeper investigation into this field.

분산 환경에서의 지능형 멀티미디어 교육 시스템 (Intelligent Multimedia Educational System on Distributed Environment)

  • 이세훈;윤경섭
    • 한국정보처리학회논문지
    • /
    • 제6권5호
    • /
    • pp.1323-1331
    • /
    • 1999
  • This paper suggests a multimedia educational system which has the ability to extract intelligent instruction on the distributed environment. The proposed system is designed for supporting individual instruction and real time user interaction. As the system based on CORBA, we put group managing module on it for multi user environment, so it has ability for distributed computing facilities. Using MHEG standard, we can provide multimedia courseware and real time user interaction. To diagnose students' responses and generate evaluations, we use several linguistic variables of fuzzy theory. There are two major advantages for using this system. This system can provide dynamic generation of problems and the ability to provide a dynamic instruction strategy. And it can increase reusability of courseware material for using standard of multimedia representation and communication. We use CORBA and MHEG to overcome the disadvantage of the Web, passive protocol and poor interactivity, HTTP.

  • PDF

An Intuitionistic Fuzzy Approach to Classify the User Based on an Assessment of the Learner's Knowledge Level in E-Learning Decision-Making

  • Goyal, Mukta;Yadav, Divakar;Tripathi, Alka
    • Journal of Information Processing Systems
    • /
    • 제13권1호
    • /
    • pp.57-67
    • /
    • 2017
  • In this paper, Atanassov's intuitionistic fuzzy set theory is used to handle the uncertainty of students' knowledgeon domain concepts in an E-learning system. Their knowledge on these domain concepts has been collected from tests that were conducted during their learning phase. Atanassov's intuitionistic fuzzy user model is proposed to deal with vagueness in the user's knowledge description in domain concepts. The user model uses Atanassov's intuitionistic fuzzy sets for knowledge representation and linguistic rules for updating the user model. The scores obtained by each student were collected in this model and the decision about the students' knowledge acquisition for each concept whether completely learned, completely known, partially known or completely unknown were placed into the information table. Finally, it has been found that the proposed scheme is more appropriate than the fuzzy scheme.

Microblog Sentiment Analysis Method Based on Spectral Clustering

  • Dong, Shi;Zhang, Xingang;Li, Ya
    • Journal of Information Processing Systems
    • /
    • 제14권3호
    • /
    • pp.727-739
    • /
    • 2018
  • This study evaluates the viewpoints of user focus incidents using microblog sentiment analysis, which has been actively researched in academia. Most existing works have adopted traditional supervised machine learning methods to analyze emotions in microblogs; however, these approaches may not be suitable in Chinese due to linguistic differences. This paper proposes a new microblog sentiment analysis method that mines associated microblog emotions based on a popular microblog through user-building combined with spectral clustering to analyze microblog content. Experimental results for a public microblog benchmark corpus show that the proposed method can improve identification accuracy and save manually labeled time compared to existing methods.

정보추출을 위한 고유명사 및 대용어 태깅 (Named Entity and Coreference Tagging for Information Extraction)

  • 장성호;강승식;우종우;윤보현
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2002년도 춘계학술발표논문집 (하)
    • /
    • pp.1111-1114
    • /
    • 2002
  • 최근 정보추출에 대한 중요성이 점차 증가하면서 정보추출에서 필요로 하는 Named Entity와 Coreference, Information Extraction, Information Retrieval의 소개와 한국어에 대해 적용시키기 위한 정의와 방법을 제시한다. 또한, 대량의 문서에 대한 태깅을 효율적으로 수행할 수 있도록 Named Entity와 Coreference 태깅을 쉽게 할 수 있는 NE-CO 태깅 도구를 개발하였다. 이 태깅 도구를 이용하여 시험적으로 경제, 공연, 여행 분야의 300문서에 대한 말뭉치를 구축하였으며, 이 말뭉치는 한국어 정보추출 시스템을 개발하는데 기초 자료로서 활용될 예정이다.

  • PDF

언어 인지 실험을 통한 외국어 능력 측정 방법 (A method for Measuring Second Language Ability based on linguistic cognitive experiments)

  • 양영욱;이새벽;임희석
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2012년도 추계학술발표대회
    • /
    • pp.362-363
    • /
    • 2012
  • 외국어 능력이 현대 사회에서 요구하는 필수적인 요소 중에 하나이다. 본 논문에서는 기존의 외국어 능력을 평가하는 능력 시험이 아닌 언어심리학적 관점으로 외국어 능력을 평가하는 방법을 제안한다. 외국어를 처리하는데 있어서 외국어를 모국어를 바꾸는 언어인지 과정이 필요하다. 본 논문에서는 이러한 언어 인지 능력을 측정하는 Reading LDT, Listening LDT, Verbal Span, Yes-No task(Semantic), Same-Different task실험을 제안한다. 해당 과제들은 각각 피험자의 읽기, 듣기, 기억, 의미적 결정, 변환 능력을 측정하는 과제이다.

Image Understanding for Visual Dialog

  • Cho, Yeongsu;Kim, Incheol
    • Journal of Information Processing Systems
    • /
    • 제15권5호
    • /
    • pp.1171-1178
    • /
    • 2019
  • This study proposes a deep neural network model based on an encoder-decoder structure for visual dialogs. Ongoing linguistic understanding of the dialog history and context is important to generate correct answers to questions in visual dialogs followed by questions and answers regarding images. Nevertheless, in many cases, a visual understanding that can identify scenes or object attributes contained in images is beneficial. Hence, in the proposed model, by employing a separate person detector and an attribute recognizer in addition to visual features extracted from the entire input image at the encoding stage using a convolutional neural network, we emphasize attributes, such as gender, age, and dress concept of the people in the corresponding image and use them to generate answers. The results of the experiments conducted using VisDial v0.9, a large benchmark dataset, confirmed that the proposed model performed well.

Individual differences in categorical perception: L1 English learners' L2 perception of Korean stops

  • Kong, Eun Jong
    • 말소리와 음성과학
    • /
    • 제11권4호
    • /
    • pp.63-70
    • /
    • 2019
  • This study investigated individual variability of L2 learners' categorical judgments of L2 stops by exploring English learners' perceptual processing of two acoustic cues (voice onset time [VOT] and f0) and working memory capacity as sources of variation. As prior research has reported that English speakers' greater use of the redundant cue f0 was responsible for gradient processing of native stops, we examined whether the same processing characteristics would be observed in L2 learners' perception of Korean stops (/t/-/th/). 22 English learners of L2 Korean with a range of L2 proficiency participated in a visual analogue scaling task and demonstrated variable manners of judging the L2 Korean stops: Some were more gradient than others in performing the task. Correlation analysis revealed that L2 learners' categorical responses were modestly related to individuals' utilizations of a primary cue for the stop contrast (VOT for L1 English stops and f0 for L2 Korean stops), and were also related to better working memory capacity. Together, the current experimental evidence demonstrates adult L2 learners' top-down processing of stop consonants where linguistic and cognitive resources are devoted to a process of determining abstract phonemic identity.