• Title/Summary/Keyword: 의미적 연관성

Search Result 848, Processing Time 0.026 seconds

OWL Authoring System for building Web Ontology (웹 온톨로지 구축을 위한 OWL 저작 시스템)

  • Lee Moohun;Cho Hyunkyu;Cho Hyeonsung;Cho Sunghoon;Jang Changbok;Choi Euiin
    • The Journal of Society for e-Business Studies
    • /
    • v.10 no.3
    • /
    • pp.21-36
    • /
    • 2005
  • Current web search includes a lot of different results with information that user does not want, because it searches information using keyword mapping. Ontology can describe the correct meaning of web resource and relationships between web resources. And we can extract suitable information that user wants using Ontology Accordingly, we need the ontology to represent knowledge. W3C announced OWL(Web Ontology Language), meaning description technology for such web resource. However, the development of a special tool that can effectively compose and edit OWL is inactive. In this paper, we designed and developed an OWL authoring system that can effectively provide the generation and edit about OWL.

  • PDF

준해양사고 데이터의 실효성에 관한 정량적 고찰

  • Gang, Seok-Yong;No, Beom-Seok
    • Proceedings of the Korean Institute of Navigation and Port Research Conference
    • /
    • 2018.05a
    • /
    • pp.50-52
    • /
    • 2018
  • 준해양사고는 해양사고를 제외하고 선박의 구조, 설비 또는 운용과 관련하여 시정 또는 개선되지 않을 경우, 선박과 사람의 안전 및 해양환경 등에 위해를 끼치거나 위해를 끼칠 수 있는 사고를 의미하며, 이를 통해 사고를 사전에 예방하는 제도를 준해양사고제도라 한다. 우리나라는 2010년부터 국제해사기구의 권고에 따라 본 제도를 도입하였고, 다각적인 방법을 통해 활성화를 위하여 노력하고 있다. 하지만 8년이 지난 지금도 본 제도는 좀처럼 활성화되지 못하고 있으며 해운선사의 자발적인 참여가 미흡한 실정이다. 이에 본 연구는 준해양사고와 해양사고 데이터를 다각도로 분석하여 연관성을 정량적으로 검증하고자 노력하였고, 동시에 준해양사고제도의 운영이 해양사고를 예방하는데 도움이 됨을 입증하고자 하였다. 이를 위해 준해양사고와 해양사고를 다각도로 비교 분석하여 연관성을 검토해보았고, 그 결과 지금까지의 준해양사고 건수 증가 이후에 해양사고가 증가한다는 일반적 견해에 반하여 준해양사고 건수 증가는 해양사고 증가에 후행하여 나타날 수 있는 가능성에 주목하였다.

  • PDF

A Query Processing Method for Hierarchical Structured e-Learning System (계층적으로 구조화된 이러닝 시스템을 위한 질의 처리 기법)

  • Kim, Youn-Hee;Kim, Jee-Hyun
    • Journal of the Korea Society of Computer and Information
    • /
    • v.16 no.3
    • /
    • pp.189-201
    • /
    • 2011
  • In this paper, we design an ontology which provides interoperability by integrating typical metadata specifications and defines concepts and semantic relations between concepts that are used to describe metadata for learning objects in university courses. And we organize a hierarchical structured e-Learning system for efficient retrieval of learning objects on many local storages that use different specifications to describe metadata and propose a query processing method based on inferences. The proposed e-Learning system can provide more accurate and satisfactory retrieval service by using the designed ontology because both learning objects that be directly connected to user queries and deduced learning objects that be semantically connected to them are retrieved.

WV-BTM: A Technique on Improving Accuracy of Topic Model for Short Texts in SNS (WV-BTM: SNS 단문의 주제 분석을 위한 토픽 모델 정확도 개선 기법)

  • Song, Ae-Rin;Park, Young-Ho
    • Journal of Digital Contents Society
    • /
    • v.19 no.1
    • /
    • pp.51-58
    • /
    • 2018
  • As the amount of users and data of NS explosively increased, research based on SNS Big data became active. In social mining, Latent Dirichlet Allocation(LDA), which is a typical topic model technique, is used to identify the similarity of each text from non-classified large-volume SNS text big data and to extract trends therefrom. However, LDA has the limitation that it is difficult to deduce a high-level topic due to the semantic sparsity of non-frequent word occurrence in the short sentence data. The BTM study improved the limitations of this LDA through a combination of two words. However, BTM also has a limitation that it is impossible to calculate the weight considering the relation with each subject because it is influenced more by the high frequency word among the combined words. In this paper, we propose a technique to improve the accuracy of existing BTM by reflecting semantic relation between words.

Statistical Word Sense Disambiguation based on using Variant Window Size (가변길이 윈도우를 이용한 통계 기반 동형이의어의 중의성 해소)

  • Park, Gi-Tae;Lee, Tae-Hoon;Hwang, So-Hyun;Lee, Hyun Ah
    • Annual Conference on Human and Language Technology
    • /
    • 2012.10a
    • /
    • pp.40-44
    • /
    • 2012
  • 어휘가 갖는 의미적 중의성은 자연어의 특성 중 하나로 자연어 처리의 정확도를 떨어트리는 요인으로, 이러한 중의성을 해소하기 위해 언어적 규칙과 다양한 기계 학습 모델을 이용한 연구가 지속되고 있다. 의미적 중의성을 가지고 있는 동형이의어의 의미분별을 위해서는 주변 문맥이 가장 중요한 자질이 되며, 자질 정보를 추출하기 위해 사용하는 문맥 창의 크기는 중의성 해소의 성능과 밀접한 연관이 있어 신중히 결정되어야 한다. 본 논문에서는 의미분별과정에 필요한 문맥을 가변적인 크기로 사용하는 가변길이 윈도우 방식을 제안한다. 세종코퍼스의 형태의미분석 말뭉치로 학습하여 12단어 32,735문장에 대해 실험한 결과 용언의 경우 평균 정확도 92.2%로 윈도우를 고정적으로 사용한 경우에 비해 향상된 결과를 보였다.

  • PDF

Processing Multi-Valued Attributes in Association Rules for Data Mining (데이터 마이닝을 위한 연관규칙의 다중 값 속성 처리방법)

  • 김산성;김명원
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2002.10d
    • /
    • pp.340-342
    • /
    • 2002
  • 다중 값이란 속성 값이 집합인 것을 말한다. 즉, 관계형 데이터베이스에서 자료 유형이 집합인 속성을 의미한다. 이러한 다중 값 속성 처리는 기존 데이터마이닝 기술 자체로는 처리한 수 없으며 후처리나 선처리 과정을 이용하여 처리하고 있다. 전처리나 후처리 과정을 통해 처리할 경우 수행과장에 있어 많은 시간이 소요되고 혹은 타당하지 않은 규칙이 생성되는 문제점을 가지고 있다. 특히 연관화 기법 특성상 분석하고자 할 항목이 증가할수록 연관성의 수가 지수(exponential)단위이기 때문에 이를 해결하는데는 상당한 어려움이 따르게 된다. 본 논문에서는 관계형 데이터베이스 테이블 구조에서 데이터 마이닝의 수행을 위한 전처리나 후처리의 과정을 고려하지 않음으로 위에서 언급된 문제점들을 해결하고자 한다. 특히 데이터 변환 작업 없이 정량적(Quantitative)연관 규칙과 연관 규칙(Market Basket Analysis)의 혼합 형태의 규칙을 생성할 수 있게끔 알고리즘을 확장하여 보다 효율적인 규칙이 생성될 수 있도록 한다. 마지막으로 Each Movie 데이터를 사용하여 확장한 알고리즘의 다중 값 속성 처리 방법의 효율성과 타탕성을 검증한다.

  • PDF

The application for predictive similarity measures of binary data in association rule mining (이분형 예측 유사성 측도의 연관성 평가 기준 적용 방안)

  • Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.22 no.3
    • /
    • pp.495-503
    • /
    • 2011
  • The most widely used data mining technique is to find association rules. Association rule mining is the method to quantify the relationship between each set of items in very huge database based on the association thresholds. There are some basic association thresholds to explore meaningful association rules ; support, confidence, lift, etc. Among them, confidence is the most frequently used, but it has the drawback that it can not determine the direction of the association. The net confidence and the attributably pure confidence were developed to compensate for this drawback, but they have other drawbacks.In this paper we consider some predictive similarity measures for binary data in cluster analysis and multi-dimensional analysis as association threshold to compensate for these drawbacks. The comparative studies with net confidence, attributably pure confidence, and some predictive similarity measures are shown by numerical example.

Association rule ranking function by decreased lift influence (향상도 영향 감소화에 의한 연관성 순위결정함수)

  • Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.3
    • /
    • pp.397-405
    • /
    • 2010
  • Data mining is the method to find useful information for large amounts of data in database, and one of the important goals is to search and decide the association for several variables. The task of association rule mining is to find certain association relationships among a set of data items in a database. There are three primary measures for association rule, support and confidence and lift. In this paper we developed a association rule ranking function by decreased lift influence to generate association rule for items satisfying at least one of three criteria. We compared our function with the functions suggested by Park (2010), and Wu et al. (2004) using some numerical examples. As the result, we knew that our decision function was better than the function of Park's and Wu's functions because our function had a value between -1 and 1regardless of the range for three association thresholds. Our function had the value of 1 if all of three association measures were greater than their thresholds and had the value of -1 if all of three measures were smaller than the thresholds.

Association rule ranking function using conditional probability increment ratio (조건부 확률증분비를 이용한 연관성 순위 결정 함수)

  • Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.4
    • /
    • pp.709-717
    • /
    • 2010
  • The task of association rule mining is to find certain association relationships among a set of data items in a database. There are three primary measures for association rule, support and confidence and lift. In this paper we developed a association rule ranking function using conditional probability increment ratio. We compared our function with several association rule ranking functions by some numerical examples. As the result, we knew that our decision function was better than the existing functions. The reasons were that the proposed function of the reference value is not affected by a particular association threshold, and our function had a value between -1 and 1 regardless of the range for three association thresholds. And we knew that the ranking function using conditional probability increment ratio was very well reflected in the difference between association rule measures and the minimum association rule thresholds, respectively.

Brain MRI Template-Driven Medical Images Mapping Method Based on Semantic Features for Ischemic Stroke (허혈성 뇌졸중을 위한 뇌 자기공명영상의 의미적 특징 기반 템플릿 중심 의료 영상 매핑 기법)

  • Park, Ye-Seul;Lee, Meeyeon;Lee, Jung-Won
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.5 no.2
    • /
    • pp.69-78
    • /
    • 2016
  • Ischemic stroke is a disease that the brain tissues cannot function by reducing blood flow due to thrombosis or embolisms. Due to the nature of the disease, it is most important to identify the status of cerebral vessel and the medical images are necessarily used for its diagnosis. Among many indicators, brain MRI is most widely utilized because experts can effectively obtain the semantic information such as cerebral anatomy aiding the diagnosis with it. However, in case of emergency diseases like ischemic stroke, even though a intelligent system is required for supporting the prompt diagnosis and treatment, the current systems have some difficulties to provide the information of medical images intuitively. In other words, as the current systems have managed the medical images based on the basic meta-data such as image name, ID and so on, they cannot consider semantic information inherent in medical images. Therefore, in this paper, to provide core information like cerebral anatomy contained in brain MRI, we suggest a template-driven medical images mapping method. The key idea of the method is defining the mapping characteristics between anatomic feature and representative images by using template images that can be representative of the whole brain MRI image set and revealing the semantic relations that only medical experts can check between images. With our method, it will be possible to manage the medical images based on semantic.