• Title/Summary/Keyword: knowledge discovery system

Search Result 129, Processing Time 0.032 seconds

Ontology based Preprocessing Scheme for Mining Data Streams from Sensor Networks (센서 네트워크의 데이터 스트림 마이닝을 위한 온톨로지 기반의 전처리 기법)

  • Jung, Jason J.
    • Journal of Intelligence and Information Systems
    • /
    • v.15 no.3
    • /
    • pp.67-80
    • /
    • 2009
  • By a number of sensors and sensor networks, we can collect environmental information from a certain sensor space. To discover more useful information and knowledge, we want to employ data mining methodologies to sensor data stream from such sensor spaces. In this paper, we present a novel data preprocessing scheme to improve the performances of the data mining algorithms. Especially, ontologies are applied to represent meanings of the sensor data. For evaluating the proposed method, we have collected sensor streams for about 30 days, and simulated them to compare with other approaches.

  • PDF

Discovery of Interesting Knowledge using Concept Hierarchy (개념 계층 이용 흥미로운 부분 데이터의 탐색)

  • 홍정희;김성민;남도원;이동하;이전영
    • Journal of Intelligence and Information Systems
    • /
    • v.6 no.2
    • /
    • pp.77-89
    • /
    • 2000
  • 개념 계층(Concept Hierarchy)은 데이터베이스 분야에서 사용되는 대표적인 배경 지식(Background Knowledge)으로써, 데이터베이스에 내재되어 있는 구조적인 정보, 데이터의 분포, 영역전문가 (Domain Expert)에 의해 주어지는 외부 지식 등이 반영되어 있다. 개념계층의 특성상 부모(parent)-자 식(child) 관계가 있는 두 노드가 있을 때, 한 노드의 값으로부터 다른 노드의 값을 추정할 수 있다 이 추정된 값을 기대치라고 하고, 한 노드의 값으로부터 추정된 기대치와 실제치가 상당히 상이한 값을 보이는 노드가 있을 때, 이를 흥미롭다(interesting)고 말할 수 있다. 그러나 아직까지 개념계층 상에서의 흥미로운 부분 탐색에 대한 연구가 없었으며, 흥미로움(interestingness)의 척도(measurement) 에 대한 연구로서는 신뢰도(confidence),리프트(lift),컨빅션(conviction)등이 있었다. 그러나 이런 흥미도 의 척도에 관한 연구도 연관규칙에 한정되어 이루어졌으므로 개념계층상의 데이터에 적용하기 위해 서는 약간의 수정 및 새로운 정의가 필요하다. 본 논문에서는 데이터의 특성에 따른 개념계층이 존재할 때, 이를 이용하여 기대치와 실제치가 상이한 흥미로운 부분을 발견하고자 하며, 이를 위하여 개념계층상에서의 흥미도의 척도를 제안하고 흥미로운 부분을 탐색하는 방법을 기술하고자 한다. 또한 데이터마이닝의 결과인 연관규칙을 개념 계층에 적용하여 연관규칙을 통해 얻어질 수 있는 기대치를, 지지도(support), 신뢰도(confidence), 리프트(lift), 컨빅션(conviction)등의 관계를 통해 다양한 방법으로 모색해본다. 이 연구에서 제안하는 이러한 개념계층상의 흥미로운 부분의 탐색은, 전자 상거래에서 CRM(Customer Relationship Management)나 틈새시장(niche market) 마케팅 등에 적용 가능하리라 여겨진다.

  • PDF

Adaptation Methods for a Probabilistic Fuzzy Rule-based Learning System (확률적 퍼지 룰 기반 학습 시스템의 적응 방법)

  • Lee, Hyeong-Uk;Byeon, Jeung-Nam
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2007.11a
    • /
    • pp.223-226
    • /
    • 2007
  • 지식 발견 (knowledge discovery)의 관점에서, 단기간 동안 취득된 데이터 패턴을 학습하고자 하는 경우 데이터에 비일관적인(inconsistent) 패턴이 포함되어 있다면 확률적 퍼지 룰(probabilistic fuzzy rule) 기반의 지식 표현 방법 및 적절한 학습 알고리즘을 이용하여 효과적으로 다룰 수 있다. 하지만 장기간 동안 지속적으로 얻어진 데이터 패턴을 다루고자 하는 경우, 데이터가 시변(time-varying) 특성을 가지고 있으면 기존에 추출된 지식을 변화된 데이터에 활용하기 어렵게 된다. 때문에 이러한 데이터를 다루는 학습 시스템에는 패턴의 변화에 맞추어 갈 수 있는 지속적인 적응력(adaptivity)이 요구된다. 본 논문에서는 이러한 적응성의 측면을 고려하여 평생 학습(life-long learning)의 관점 에 서 확률적 퍼지 룰 기반의 학습 시스템에 적용될 수 있는 두 가지 형태의 적응 방법에 대해서 설명하도록 한다.

  • PDF

An Efficient Algorithm for Mining Frequent Sequences In Spatiotemporal Data

  • Vhan Vu Thi Hong;Chi Cheong-Hee;Ryu Keun-Ho
    • 한국공간정보시스템학회:학술대회논문집
    • /
    • 2005.11a
    • /
    • pp.61-66
    • /
    • 2005
  • Spatiotemporal data mining represents the confluence of several fields including spatiotemporal databases, machine loaming, statistics, geographic visualization, and information theory. Exploration of spatial data mining and temporal data mining has received much attention independently in knowledge discovery in databases and data mining research community. In this paper, we introduce an algorithm Max_MOP for discovering moving sequences in mobile environment. Max_MOP mines only maximal frequent moving patterns. We exploit the characteristic of the problem domain, which is the spatiotemporal proximity between activities, to partition the spatiotemporal space. The task of finding moving sequences is to consider all temporally ordered combination of associations, which requires an intensive computation. However, exploiting the spatiotemporal proximity characteristic makes this task more cornputationally feasible. Our proposed technique is applicable to location-based services such as traffic service, tourist service, and location-aware advertising service.

  • PDF

A Review of Korean Medicine Treatment for Tic Disorder (틱장애의 한방치료에 대한 국내 임상 연구 동향)

  • Cho, Youn Soo;Baek, Jung Han
    • The Journal of Pediatrics of Korean Medicine
    • /
    • v.31 no.4
    • /
    • pp.9-18
    • /
    • 2017
  • Objectives The purpose of this study is to investigate the recent Korean medicine treatment and study trends of tic disorder by reviewing clinical studies. Methods The clinical studies of Korean medicine treatment for tic disorder were obtained from National Discovery for Science Leader (NDSL), Research Information Sharing Service (RISS), Korean Traditional Knowledge Portal (KTKP) and Oriental Medicine Advanced Searching Integrated System (OASIS). Results 15 articles were analyzed to find out the most commonly used herbal medicine, acupoints, other Korean medicine treatments and evaluation methods. Conclusions This study shows some common Korean medicine treatment methods for tic disorder which can be used for further research.

Second-Order Learning for Complex Forecasting Tasks: Case Study of Video-On-Demand (복잡한 예측문제에 대한 이차학습방법 : Video-On-Demand에 대한 사례연구)

  • 김형관;주종형
    • Journal of Intelligence and Information Systems
    • /
    • v.3 no.1
    • /
    • pp.31-45
    • /
    • 1997
  • To date, research on data mining has focused primarily on individual techniques to su, pp.rt knowledge discovery. However, the integration of elementary learning techniques offers a promising strategy for challenging a, pp.ications such as forecasting nonlinear processes. This paper explores the utility of an integrated a, pp.oach which utilizes a second-order learning process. The a, pp.oach is compared against individual techniques relating to a neural network, case based reasoning, and induction. In the interest of concreteness, the concepts are presented through a case study involving the prediction of network traffic for video-on-demand.

  • PDF

Implementation of Knowledge Discovery System Using Integrated Method (통합 방법에 의한 지식 발견 시스템의 구현)

  • Kim, Jung-Ho;Chung, Hong
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 1998.10c
    • /
    • pp.21-23
    • /
    • 1998
  • 본 연구에서는 속성중심 귀납법에서 사용하는 개념 계층의 상승 기법, 결정트리에 의한 귀납법에서 사용하는 정보 획득량의 측정 기법, 그리고 라프셋에 의한 지식감축 방법을 복합하여 저수준의 데이터를 고수준 정보로 일반화하고, 불필요한 속성들을 감축하여 간략화된 결정규칙을 도출하는 통합방법의 지식 발견 시스템을 시험적으로 구현했다. 여기서 추출한 최소화 결정 규칙은 대규모 데이터베이스에서 추출할수 있는 유용한 지식으로 의사결정에 사용하는 정보가 된다. 생성된 규칙지식은 각기 방법들보다 간결하다. 그리고 개념 일반화에 의해 유도된 지식이 고수준의 추상으로 표현된다.

  • PDF

A Design of Tutoring System Based on Agents (에이전트 기반 교수시스템의 설계)

  • Choi, Young-Mee;Jang, Young-Hee
    • Journal of The Korean Association of Information Education
    • /
    • v.1 no.2
    • /
    • pp.45-56
    • /
    • 1997
  • A design of Advisory Tutoring Agent(ATA) is proposed to help a student on the basis of a student-oriented discovery learning, The agents adaptively build a student model and interact with a student as well as each other to give a piece of advice at a proper time. The appropriateness of advice depends on the agents' reasoning sod interaction performed based on the student model. domain knowledge, teaching strategy, and user-friendly interface. The design has been applied to a simple English learning environment.

  • PDF

Building Intelligent User Interface Agent for Semantically Reformulating User Query in Medicine

  • Yang, Jung-Jin;Lim, Chae-Myung;Chu, Sung-Joon;Lee, Dong-Hoon;Park, Duck-Whan;Park, Tae-Yong
    • Journal of Intelligence and Information Systems
    • /
    • v.9 no.2
    • /
    • pp.101-119
    • /
    • 2003
  • Achieving the beneficiary goal of recent discovery in human genome project still needs a way to retrieve and analyze the exponentially expanding bio-related information. Research on bio-related fields naturally applies knowledge discovered to the current problem and make inferences to extract new information where shared concepts and data containing information need to be defined and used in a coherent way. In such a professional domain, while the need to help users reduce their work and to improve search results has been emerged, methods for systematic retrieval and adequate exchange of relevant information are still in their infancy. The design of our system aims at improving the quality of information retrieval in a professional domain by utilizing both corpus-based and concept-based ontology. Meta-rules of helping users to make an adequate query are formed into an ontology in the domain. The integration of those knowledge permits the system to retrieve relevant information in a more semantic and systematic fashion. This work mainly describes the query models with details of GUI and a secondary query generation of the system.

  • PDF

Defining and Discovering Cardinalities of the Temporal Workcases from XES-based Workflow Logs

  • Yun, Jaeyoung;Ahn, Hyun;Kim, Kwanghoon Pio
    • Journal of Internet Computing and Services
    • /
    • v.20 no.3
    • /
    • pp.77-84
    • /
    • 2019
  • Workflow management system is a system that manages the workflow model which defines the process of work in reality. We can define the workflow process by sequencing jobs which is performed by the performers. Using the workflow management system, we can also analyze the flow of the process and revise it more efficiently. Many researches are focused on how to make the workflow process model more efficiently and manage it more easily. Recently, many researches use the workflow log files which are the execution history of the workflow process model performed by the workflow management system. Ourresearch group has many interests in making useful knowledge from the workflow event logs. In this paper we use XES log files because there are many data using this format. This papersuggests what are the cardinalities of the temporal workcases and how to get them from the workflow event logs. Cardinalities of the temporal workcases are the occurrence pattern of critical elements in the workflow process. We discover instance cardinalities, activity cardinalities and organizational resource cardinalities from several XES-based workflow event logs and visualize them. The instance cardinality defines the occurrence of the workflow process instances, the activity cardinality defines the occurrence of the activities and the organizational cardinality defines the occurrence of the organizational resources. From them, we expect to get many useful knowledge such as a patterns of the control flow of the process, frequently executed events, frequently working performer and etc. In further, we even expect to predict the original process model by only using the workflow event logs.