• Title/Summary/Keyword: Knowledge-Based Data Mining

Search Result 263, Processing Time 0.026 seconds

The Development of Design Knowledge Management System Using Data Mining (Data Mining 기법을 활용한 디자인 지식경영 시스템 구축)

  • 양종열;오민권;최경은
    • Archives of design research
    • /
    • v.16 no.2
    • /
    • pp.281-290
    • /
    • 2003
  • In the knowledge and information-based age of today, it would be fair to say that the compatibility of each person, enterprise, and nation can be evaluated by how each of them manages and maintains the knowledge created from data and information. Since the importance and necessity of knowledge management has been acknowledged, there have been studies to create, apply, and evaluate the knowledge concerning design. Previous studies done on this subject can be divided into three main categories - CRM, online statistical research, and eCRM - according to the materials used to create knowledge. These studies are meaningful in that they can create knowledge in their respective fields, although they are somewhat inadequate because the designers can't create as much knowledge as can be applied in business; design-related consumers demand composite knowledge integrating the characteristics of all three fields. In other words, they want to know the ordinary customers'preferences in the previous off-line market in the CRM field, the research results of statistical questionnaires to the various elements of design in statistical research fields, and even the pattern of preference and consumption of many and unspecified persons transcending the time and place in eCRU field. This study proposes to solve the problem related with web-based design knowledge maintenance through the synthetic application of CRM, Statistical Research, and eCRM The information proposed in the solution can De expected to help designers working at design-related enterprises, as well as research institutes, to develop the knowledge necessary to design more consumer-oriented products.

  • PDF

Learning process mining techniques based on open education platforms (개방형 e-Learning 플랫폼 기반 학습 프로세스 마이닝 기술)

  • Kim, Hyun-ah
    • The Journal of the Convergence on Culture Technology
    • /
    • v.5 no.2
    • /
    • pp.375-380
    • /
    • 2019
  • In this paper, we study learning process mining and analytic technology based on open education platform. A study on mining through personal learning history log data based on an open education platform such as MOOC which is growing in interest recently. This technology is to design and implement a learning process mining framework for discovering and analyzing meaningful learning processes and knowledge from learning history log data. Learning process mining framework technology is a technique for expressing, extracting, analyzing and visualizing the learning process to provide learners with improved learning processes and educational services.

A Method for Frequent Itemsets Mining from Data Stream (데이터 스트림 환경에서 효율적인 빈발 항목 집합 탐사 기법)

  • Seo, Bok-Il;Kim, Jae-In;Hwang, Bu-Hyun
    • The KIPS Transactions:PartD
    • /
    • v.19D no.2
    • /
    • pp.139-146
    • /
    • 2012
  • Data Mining is widely used to discover knowledge in many fields. Although there are many methods to discover association rule, most of them are based on frequency-based approaches. Therefore it is not appropriate for stream environment. Because the stream environment has a property that event data are generated continuously. it is expensive to store all data. In this paper, we propose a new method to discover association rules based on stream environment. Our new method is using a variable window for extracting data items. Variable windows have variable size according to the gap of same target event. Our method extracts data using COBJ(Count object) calculation method. FPMDSTN(Frequent pattern Mining over Data Stream using Terminal Node) discovers association rules from the extracted data items. Through experiment, our method is more efficient to apply stream environment than conventional methods.

DISCOVERY TEMPORAL FREQUENT PATTERNS USING TFP-TREE

  • Jin Long;Lee Yongmi;Seo Sungbo;Ryu Keun Ho
    • Proceedings of the KSRS Conference
    • /
    • 2005.10a
    • /
    • pp.454-457
    • /
    • 2005
  • Mining frequent patterns in transaction databases, time-series databases, and many other kinds of databases has been studied popularly in data mining research. Most of the previous studies adopt an Apriori-like candidate set generation-and-test approach. However, candidate set generation is still costly, especially when there exist prolific patterns and/or long patterns. And calendar based on temporal association rules proposes the discovery of association rules along with their temporal patterns in terms of calendar schemas, but this approach is also adopt an Apriori-like candidate set generation. In this paper, we propose an efficient temporal frequent pattern mining using TFP-tree (Temporal Frequent Pattern tree). This approach has three advantages: (1) this method separates many partitions by according to maximum size domain and only scans the transaction once for reducing the I/O cost. (2) This method maintains all of transactions using FP-trees. (3) We only have the FP-trees of I-star pattern and other star pattern nodes only link them step by step for efficient mining and the saving memory. Our performance study shows that the TFP-tree is efficient and scalable for mining, and is about an order of magnitude faster than the Apriori algorithm and also faster than calendar based on temporal frequent pattern mining methods.

  • PDF

The HCARD Model using an Agent for Knowledge Discovery

  • Gerardo Bobby D.;Lee Jae-Wan;Joo Su-Chong
    • The Journal of Information Systems
    • /
    • v.14 no.3
    • /
    • pp.53-58
    • /
    • 2005
  • In this study, we will employ a multi-agent for the search and extraction of data in a distributed environment. We will use an Integrator Agent in the proposed model on the Hierarchical Clustering and Association Rule Discovery(HCARD). The HCARD will address the inadequacy of other data mining tools in processing performance and efficiency when use for knowledge discovery. The Integrator Agent was developed based on CORBA architecture for search and extraction of data from heterogeneous servers in the distributed environment. Our experiment shows that the HCARD generated essential association rules which can be practically explained for decision making purposes. Shorter processing time had been noted in computing for clusters using the HCARD and implying ideal processing period than computing the rules without HCARD.

  • PDF

Design and Adaptation for Internet News Data Extraction Middleware(INDEM) System

  • Sun, Bok-Keun
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.4
    • /
    • pp.55-62
    • /
    • 2016
  • In this paper, we propose the INDEM(Internet News Data Extraction Middleware) system for the removal of the unnecessary data in internet news. Although data on the internet can be used in various fields such as source of data of IR(Information Retrieval), Data mining and knowledge information service, it contains a lot of unnecessary information. The removal of the unnecessary data is a problem to be solved prior to the study of the knowledge-based information service that is based on the data of the web page. The INDEM system parses html and explores the XPath, and it is to perform the analysis. The user simply utilize INDEM by implementing an abstract class that provides INDEM, and can obtain the analysis information. INDEM System through this process delivers the analysis information including the main contents of news site to the users. In this paper, the INDEM system was adapted in a stand-alone and web service system and it was evaluated on the basis of 16 news site. As a result, performance of the INDEM system is affected in html source data size and complexity of used html grammar than the main news data size.

Business Performance Analysis System based on Knowledge Discovery in Databases (Knowledge Discovery in Databases에 기반한 경영성과분석 시스템)

  • 조성훈;정민용
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.23 no.57
    • /
    • pp.11-20
    • /
    • 2000
  • In dynamic management environment, CEO must make an efficient decision with information & knowledge management systems based on IT(Information Technology). As a key component to cope with this current, we suggest the business performance analysis system based on KDD(Knowledge Discovery in Databases). We consider the theoretical model that is composited both Value-Added in respect of stakeholder and Economic Value-Added in respect of shareholder. Additionally we use DBMS and data mining method using Genetic Algorithms as physical model. To demonstrate the performance of the business performance analysis system, we analyse a domestic motors industry. The empirical case is based on the financial data of KISFAS(Korea Investors Services Financial Analysis System) database. The samples included in the study consist of H motors/S motors industry over the 16-year from 1981 to 1996.

  • PDF

Quality Design Support System based on Data Mining Approach (데이터 마이닝 기반의 품질설계지원시스템)

  • 지원철
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.28 no.3
    • /
    • pp.31-47
    • /
    • 2003
  • Quality design in practice highly depends on human designer's intuition and past experiences due to lack of formal knowledge about the relationship among 10 variables. This paper represents an data mining approach for developing quality design support system that integrates Case Based Reasoning (CBR) and Artificial Neural Networks (ANN) to effectively support all the steps in quality design process. CBR stores design cases in a systematic way and retrieve them quickly and accurately. ANN predicts the resulting quality attributes of design alternatives that are generated from CBR's adaptation process. When the predicted attributes fail to meet the target values, quality design simulation starts to further adapt the alternatives to the customer's new orders. To implement the quality design simulation, this paper suggests (1) the data screening method based on ξ-$\delta$ Ball to obtain the robust ANN models from the large production data bases, (2) the procedure of quality design simulation using ANN and (3) model management system that helps users find the appropriate one from the ANN model base. The integration of CBR and ANN provides quality design engineers the way that produces consistent and reliable design solutions in the remarkably reduced time.

Design and Implementation of a Distributed Data Mining Framework (분산된 데이터마이닝을 위한 프레임워크의 설계 및 구현)

  • Kadel, Prakash;Choi, Ho-Jin
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2007.06c
    • /
    • pp.336-340
    • /
    • 2007
  • We envisage that grid computing environments allow us to implement distributed data mining services, that is, those applications which analyze large sets of geographically distributed databases and information using the computational power and resources of a grid environment. This paper describes an experimental framework towards such a distributed data mining approach, including design considerations and a prototype implementation. Based on the "Knowledge Grid" architecture suggested by Cannataro et al., we identify four major components - user node, broker node, data node, and computation node - and define their individual roles. For implementing the prototype, we have investigated methods for utilizing distributed resources within a grid computing environment, e.g., communication and coordination among the various resources available.

  • PDF

Data Mining Approach for Real-Time Processing of Large Data Using Case-Based Reasoning : High-Risk Group Detection Data Warehouse for Patients with High Blood Pressure (사례기반추론을 이용한 대용량 데이터의 실시간 처리 방법론 : 고혈압 고위험군 관리를 위한 자기학습 시스템 프레임워크)

  • Park, Sung-Hyuk;Yang, Kun-Woo
    • Journal of Information Technology Services
    • /
    • v.10 no.1
    • /
    • pp.135-149
    • /
    • 2011
  • In this paper, we propose the high-risk group detection model for patients with high blood pressure using case-based reasoning. The proposed model can be applied for public health maintenance organizations to effectively manage knowledge related to high blood pressure and efficiently allocate limited health care resources. Especially, the focus is on the development of the model that can handle constraints such as managing large volume of data, enabling the automatic learning to adapt to external environmental changes and operating the system on a real-time basis. Using real data collected from local public health centers, the optimal high-risk group detection model was derived incorporating optimal parameter sets. The results of the performance test for the model using test data show that the prediction accuracy of the proposed model is two times better than the natural risk of high blood pressure.