• Title/Summary/Keyword: Mining method

Search Result 2,069, Processing Time 0.029 seconds

Fuzzy Web Usage Mining for User Modeling

  • Jang, Jae-Sung;Jun, Sung-Hae;Oh, Kyung-Whan
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.2 no.3
    • /
    • pp.204-209
    • /
    • 2002
  • The interest of data mining in artificial intelligence with fuzzy logic has been increased. Data mining is a process of extracting desirable knowledge and interesting pattern ken large data set. Because of expansion of WWW, web data is more and more huge. Besides mining web contents and web structures, another important task for web mining is web usage mining which mines web log data to discover user access pattern. The goal of web usage mining in this paper is to find interesting user pattern in the web with user feedback. It is very important to find user's characteristic fer e-business environment. In Customer Relationship Management, recommending product and sending e-mail to user by extracted users characteristics are needed. Using our method, we extract user profile from the result of web usage mining. In this research, we concentrate on finding association rules and verify validity of them. The proposed procedure can integrate fuzzy set concept and association rule. Fuzzy association rule uses given server log file and performs several preprocessing tasks. Extracted transaction files are used to find rules by fuzzy web usage mining. To verify the validity of user's feedback, the web log data from our laboratory web server.

WIS: Weighted Interesting Sequential Pattern Mining with a Similar Level of Support and/or Weight

  • Yun, Un-Il
    • ETRI Journal
    • /
    • v.29 no.3
    • /
    • pp.336-352
    • /
    • 2007
  • Sequential pattern mining has become an essential task with broad applications. Most sequential pattern mining algorithms use a minimum support threshold to prune the combinatorial search space. This strategy provides basic pruning; however, it cannot mine correlated sequential patterns with similar support and/or weight levels. If the minimum support is low, many spurious patterns having items with different support levels are found; if the minimum support is high, meaningful sequential patterns with low support levels may be missed. We present a new algorithm, weighted interesting sequential (WIS) pattern mining based on a pattern growth method in which new measures, sequential s-confidence and w-confidence, are suggested. Using these measures, weighted interesting sequential patterns with similar levels of support and/or weight are mined. The WIS algorithm gives a balance between the measures of support and weight, and considers correlation between items within sequential patterns. A performance analysis shows that WIS is efficient and scalable in weighted sequential pattern mining.

  • PDF

Study of Temporal Data Mining for Transformer Load Pattern Analysis (변압기 부하패턴 분석을 위한 시간 데이터마이닝 연구)

  • Shin, Jin-Ho;Yi, Bong-Jae;Kim, Young-Il;Lee, Heon-Gyu;Ryu, Keun-Ho
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.57 no.11
    • /
    • pp.1916-1921
    • /
    • 2008
  • This paper presents the temporal classification method based on data mining techniques for discovering knowledge from measured load patterns of distribution transformers. Since the power load patterns have time-varying characteristics and very different patterns according to the hour, time, day and week and so on, it gives rise to the uninformative results if only traditional data mining is used. Therefore, we propose a temporal classification rule for analyzing and forecasting transformer load patterns. The main tasks include the load pattern mining framework and the calendar-based expression using temporal association rule and 3-dimensional cube mining to discover load patterns in multiple time granularities.

Rating and Comments Mining Using TF-IDF and SO-PMI for Improved Priority Ratings

  • Kim, Jinah;Moon, Nammee
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.11
    • /
    • pp.5321-5334
    • /
    • 2019
  • Data mining technology is frequently used in identifying the intention of users over a variety of information contexts. Since relevant terms are mainly hidden in text data, it is necessary to extract them. Quantification is required in order to interpret user preference in association with other structured data. This paper proposes rating and comments mining to identify user priority and obtain improved ratings. Structured data (location and rating) and unstructured data (comments) are collected and priority is derived by analyzing statistics and employing TF-IDF. In addition, the improved ratings are generated by applying priority categories based on materialized ratings through Sentiment-Oriented Point-wise Mutual Information (SO-PMI)-based emotion analysis. In this paper, an experiment was carried out by collecting ratings and comments on "place" and by applying them. We confirmed that the proposed mining method is 1.2 times better than the conventional methods that do not reflect priorities and that the performance is improved to almost 2 times when the number to be predicted is small.

Process Planning Method under Make-to-Order Production System using Data Mining (데이터마이닝을 이용한 수주생산시스템의 공정계획방안)

  • Oh, Kyung-Mo;Park, Chang-Kwon
    • IE interfaces
    • /
    • v.18 no.2
    • /
    • pp.148-157
    • /
    • 2005
  • The manufacturing industry with Make-to-Order production system is difficult to decide the standard information for the product and the demand is variable to estimate. In this paper, we concerned with the process planning method using data mining in the manufacturing industry with Make-to-Order environment. The subject of our study is the industry transformer plant which is received an diverse order of customer and then produced the product. Currently, process planning method is classified the standard information by hand based on the acquired knowledge through the experience. The standard information stored the various information, such as work sequence, time and so on. This process planning method needs an experts which possesses the field experience for several years. For the product specification which is varied in each order, current process planning method is not efficient due to need many times To solve this problem, we extract the information using data mining process for each processing time, and then construct the knowledge base. We propose a method which is the process planning of the industry transformer product in Make-to-Order environment using the knowledge base.

A Study on Extracting Ideas from Documents and Webpages in the Field of Idea Mining (아이디어 마이닝 분야에서 문헌과 웹페이지의 아이디어 발췌에 대한 연구)

  • Lee, Tae-Young
    • Journal of the Korean Society for information Management
    • /
    • v.29 no.1
    • /
    • pp.25-43
    • /
    • 2012
  • The ideas and quasi-ideas useful for human's creation were drawn out from documents and webpages with extraction methods used in idea mining, opinion mining, and topic signal mining. The extraction methods comprised (1) decisive cue phrases, (2) cue figures and sounds, (3) contextual signals, and (4) discourse segmentations, They tested on the idea samples, such as thoughts, plans, opinions, writings, figures, sounds, and formulas. Methods (1), (3), and (4) received largely positive evaluation, judging the efficiency of 4 methods by F measure, a mixture of recall and precision ratio. In particular, decisive cue phrase method was effective to search idea and contextual signal method was effective to detect quasi-idea.

Analysis of a Repair Processes Using a Process Mining Tool (프로세스 마이닝 기법을 활용한 고장 수리 프로세스 분석)

  • Choi, Sang Hyun;Han, Kwan Hee;Lim, Gun Hoon
    • The Journal of the Korea Contents Association
    • /
    • v.13 no.4
    • /
    • pp.399-406
    • /
    • 2013
  • Recently, studies about process mining for creating and analyzing business process models from log data have received much attention from BPM (Business Process Management) researchers. Process mining is a kind of method that extracts meaningful information and hidden rules from the event log of enterprise information systems such as ERP and BPM. In this paper, repair processes of electronic devices are analyzed using ProM which is a process mining tool. And based on the analysis of repair processes, the method for finding major failure patterns is proposed by multi-dimensional data analysis beyond simple statistics. By using the proposed method, the reliability of electronic device can be increased by providing the identified failure patterns to design team.

Study on damage law and width optimization design of coal pillar with the discrete element method

  • Chuanwei Zang;Bingzheng Jiang;Xiaoshan Wang;Hao Wang;Jia Zhou;Miao Chen;Yu Cong
    • Geomechanics and Engineering
    • /
    • v.37 no.6
    • /
    • pp.555-563
    • /
    • 2024
  • The reasonable setting of coal pillar width plays a key role in guaranteeing the steadiness of surrounding rock of fully mechanized caving gateroad driving along the next goaf. Based on the engineering background of the Bayangaole mine, the discrete element method was used to simulate the fracture evolution of coal pillars with different pillar widths. The results show that the damage rate of the coal pillar increases with the decrease in the width of the coal pillar. Once the coal pillar width is smaller than 6 m, cracks run through the coal pillar, and the coal pillar is completely damaged. In the middle of the coal pillar, which has a width of 6 m and above, there is a relatively complete area with low damage. The results show that the pillar width of 6 m is the most appropriate. Field tests prove that the reserved width of a 6 m small coal pillar can effectively control the surrounding rock deformation, ensuring the overall steadiness of the gateroad in the thick coal seam. It is hoped that this study will offer some reference for the determination of the reasonable size of the coal pillar.

The Efficient Spatio-Temporal Moving Pattern Mining using Moving Sequence Tree (이동 시퀀스 트리를 이용한 효율적인 시공간 이동 패턴 탐사 기법)

  • Lee, Yon-Sik;Ko, Hyun
    • The KIPS Transactions:PartD
    • /
    • v.16D no.2
    • /
    • pp.237-248
    • /
    • 2009
  • Recently, based on dynamic location or mobility of moving object, many researches on pattern mining methods actively progress to extract more available patterns from various moving patterns for development of location based services. The performance of moving pattern mining depend on how analyze and process the huge set of spatio-temporal data. Some of traditional spatio-temporal pattern mining methods[1-6,8-11]have proposed to solve these problem, but they did not solve properly to reduce mining execution time and minimize required memory space. Therefore, in this paper, we propose new spatio-temporal pattern mining method which extract the sequential and periodic frequent moving patterns efficiently from the huge set of spatio-temporal moving data. The proposed method reduces mining execution time of $83%{\sim}93%$ rate on frequent moving patterns mining using the moving sequence tree which generated from historical data of moving objects based on hash tree. And also, for minimizing the required memory space, it generalize the detained historical data including spatio-temporal attributes into the real world scope of space and time using spatio-temporal concept hierarchy.

Interplay of Text Mining and Data Mining for Classifying Web Contents (웹 컨텐츠의 분류를 위한 텍스트마이닝과 데이터마이닝의 통합 방법 연구)

  • 최윤정;박승수
    • Korean Journal of Cognitive Science
    • /
    • v.13 no.3
    • /
    • pp.33-46
    • /
    • 2002
  • Recently, unstructured random data such as website logs, texts and tables etc, have been flooding in the internet. Among these unstructured data there are potentially very useful data such as bulletin boards and e-mails that are used for customer services and the output from search engines. Various text mining tools have been introduced to deal with those data. But most of them lack accuracy compared to traditional data mining tools that deal with structured data. Hence, it has been sought to find a way to apply data mining techniques to these text data. In this paper, we propose a text mining system which can incooperate existing data mining methods. We use text mining as a preprocessing tool to generate formatted data to be used as input to the data mining system. The output of the data mining system is used as feedback data to the text mining to guide further categorization. This feedback cycle can enhance the performance of the text mining in terms of accuracy. We apply this method to categorize web sites containing adult contents as well as illegal contents. The result shows improvements in categorization performance for previously ambiguous data.

  • PDF