• Title/Summary/Keyword: 순차패턴 마이닝

Search Result 90, Processing Time 0.021 seconds

Adapted Sequential Pattern Mining Algorithms for Business Service Identification (비즈니스 서비스 식별을 위한 변형 순차패턴 마이닝 알고리즘)

  • Lee, Jung-Won
    • Journal of the Korea Society of Computer and Information
    • /
    • v.14 no.4
    • /
    • pp.87-99
    • /
    • 2009
  • The top-down method for SOA delivery is recommended as a best way to take advantage of SOA. The core step of SOA delivery is the step of service modeling including service analysis and design based on ontology. Most enterprises know that the top-down approach is the best but they are hesitant to employ it because it requires them to invest a great deal of time and money without it showing any immediate results, particularly because they use well-defined component based systems. In this paper, we propose a service identification method to use a well-defined components maximally as a bottom-up approach. We assume that user's inputs generates events on a GUI and the approximate business process can be obtained from concatenating the event paths. We first find the core GUIs which have many outgoing event calls and form event paths by concatenating the event calls between the GUIs. Next, we adapt sequential pattern mining algorithms to find the maximal frequent event paths. As an experiment, we obtained business services with various granularity by applying a cohesion metric to extracted frequent event paths.

On-Line Mining using Association Rules and Sequential Patterns in Electronic Commerce (전자상거래에서 연관규칙과 순차패턴을 이용한 온라인 마이닝)

  • 김성학
    • Journal of the Korea Computer Industry Society
    • /
    • v.2 no.7
    • /
    • pp.945-952
    • /
    • 2001
  • In consequence of expansion of internet users, electronic commerce is becoming a new prototype for marketing and sales, arid most of electronic commerce sites or internet shopping malls provide a rich source of information and convenient user interfaces about the organizations customers to maintain their patrons. One of the convenient interfaces for users is service to recommend products. To do this, they must exploit methods to extract and analysis specific patterns from purchasing information, behavior and market basket about customers. The methods are association rules and sequential patterns, which are widely used to extract correlation among products, and in most of on-line electronic commerce sites are executed with users information and purchased history by category-oriented. But these can't represent the diverse correlation among products and also hardly reflect users' buying patterns precisely, since the results are simple set of relations for single purchased pattern. In this paper, we propose an efficient mining technique, which allows for multiple purchased patterns that are category-independent and have relationship among items in the linked structure of single pattern items.

  • PDF

Mining Interesting Sequential Pattern with a Time-interval Constraint for Efficient Analyzing a Web-Click Stream (웹 클릭 스트림의 효율적 분석을 위한 시간 간격 제한을 활용한 관심 순차패턴 탐색)

  • Chang, Joong-Hyuk
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.16 no.2
    • /
    • pp.19-29
    • /
    • 2011
  • Due to the development of web technologies and the increasing use of smart devices such as smart phone, in recent various web services are widely used in many application fields. In this environment, the topic of supporting personalized and intelligent web services have been actively researched, and an analysis technique on a web-click stream generated from web usage logs is one of the essential techniques related to the topic. In this paper, for efficient analyzing a web-click stream of sequences, a sequential pattern mining technique is proposed, which satisfies the basic requirements for data stream processing and finds a refined mining result. For this purpose, a concept of interesting sequential patterns with a time-interval constraint is defined, which uses not on1y the order of items in a sequential pattern but also their generation times. In addition, A mining method to find the interesting sequential patterns efficiently over a data stream such as a web-click stream is proposed. The proposed method can be effectively used to various computing application fields such as E-commerce, bio-informatics, and USN environments, which generate data as a form of data streams.

Finding Weighted Sequential Patterns over Data Streams via a Gap-based Weighting Approach (발생 간격 기반 가중치 부여 기법을 활용한 데이터 스트림에서 가중치 순차패턴 탐색)

  • Chang, Joong-Hyuk
    • Journal of Intelligence and Information Systems
    • /
    • v.16 no.3
    • /
    • pp.55-75
    • /
    • 2010
  • Sequential pattern mining aims to discover interesting sequential patterns in a sequence database, and it is one of the essential data mining tasks widely used in various application fields such as Web access pattern analysis, customer purchase pattern analysis, and DNA sequence analysis. In general sequential pattern mining, only the generation order of data element in a sequence is considered, so that it can easily find simple sequential patterns, but has a limit to find more interesting sequential patterns being widely used in real world applications. One of the essential research topics to compensate the limit is a topic of weighted sequential pattern mining. In weighted sequential pattern mining, not only the generation order of data element but also its weight is considered to get more interesting sequential patterns. In recent, data has been increasingly taking the form of continuous data streams rather than finite stored data sets in various application fields, the database research community has begun focusing its attention on processing over data streams. The data stream is a massive unbounded sequence of data elements continuously generated at a rapid rate. In data stream processing, each data element should be examined at most once to analyze the data stream, and the memory usage for data stream analysis should be restricted finitely although new data elements are continuously generated in a data stream. Moreover, newly generated data elements should be processed as fast as possible to produce the up-to-date analysis result of a data stream, so that it can be instantly utilized upon request. To satisfy these requirements, data stream processing sacrifices the correctness of its analysis result by allowing some error. Considering the changes in the form of data generated in real world application fields, many researches have been actively performed to find various kinds of knowledge embedded in data streams. They mainly focus on efficient mining of frequent itemsets and sequential patterns over data streams, which have been proven to be useful in conventional data mining for a finite data set. In addition, mining algorithms have also been proposed to efficiently reflect the changes of data streams over time into their mining results. However, they have been targeting on finding naively interesting patterns such as frequent patterns and simple sequential patterns, which are found intuitively, taking no interest in mining novel interesting patterns that express the characteristics of target data streams better. Therefore, it can be a valuable research topic in the field of mining data streams to define novel interesting patterns and develop a mining method finding the novel patterns, which will be effectively used to analyze recent data streams. This paper proposes a gap-based weighting approach for a sequential pattern and amining method of weighted sequential patterns over sequence data streams via the weighting approach. A gap-based weight of a sequential pattern can be computed from the gaps of data elements in the sequential pattern without any pre-defined weight information. That is, in the approach, the gaps of data elements in each sequential pattern as well as their generation orders are used to get the weight of the sequential pattern, therefore it can help to get more interesting and useful sequential patterns. Recently most of computer application fields generate data as a form of data streams rather than a finite data set. Considering the change of data, the proposed method is mainly focus on sequence data streams.

A Personalized Automatic TV Program Scheduler using Sequential Pattern Mining (순차 패턴 마이닝 기법을 이용한 개인 맞춤형 TV 프로그램 스케줄러)

  • Pyo, Shin-Jee;Kim, Eun-Hui;Kim, Mun-Churl
    • Journal of Broadcast Engineering
    • /
    • v.14 no.5
    • /
    • pp.625-637
    • /
    • 2009
  • With advent of TV environment and increasing of variety of program contents, users are able to experience more various and complex environment for watching TV contents. According to the change of content watching environment, users have to make more efforts to choose his/her interested TV program contents or TV channels than before. Also, the users usually watch the TV program contents with their own regular way. So, in this paper, we suggests personalized TV program schedule recommendation system based on the analyzing users' TV watching history data. And we extract the users' watched program patterns using the sequential pattern mining method. Also, we proposed a new sequential pattern mining which is suitable for TV watching environment and verify our proposed method have better performance than existing sequential pattern mining method in our application area. In the future, we will consider a VoD characteristic for extending to IPTV program schedule recommendation system.

An Efficient Mining for Closed Frequent Sequences (효율적인 닫힌 빈발 시퀀스 마이닝)

  • Kim, Hyung-Geun;Whang, Whan-Kyu
    • Journal of Industrial Technology
    • /
    • v.25 no.A
    • /
    • pp.163-173
    • /
    • 2005
  • Recent sequential pattern mining algorithms mine all of the frequent sequences satisfying a minimum support threshold in a large database. However, when a frequent sequence becomes very long, such mining will generate an explosive number of frequent sequence, which is prohibitively expensive in time. In this paper, we proposed a novel sequential pattern algorithm using only closed frequent sequences which are small subset of very large frequent sequences. Our algorithm extends the sequence by depth-first search strategy with effective pruning. Using bitmap representation of underlying databases, we can obtain a closed frequent sequence considerably faster than the currently reported methods.

  • PDF

Efficient Mining of User Behavior patterns by classification of age based on location information (위치에 따른 연령대별 유용한 행동패턴 추출 기법)

  • Kim, HyeRan;Lee, SeungCheol;Kim, UngMo
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2007.11a
    • /
    • pp.250-253
    • /
    • 2007
  • 통신기술의 발달로 무선단말기의 보급이 급증하고 무선 네트워크 사용이 일반화됨으로써, 최근 유비쿼터스 컴퓨팅 기술이 중요한 이슈가 되고 있다. 유비쿼터스 컴퓨팅은 시간과 장소의 한계를 넘어 사용자가 하고자 하는 일을 컴퓨팅 환경이 상황을 인지하여 돕는 것을 가능하게 한다. 상황인지를 위해 순차패턴과 시간 연관규칙 탐사를 이용하여 사용자의 행동패턴을 추출하는 연구가 활발히 진행되고 있다. 이러한 연구를 통한 행동패턴은 사용자의 특성을 간과하게 되며, 각 사용자에게 더욱 유용한 서비스를 제공하기 위해서는 사용자를 분류하는 것이 필요하다. 그러나 기존의 연구는 단지 통계적인 사용자의 빈발 행동패턴만을 추출하여 각 사용자의 관심사와는 무관한 서비스 제공이 이루어질 수 있다. 성별, 나이, 직업 등의 개인정보와 위치를 고려하여 사용자에게 더욱 더 효율적이고 유용한 서비스를 제공할 수 있도록 행동패턴을 유형별로 분류할 필요가 있다. 본 논문에서는 각 위치에 따른 사용자의 연령대별 유용한 행동패턴을 추출하여 정확한 서비스를 제공할 수 있는 마이닝 기법을 제안한다.

Product Recommendation System on VLDB using k-means Clustering and Sequential Pattern Technique (k-means 클러스터링과 순차 패턴 기법을 이용한 VLDB 기반의 상품 추천시스템)

  • Shim, Jang-Sup;Woo, Seon-Mi;Lee, Dong-Ha;Kim, Yong-Sung;Chung, Soon-Key
    • The KIPS Transactions:PartD
    • /
    • v.13D no.7 s.110
    • /
    • pp.1027-1038
    • /
    • 2006
  • There are many technical problems in the recommendation system based on very large database(VLDB). So, it is necessary to study the recommendation system' structure and the data-mining technique suitable for the large scale Internet shopping mail. Thus we design and implement the product recommendation system using k-means clustering algorithm and sequential pattern technique which can be used in large scale Internet shopping mall. This paper processes user information by batch processing, defines the various categories by hierarchical structure, and uses a sequential pattern mining technique for the search engine. For predictive modeling and experiment, we use the real data(user's interest and preference of given category) extracted from log file of the major Internet shopping mall in Korea during 30 days. And we define PRP(Predictive Recommend Precision), PRR(Predictive Recommend Recall), and PF1(Predictive Factor One-measure) for evaluation. In the result of experiments, the best recommendation time and the best learning time of our system are much as O(N) and the values of measures are very excellent.

FMC's Robot Path Analysis and Design Using Simulation and Sequential patterns (시뮬레이션과 순차 패턴을 이용한 FMC의 로봇 경로 분석 및 설계)

  • Kim, Sun-Gil;Lee, Hong-Chul
    • Proceedings of the KAIS Fall Conference
    • /
    • 2009.12a
    • /
    • pp.806-809
    • /
    • 2009
  • 본 논문에서는 FMC의 로봇 경로 분석 및 설계를 하기 위해 시뮬레이션을 이용해 FMC의 로봇 패턴을 분석하고 그 결과를 이용해 최적의 로봇 경로를 설계하는 방법을 제시하였다. 전형적인 FMC를 시뮬레이션으로 설계하고 설비에서 신호를 추출 해 순차 패턴 마이닝을 이용해 로봇의 최적 이동 경로를 도출하는 방법을 제시하였다. 이러한 신호의 패턴을 이용한 분석 방법은 로봇의 경로 설계를 도출하기가 용이하여 최적의 경로를 설계하여 FMC에 적용한 결과 기존보다 총 처리량의 증가와 총 처리시간 감소를 가져왔다. 또한 이 방법은 FMC 뿐만 아니라 로봇이 있는 모든 생산라인에 시뮬레이션을 통해 분석이 가능하기 때문에 생산성 향상에 크게 기여할 것으로 기대된다.

  • PDF