• Title/Summary/Keyword: 빈발도

Search Result 465, Processing Time 0.025 seconds

An Efficient Candidate Pattern Tree Structure and Algorithm for Incremental Web Mining (점진적인 웹 마이닝을 위한 효율적인 후보패턴 저장 트리구조 및 알고리즘)

  • Kang, Hee-Seong;Park, Byung-Joon
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.44 no.1
    • /
    • pp.71-79
    • /
    • 2007
  • Recent advances in the internet infrastructure have resulted in a large number of huge Web sites and portals worldwide. These Web sites are being visited by various types of users in many different ways. Among all the web page access sequences from different users, some of them occur so frequently that may need an attention from those who are interested. We call them frequent access patterns and access sequences that can be frequent the candidate patterns. Since these candidate patterns play an important role in the incremental Web mining, it is important to efficiently generate, add, delete, and search for them. This thesis presents a novel tree structure that can efficiently store the candidate patterns and a related set of algorithms for generating the tree structure, adding new patterns, deleting unnecessary patterns, and searching for the needed ones. The proposed tree structure has a kind of the 3 dimensional link structure and its nodes are layered.

A Study on the Implementation of an optimized Algorithm for association rule mining system using Fuzzy Utility (Fuzzy Utility를 활용한 연관규칙 마이닝 시스템을 위한 알고리즘의 구현에 관한 연구)

  • Park, In-Kyu;Choi, Gyoo-Seok
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.20 no.1
    • /
    • pp.19-25
    • /
    • 2020
  • In frequent pattern mining, the uncertainty of each item is accompanied by a loss of information. AAlso, in real environment, the importance of patterns changes with time, so fuzzy logic must be applied to meet these requirements and the dynamic characteristics of the importance of patterns should be considered. In this paper, we propose a fuzzy utility mining technique for extracting frequent web page sets from web log databases through fuzzy utility-based web page set mining. Here, the downward closure characteristic of the fuzzy set is applied to remove a large space by the minimum fuzzy utility threshold (MFUT)and the user-defined percentile(UDP). Extensive performance analyses show that our algorithm is very efficient and scalable for Fuzzy Utility Mining using dynamic weights.

Performance Evaluation of the FP-tree and the DHP Algorithms for Association Rule Mining (FP-tree와 DHP 연관 규칙 탐사 알고리즘의 실험적 성능 비교)

  • Lee, Hyung-Bong;Kim, Jin-Ho
    • Journal of KIISE:Databases
    • /
    • v.35 no.3
    • /
    • pp.199-207
    • /
    • 2008
  • The FP-tree(Frequency Pattern Tree) mining association rules algorithm was proposed to improve mining performance by reducing DB scan overhead dramatically, and it is recognized that the performance of it is better than that of any other algorithms based on different approaches. But the FP-tree algorithm needs a few more memory because it has to store all transactions including frequent itemsets of the DB. This paper implements a FP-tree algorithm on a general purpose UNK system and compares it with the DHP(Direct Hashing and Pruning) algorithm which uses hash tree and direct hash table from the point of memory usage and execution time. The results show surprisingly that the FP-tree algorithm is poor than the DHP algorithm in some cases even if the system memory is sufficient for the FP-tree. The characteristics of the test data are as follows. The site of DB is look, the number of total items is $1K{\sim}7K$, avenrage length of transactions is $5{\sim}10$, avergage size of maximal frequent itemsets is $2{\sim}12$(these are typical attributes of data for large-scale convenience stores).

Structure-based Clustering for XML Document Retrieval (XML 문서 검색을 위한 구조 기반 클러스터링)

  • Hwang Jeong Hee;Ryu Keun Ho
    • The KIPS Transactions:PartD
    • /
    • v.11D no.7 s.96
    • /
    • pp.1357-1366
    • /
    • 2004
  • As the importance or XML is increasing to manage information and exchange data efficiently in the web, there are on going works about structural integration and retrieval. The XML. document with the defined structure can retrieve the structure through the DTD or XML schema, but the existing method can't apply to XML. documents which haven't the structure information. Therefore. in this paper we propose a new clus-tering technique at a basic research which make it possible to retrieve structure fast about the XML documents that haven't the structure information. We first estract the feature of frequent structure from each XML document. And we cluster based on the similar structure by con-sidering the frequent structure as representative structure of the XML document, which makes it possible to retrieve the XML document raster than dealing with the whole documents that have different structure. And also we perform the structure retrieval about XML documents based on the clusters which is the group of similar structure. Moreover, we show efficiency of proposed method to describe how to apply the structure retrieval as well as to display the example of application result.

Enzyme Metabolite Analysis Using Data Mining (데이터 마이닝을 활용한 효소 대사물의 분석)

  • Ceong, Hyi-Thaek;Park, Chun-Goo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.11 no.10
    • /
    • pp.969-982
    • /
    • 2016
  • Recently, the researches to discovery drug candidates from natural herbs have received considerable attention. In human body, enzyme mostly metabolize the compounds of natural herbs. In this study, we analysis the enzyme interactions using assoication mining. We get this data from BRENDA(: BRaunschweig ENzyme DAtabase) system. Based on enzyme interaction model, we divide the metabolites into substrate metabolites, product metabolites, inhibitor metabolites, and activating metabolites. We then compose substrate metabolite transaction, product metabolite transaction with each metabolites and enzyme interaction transaction with all metabolites. Also we take account of organism for each transactions. We mine frequent metabolites and patterns from six transactions using association rule mining. And we analysis the relationship among metabolites. As a result, we identify the distributions and patterns of metabolites consist in enzyme interactions. We found that metabolites include in only substrate are identified and have very low supports. This results can be useful to develop the effective metabolism prediction model for compounds of natural herbs.

Designn and Implementation Online Customer Reviews Analysis System based on Dependency Network Model (종속성 네트워크 기반의 온라인 고객리뷰 분석시스템 설계 및 구현)

  • Kim, Keun-Hyung
    • The Journal of the Korea Contents Association
    • /
    • v.10 no.11
    • /
    • pp.30-37
    • /
    • 2010
  • It is very important to analyze online customer reviews, which are small documents of writing opinions or experiences about products or services, for both customers and companies because the customers can get good informations and the companies can establish good marketing strategies. In this paper, we did not propose only dependency network model which is tool for analyzing online customer reviews, but also designed and implemented the system based on the dependency network model. The dependency network model analyzes both subjective and objective sentences, so that it can represent relative importance and relationship between the nouns in the sentences. In the result of implementing, we recognized that relative importance and relationship between the features of products or services, which can not be mined by opinion mining, can be represented by the dependency network model.

Discovering Temporal Relation Considering the Weight of Events in Multidimensional Stream Data Environment (다차원 스트림 데이터 환경에서 이벤트 가중치를 고려한 시간 관계 탐사)

  • Kim, Jae-In;Kim, Dae-In;Song, Myung-Jin;Han, Dae-Young;Hwang, Bu-Hyun
    • The Journal of the Korea Contents Association
    • /
    • v.10 no.2
    • /
    • pp.99-110
    • /
    • 2010
  • An event means a flow which has a time attribute such as a symptom of patient. Stream data collected by sensors can be summarized as an interval event which has a time interval between the start-time point and the end-time point in multiple stream data environment. Most of temporal mining techniques have considered only the frequent events. However, these approaches may ignore the infrequent event even if it is important. In this paper, we propose a new temporal data mining that can find association rules for the significant temporal relation based on interval events in multidimensional stream data environment. Our method considers the weight of events and stream data on the sensing time point of abnormal events. And we can discover association rules on the significant temporal relation regardless of the occurrence frequency of events. The experimental analysis has shown that our method provide more useful knowledge than other conventional methods.

The Goods Recommendation System based on modified FP-Tree Algorithm (변형된 FP-Tree를 기반한 상품 추천 시스템)

  • Kim, Jong-Hee;Jung, Soon-Key
    • Journal of the Korea Society of Computer and Information
    • /
    • v.15 no.11
    • /
    • pp.205-213
    • /
    • 2010
  • This study uses the FP-tree algorithm, one of the mining techniques. This study is an attempt to suggest a new recommended system using a modified FP-tree algorithm which yields an association rule based on frequent 2-itemsets extracted from the transaction database. The modified recommended system consists of a pre-processing module, a learning module, a recommendation module and an evaluation module. The study first makes an assessment of the modified recommended system with respect to the precision rate, recall rate, F-measure, success rate, and recommending time. Then, the efficiency of the system is compared against other recommended systems utilizing the sequential pattern mining. When compared with other recommended systems utilizing the sequential pattern mining, the modified recommended system exhibits 5 times more efficiency in learning, and 20% improvement in the recommending capacity. This result proves that the modified system has more validity than recommended systems utilizing the sequential pattern mining.

Lack of p53 Gene Nucleotide Change in Mutation Hot Spots During HeLa Cell Apoptosis by Adriamycin (아드리아마이신에 의한 HeLa 세포의 자살 과정 중 p53 유전자의 돌연변이 빈발 부위에서의 핵산 변화의 부재)

  • Ryu, Seung-Wook;Kim, Jung-Woo;Kim, Eun-Hee
    • The Journal of Natural Sciences
    • /
    • v.9 no.1
    • /
    • pp.31-37
    • /
    • 1997
  • Apoptosis is an important event in the anticancer drug therapy. p53 was demonstrated to serve a key component to lead tumor cell death by inducing apoptosis. However, recent study showed the presence of p53 independent apoptotic pathway (Gaftenhaus et al., 1996). We were curious to know it apoptosis induced by adriamycin, a genotoxic anticancer agent, involved p53 gene mutation. Thus this study investigated the p53 gene mutation status among HeLa cell population during apoptosis induced by adriamycin. Under our experimental condition, 12 hour treatment of 1 ${\mu}m$ adriamycin caused apoptosis which was monitored by DNA fragmentation assay. In order to see the p53 gene mutation status, exons of 5, 7 and 8 of p53 gene, where previously reported p53 mutation hot spots reside, were amplified by PCR and nucleotide sequence change was scanned. However, no nucleotide change was observed among apoptotic HeLa cell population. Therefore this study demonstrated that adriamycin induced apoptosis without causing p53 gene damage.

  • PDF

An Efficient Data Mining Algorithm based on the Database Characteristics (데이터 베이스 특성에 따른 효율적인 데이터 마이닝 알고리즘)

  • Park, Ji-Hyun;Koh, Chan
    • Journal of the Korean Society for Industrial and Applied Mathematics
    • /
    • v.10 no.1
    • /
    • pp.107-119
    • /
    • 2006
  • Recently with developments of an internet and web techniques, the amount of data that are stored in database is increasing rapidly. So the range of adaption in database has been expanded and a research of Data Mining techniques finding useful skills from the huge database has been progressed. Many original algorithms have been developed by cutting down the item set and the size of database isn't required in the entire course of creating frequent item sets. Although those skills could save time in some course, it requires too much time for adapting those techniques in other courses. In this paper, an algorithm is proposed. In an Transaction Database that the length of it's transactions are short or the number of items are relatively small, this algorithm scans a database once by using a Hashing Technique and at the same time, stores all parts of the set, can be appeared at each transaction, in an Hash-table. So without an influence of n minimum percentage of support, it can discover a set of frequent items in more shorter time than the time what is used by an original algorithm.

  • PDF