• Title/Summary/Keyword: Frequent Structure

Search Result 428, Processing Time 0.027 seconds

An Efficient Tree Structure Method for Mining Association Rules (트리 구조를 이용한 연관규칙의 효율적 탐색)

  • Kim, Chang-Oh;Ahn, Kwang-Il;Kim, Seong-Jip;Kim, Jae-Yearn
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.27 no.1
    • /
    • pp.30-36
    • /
    • 2001
  • We present a new algorithm for mining association rules in the large database. Association rules are the relationships of items in the same transaction. These rules provide useful information for marketing. Since Apriori algorithm was introduced in 1994, many researchers have worked to improve Apriori algorithm. However, the drawback of Apriori-based algorithm is that it scans the transaction database repeatedly. The algorithm which we propose scans the database twice. The first scanning of the database collects frequent length l-itemsets. And then, the algorithm scans the database one more time to construct the data structure Common-Item Tree which stores the information about frequent itemsets. To find all frequent itemsets, the algorithm scans Common-Item Tree instead of the database. As scanning Common-Item Tree takes less time than scanning the database, the algorithm proposed is more efficient than Apriori-based algorithm.

  • PDF

An Efficient Approach to Mining Maximal Contiguous Frequent Patterns from Large DNA Sequence Databases

  • Karim, Md. Rezaul;Rashid, Md. Mamunur;Jeong, Byeong-Soo;Choi, Ho-Jin
    • Genomics & Informatics
    • /
    • v.10 no.1
    • /
    • pp.51-57
    • /
    • 2012
  • Mining interesting patterns from DNA sequences is one of the most challenging tasks in bioinformatics and computational biology. Maximal contiguous frequent patterns are preferable for expressing the function and structure of DNA sequences and hence can capture the common data characteristics among related sequences. Biologists are interested in finding frequent orderly arrangements of motifs that are responsible for similar expression of a group of genes. In order to reduce mining time and complexity, however, most existing sequence mining algorithms either focus on finding short DNA sequences or require explicit specification of sequence lengths in advance. The challenge is to find longer sequences without specifying sequence lengths in advance. In this paper, we propose an efficient approach to mining maximal contiguous frequent patterns from large DNA sequence datasets. The experimental results show that our proposed approach is memory-efficient and mines maximal contiguous frequent patterns within a reasonable time.

The Management Method for Preventing Frequent Defect about Each Inspection Part from The Data Analysis of Pre-qualification of Residential Building Reconstruction (주택 재건축 예비평가 자료분석을 통한 분야별 주요결함의 예방·조치방안)

  • Kim, chun hag;Bae, cheol hak
    • Proceedings of the Korean Institute of Building Construction Conference
    • /
    • 2008.11a
    • /
    • pp.181-186
    • /
    • 2008
  • The purpose of this study is analyzing the frequent defect about each inspection part of Pre-qualification of residential building reconstruction and finding the cause of occurrence so that can supply a solution of the problem during the plalnning and constructure. The frequent defects are naturally following the deterioration and more occur in the structure body than a skin. The meaning that supply the cause and the managing method for preventing frequent defects about each inspection part is as follows. To prevent the defects, it is needed that the choice of the appropriate materials, giving a sufficient space for facilities, reliable constructure for the part of expecting the defects. The quality of life will go up due to making a good dwelling environment and extending the life cycle of structures from the inspection and managing the part of expecting defects.

  • PDF

Query Optimization on Large Scale Nested Data with Service Tree and Frequent Trajectory

  • Wang, Li;Wang, Guodong
    • Journal of Information Processing Systems
    • /
    • v.17 no.1
    • /
    • pp.37-50
    • /
    • 2021
  • Query applications based on nested data, the most commonly used form of data representation on the web, especially precise query, is becoming more extensively used. MapReduce, a distributed architecture with parallel computing power, provides a good solution for big data processing. However, in practical application, query requests are usually concurrent, which causes bottlenecks in server processing. To solve this problem, this paper first combines a column storage structure and an inverted index to build index for nested data on MapReduce. On this basis, this paper puts forward an optimization strategy which combines query execution service tree and frequent sub-query trajectory to reduce the response time of frequent queries and further improve the efficiency of multi-user concurrent queries on large scale nested data. Experiments show that this method greatly improves the efficiency of nested data query.

Structure-based Clustering for XML Document Retrieval (XML 문서 검색을 위한 구조 기반 클러스터링)

  • Hwang Jeong Hee;Ryu Keun Ho
    • The KIPS Transactions:PartD
    • /
    • v.11D no.7 s.96
    • /
    • pp.1357-1366
    • /
    • 2004
  • As the importance or XML is increasing to manage information and exchange data efficiently in the web, there are on going works about structural integration and retrieval. The XML. document with the defined structure can retrieve the structure through the DTD or XML schema, but the existing method can't apply to XML. documents which haven't the structure information. Therefore. in this paper we propose a new clus-tering technique at a basic research which make it possible to retrieve structure fast about the XML documents that haven't the structure information. We first estract the feature of frequent structure from each XML document. And we cluster based on the similar structure by con-sidering the frequent structure as representative structure of the XML document, which makes it possible to retrieve the XML document raster than dealing with the whole documents that have different structure. And also we perform the structure retrieval about XML documents based on the clusters which is the group of similar structure. Moreover, we show efficiency of proposed method to describe how to apply the structure retrieval as well as to display the example of application result.

Clustering XML Documents Considering The Weight of Large Items in Clusters (클러스터의 주요항목 가중치 기반 XML 문서 클러스터링)

  • Hwang, Jeong-Hee
    • The KIPS Transactions:PartD
    • /
    • v.14D no.1 s.111
    • /
    • pp.1-8
    • /
    • 2007
  • As the web document of XML, an exchange language of data in the advanced Internet, is increasing, a target of information retrieval becomes the web documents. Therefore, there we researches on structure, integration and retrieval of XML documents. This paper proposes a clustering method of XML documents based on frequent structures, as a basic research to efficiently process query and retrieval. To do so, first, trees representing XML documents are decomposed and we extract frequent structures from them. Second, we perform clustering considering the weight of large items to adjust cluster creation and cluster cohesion, considering frequent structures as items of transactions. Third, we show the excellence of our method through some experiments which compare which the previous methods.

Sliding Ceiling Panel Structure (미닫이 천정판 구조)

  • Gook, Seung-Uk
    • Proceedings of the KSR Conference
    • /
    • 2011.10a
    • /
    • pp.1478-1485
    • /
    • 2011
  • Subway stations have a characteristic that the facilities in the ceiling should be checked, replaced or expanded frequently. The frequent opening and closing of ceiling panels result in the damage of panels which makes them easily disengaged. In order to resolve this issue, which is giving difficulty in the maintenance of the stations, this study suggest a sliding ceiling panel structure which can be opened and closed with sliding mechanism, instead of current fixed type ceiling panels. The sliding ceiling panel will prevent the damage of ceiling panels due to frequent disengagement and engagement. It will increase the durability of ceiling panel and extend the cycle of panel replacement. Accordingly, the maintenance cost will be also reduced.

  • PDF

Pattern mining for large distributed dataset: A parallel approach (PMLDD)

  • Pal, Amrit;Kumar, Manish
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.11
    • /
    • pp.5287-5303
    • /
    • 2018
  • Handling vast amount of data found in large transactional datasets is an obvious challenge for the conventional data mining algorithms. Addressing this challenge, our paper proposes a parallel approach for proper decomposition of mining problem into sub-problems in order to find frequent patterns from these datasets. The proposed, Pattern Mining for Large Distributed Dataset (PMLDD) approach, ensures minimum dependencies as well as minimum communications among sub-problems. It establishes a linear aggregation of the intermediate results so that it can be adapted to large-scale programming models like MapReduce. In this context, an algorithmic structure for MapReduce programming model is presented. PMLDD guarantees an efficient load balancing among the sub-problems by a specific selection criterion. Further, it optimizes the number of required iterations over the dataset for mining frequent patterns as compared to the existing approaches. Finally, we believe that our approach is scalable enough to handle larger datasets in terms of performance evaluation, and the result analysis justifies all these mentioned concerns.

Ownership Structure and Syndicated Loan Maturity

  • Lee, Sang-Whi
    • The Korean Journal of Financial Management
    • /
    • v.25 no.3
    • /
    • pp.155-173
    • /
    • 2008
  • Controlling for the impacts of main strands of debt maturity theories, we highlight the relationship between syndicated loan maturity and ownership structure of Korean borrowers. We find that as the ownership of large shareholders increases, the maturity of syndicated loans also increases. Additionally, we identify a negative relation between foreigners' ownership and loan maturity, indicating that foreign institutional investors serve valuable monitoring functions; as their equity shares increase, they fully take advantage of frequent renewals through the short maturity of syndicated loan. We also show that the predicted value of leverage is more systematically and positively related to the maturity of syndicated loan.

  • PDF

A New Ac-to-Dc Power Converter for a Load with Frequent Short Circuits (부하단락이 빈번히 발생하는 경우에 적합한 교류-직류 전력변환기)

  • No, Ui-Cheol;Kim, In-Dong
    • The Transactions of the Korean Institute of Electrical Engineers B
    • /
    • v.48 no.7
    • /
    • pp.384-390
    • /
    • 1999
  • This paper describes a new ac-to-dc power converter using a multilevel converter. A conventional multilevel ac-to-dc converter has large output dc filter capacitors. When a short circuit happens in a load, the stored energy in the capacitors should be discharged through the load with a high short circuit current. The high current may cause considerable damage to the capacitors and the load. The output dc capacitors of the proposed converter do not discharge even under load short circuit condition. In the case of a load short circuit, the capacitors become a floating state immediately and remain in the state. Then the stored capacitor energy is supplied to the load again as soon as the short circuit has been cleared. Therefore, the rising time of the load voltage can be significantly reduced. This feature satisfies the requirement of a power supply for a load with frequent short circuits. The proposed converter has the characteristics of a simplified structure, a reduced cost, weight, and volume compared with conventional power supplies with frequent output short circuits. Experimental results are presented to verify the usefulness of the proposed converter.

  • PDF