• Title/Summary/Keyword: Pruning Strategy

Search Result 24, Processing Time 0.029 seconds

WIS: Weighted Interesting Sequential Pattern Mining with a Similar Level of Support and/or Weight

  • Yun, Un-Il
    • ETRI Journal
    • /
    • v.29 no.3
    • /
    • pp.336-352
    • /
    • 2007
  • Sequential pattern mining has become an essential task with broad applications. Most sequential pattern mining algorithms use a minimum support threshold to prune the combinatorial search space. This strategy provides basic pruning; however, it cannot mine correlated sequential patterns with similar support and/or weight levels. If the minimum support is low, many spurious patterns having items with different support levels are found; if the minimum support is high, meaningful sequential patterns with low support levels may be missed. We present a new algorithm, weighted interesting sequential (WIS) pattern mining based on a pattern growth method in which new measures, sequential s-confidence and w-confidence, are suggested. Using these measures, weighted interesting sequential patterns with similar levels of support and/or weight are mined. The WIS algorithm gives a balance between the measures of support and weight, and considers correlation between items within sequential patterns. A performance analysis shows that WIS is efficient and scalable in weighted sequential pattern mining.

  • PDF

EXTENDED ONLINE DIVISIVE AGGLOMERATIVE CLUSTERING

  • Musa, Ibrahim Musa Ishag;Lee, Dong-Gyu;Ryu, Keun-Ho
    • Proceedings of the KSRS Conference
    • /
    • 2008.10a
    • /
    • pp.406-409
    • /
    • 2008
  • Clustering data streams has an importance over many applications like sensor networks. Existing hierarchical methods follow a semi fuzzy clustering that yields duplicate clusters. In order to solve the problems, we propose an extended online divisive agglomerative clustering on data streams. It builds a tree-like top-down hierarchy of clusters that evolves with data streams using geometric time frame for snapshots. It is an enhancement of the Online Divisive Agglomerative Clustering (ODAC) with a pruning strategy to avoid duplicate clusters. Our main features are providing update time and memory space which is independent of the number of examples on data streams. It can be utilized for clustering sensor data and network monitoring as well as web click streams.

  • PDF

Splitting Algorithm Using Total Information Gain for a Market Segmentation Problem

  • Kim, Jae-Kyeong;Kim, Chang-Kwon;Kim, Soung-Hie
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.18 no.2
    • /
    • pp.183-203
    • /
    • 1993
  • One of the most difficult and time-consuming stages in the development of the knowledge-based system is a knowledge acquisition. A splitting algorithm is developed to infer a rule-tree which can be converted to a rule-typed knowledge. A market segmentation may be performed in order to establish market strategy suitable to each market segment. As the sales data of a product market is probabilistic and noisy, it becomes necessary to prune the rule-tree-at an acceptable level while generating a rule-tree. A splitting algorithm is developed using the pruning measure based on a total amount of information gain and the measure of existing algorithms. A user can easily adjust the size of the resulting rule-tree according to his(her) preferences and problem domains. The algorithm is applied to a market segmentation problem of a medium-large computer market. The algorithm is illustrated step by step with a sales data of a computer market and is analyzed.

  • PDF

Optimal Evasive Maneuver for Sea Skimming Missiles against Close-In Weapon System (근접방어무기체계에 대한 함대함 유도탄의 최적회피기동)

  • Whang, Ick-Ho
    • Proceedings of the KIEE Conference
    • /
    • 2002.07d
    • /
    • pp.2096-2098
    • /
    • 2002
  • In this paper, the optimal evasive maneuver strategies for typical subsonic ASM(anti-ship missile) to reach its target ship with high survivability against CIWS(close in weapon system) are studied. The optimal evasive maneuver input is defined by the homing command optimizing the cost function which takes aiming errors of CIWS into account. The optimization problem for the effective evasive maneuver is formulated based on a simple missile dynamics model and a CIWS model. By means of solving the problem, a multiple hypotheses testing method is proposed. Since this method requires generation of too many hypotheses, the hypothesis-pruning technique is adopted. The solution shows that the optimal evasive maneuver is a bang-bane shaped command whose frequency is varied by the aimpoint determination strategy in CIWS.

  • PDF

An Efficient Mining for Closed Frequent Sequences (효율적인 닫힌 빈발 시퀀스 마이닝)

  • Kim, Hyung-Geun;Whang, Whan-Kyu
    • Journal of Industrial Technology
    • /
    • v.25 no.A
    • /
    • pp.163-173
    • /
    • 2005
  • Recent sequential pattern mining algorithms mine all of the frequent sequences satisfying a minimum support threshold in a large database. However, when a frequent sequence becomes very long, such mining will generate an explosive number of frequent sequence, which is prohibitively expensive in time. In this paper, we proposed a novel sequential pattern algorithm using only closed frequent sequences which are small subset of very large frequent sequences. Our algorithm extends the sequence by depth-first search strategy with effective pruning. Using bitmap representation of underlying databases, we can obtain a closed frequent sequence considerably faster than the currently reported methods.

  • PDF

Optimized Neurocontroller for Human Control Skill Transfer

  • Seo, Kap-Ho;Changmok Oh;Lee, Ju-Jang
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2001.10a
    • /
    • pp.42.3-42
    • /
    • 2001
  • A human is an expert in manipulation. We have acquired skills to perform dexterous operations based upon knowledge and experience attained over a long period of time. It is important in robotics to understand these human skills, and utilize them to bring about better robot control and operation It is hoped that the neurocontroller can be trained and organized by simply presenting human teaching data, which implicate human intention, strategy and expertise. In designing a neurocontroller, we must determine the size of neurocontroller. Improper size may not only incur difficulties in training neural nets, e.g. no convergence, but also cause instability and erratic behavior in machines. Therefore, it is necessary to determine the proper size of neurocontroller for human control transfer. In this paper, a new pruning method is developed, based on the penalty-term methods. This method makes ...

  • PDF

Fast Quadtree Structure Decision for HEVC Intra Coding Using Histogram Statistics

  • Li, Yuchen;Liu, Yitong;Yang, Hongwen;Yang, Dacheng
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.9 no.5
    • /
    • pp.1825-1839
    • /
    • 2015
  • The final draft of the latest video coding standard, High Efficiency Video Coding (HEVC), was approved in January 2013. The coding efficiency of HEVC surpasses its predecessor, H.264/MPEG-4 Advanced Video Coding (AVC), by using only half of the bitrate to encode the same sequence with similar quality. However, the complexity of HEVC is sharply increased compared to H.264/AVC. In this paper, a method is proposed to decrease the complexity of intra coding in HEVC. Early pruning and an early splitting strategy are applied to the quadtree structure of coding tree units (CTU) and residual quadtree (RQT). According to our experiment, when our method is applied to sequences from Class A to Class E, the coding time is decreased by 44% at the cost of a 1.08% Bjontegaard delta rate (BD-rate) increase on average.

High Utility Itemset Mining over Uncertain Datasets Based on a Quantum Genetic Algorithm

  • Wang, Ju;Liu, Fuxian;Jin, Chunjie
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.8
    • /
    • pp.3606-3629
    • /
    • 2018
  • The discovered high potential utility itemsets (HPUIs) have significant influence on a variety of areas, such as retail marketing, web click analysis, and biological gene analysis. Thus, in this paper, we propose an algorithm called HPUIM-QGA (Mining high potential utility itemsets based on a quantum genetic algorithm) to mine HPUIs over uncertain datasets based on a quantum genetic algorithm (QGA). The proposed algorithm not only can handle the problem of the non-downward closure property by developing an upper bound of the potential utility (UBPU) (which prunes the unpromising itemsets in the early stage) but can also handle the problem of combinatorial explosion by introducing a QGA, which finds optimal solutions quickly and needs to set only very few parameters. Furthermore, a pruning strategy has been designed to avoid the meaningless and redundant itemsets that are generated in the evolution process of the QGA. As proof of the HPUIM-QGA, a substantial number of experiments are performed on the runtime, memory usage, analysis of the discovered itemsets and the convergence on real-life and synthetic datasets. The results show that our proposed algorithm is reasonable and acceptable for mining meaningful HPUIs from uncertain datasets.

Semantic Trajectory Based Behavior Generation for Groups Identification

  • Cao, Yang;Cai, Zhi;Xue, Fei;Li, Tong;Ding, Zhiming
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.12
    • /
    • pp.5782-5799
    • /
    • 2018
  • With the development of GPS and the popularity of mobile devices with positioning capability, collecting massive amounts of trajectory data is feasible and easy. The daily trajectories of moving objects convey a concise overview of their behaviors. Different social roles have different trajectory patterns. Therefore, we can identify users or groups based on similar trajectory patterns by mining implicit life patterns. However, most existing daily trajectories mining studies mainly focus on the spatial and temporal analysis of raw trajectory data but missing the essential semantic information or behaviors. In this paper, we propose a novel trajectory semantics calculation method to identify groups that have similar behaviors. In our model, we first propose a fast and efficient approach for stay regions extraction from daily trajectories, then generate semantic trajectories by enriching the stay regions with semantic labels. To measure the similarity between semantic trajectories, we design a semantic similarity measure model based on spatial and temporal similarity factor. Furthermore, a pruning strategy is proposed to lighten tedious calculations and comparisons. We have conducted extensive experiments on real trajectory dataset of Geolife project, and the experimental results show our proposed method is both effective and efficient.

Assessment of Carbon Storage Capacity of Stands in Abandoned Coal Mine Forest Rehabilitation Areas over time for its Development of Management Strategy (폐탄광 산림복구지 관리방안 도출을 위한 산림복구 후 시간경과에 따른 임분탄소저장량 평가)

  • Mun Ho Jung;Kwan In Park;Ji Hye Kim;Won Hyun Ji
    • Journal of Environmental Science International
    • /
    • v.32 no.4
    • /
    • pp.233-242
    • /
    • 2023
  • The objective of this study was to develop a management strategy for the recovery of carbon storage capacity of abandoned coal mine forest rehabilitation area. For the purpose, the biomass and stand carbon storage over time after the forest rehabilitation by tree type for Betula platyphylla, Pinus densiflora, and Alnus hirsuta trees which are major tree species widely planted for the forest rehabilitation in the abandoned coal mine were calculated, and compared them with general forest. The carbon storage in abandoned coal mine forest rehabilitation areas was lower than that in general forests, and based on tree species, Pinus densiflora stored 48.9%, Alnus hirsuta 41.1%, and Betula platyphylla 27.0%. This low carbon storage is thought to be caused by poor growth because soil chemical properties, such as low TOC and total nitrogen content, in the soil of abandoned coal mine forest rehabilitation areas, were adverse to vegetation growth compared to those in general forests. DBH, stand biomass, and stand carbon storage tended to increase after forest rehabilitation over time, whereas stand density decreased. Stand' biomass and carbon storage increased as DBH and stand density increased, but there was a negative correlation between stand density and DBH. Therefore, after forest rehabilitation, growth status should be monitored, an appropriate growth space for trees should be maintained by thinning and pruning, and the soil chemical properties such as fertilization must be managed. It is expected that the carbon storage capacity the forest rehabilitation area could be restored to a level similar to that of general forests.