• Title/Summary/Keyword: big tree


IRFP-tree (Intersection Rule Based FP-tree): An FP-tree Applying an Intersection-Rule-Based Paradigm to Improve Memory Efficiency

  • Lee, Jung-Hun
    • KIPS Transactions on Software and Data Engineering / v.5 no.3 / pp.155-164 / 2016
  • Tree-based frequent-pattern mining algorithms that compensate for the disadvantages of the Apriori method have been studied extensively for analyzing large databases. In a frequent-pattern tree, the number of nodes determines memory allocation and also affects memory consumption and the processing speed of pattern growth, so reducing the number of nodes is very important in frequent-pattern mining. However, the absolute criteria used to order transaction items when constructing the tree lower the compression ratio of the tree nodes, and most frequency-based tree construction methods adopt such criteria. The FP-tree is a typical frequent-pattern tree: an extended prefix-tree structure that stores compressed, crucial information about frequent patterns. To construct it, the frequent items in every transaction are sorted by an absolute criterion, descending frequency order. The CanTree likewise requires an absolute criterion, a canonical order, to construct the tree. In this paper, we propose the IRFP-tree (Intersection Rule based FP-tree) algorithm, a novel frequent-pattern tree construction method that does not use absolute criteria. The IRFP-tree is built with a new paradigm, the intersection rule, instead of absolute criteria. It increases the compression ratio of the tree nodes and reduces the tree construction time. The method has the additional advantage of supporting incremental mining. The reported test results demonstrate the applicability and effectiveness of the proposed approach.
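
The baseline that the abstract contrasts against, a classic FP-tree whose items are ordered by descending frequency, can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the IRFP-tree method itself (the abstract does not specify the intersection rule). The names `Node`, `build_fp_tree`, and `count_nodes` are hypothetical, and ties in frequency are broken alphabetically purely for determinism.

```python
from collections import Counter


class Node:
    """One FP-tree node: an item, its count, and its children keyed by item."""

    def __init__(self, item, parent=None):
        self.item = item
        self.count = 0
        self.parent = parent
        self.children = {}


def build_fp_tree(transactions, min_support):
    """Build a plain FP-tree using descending-frequency item order (assumed baseline)."""
    # First pass: count how many transactions contain each item.
    freq = Counter(item for t in transactions for item in set(t))
    frequent = {i: c for i, c in freq.items() if c >= min_support}

    root = Node(None)
    # Second pass: insert each transaction as a path of its frequent items.
    for t in transactions:
        # The "absolute criterion": sort by descending frequency (ties by name).
        items = sorted(
            (i for i in set(t) if i in frequent),
            key=lambda i: (-frequent[i], i),
        )
        node = root
        for item in items:
            child = node.children.get(item)
            if child is None:
                child = Node(item, node)
                node.children[item] = child
            child.count += 1  # shared prefixes reuse nodes and only bump counts
            node = child
    return root


def count_nodes(node):
    """Count every node below `node` (the root itself is not counted)."""
    return sum(1 + count_nodes(c) for c in node.children.values())


transactions = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b", "c"]]
root = build_fp_tree(transactions, min_support=2)
print(count_nodes(root))
```

The node count returned by `count_nodes` is the quantity the abstract says drives memory use; a different item ordering can change how many prefixes are shared and therefore how many nodes are created.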

Evaluation of Predictive Models for Early Identification of Dropout Students

  • Lee, JongHyuk;Kim, Mihye;Kim, Daehak;Gil, Joon-Min
    • Journal of Information Processing Systems / v.17 no.3 / pp.630-644 / 2021
  • Educational data analysis is attracting increasing attention with the rise of the big data industry. The amounts and types of learning data available are increasing steadily, and the information technology required to analyze these data continues to develop. The early identification of potential dropout students is very important; education is important in terms of social movement and social achievement. Here, we analyze educational data and generate predictive models for student dropout using logistic regression, a decision tree, a naïve Bayes method, and a multilayer perceptron. The multilayer perceptron model using independent variables selected via the variance analysis showed better performance than the other models. In addition, we experimentally found that not only grades but also extracurricular activities were important in terms of preventing student dropout.

Iowa Liquor Sales Data Predictive Analysis Using Spark

  • Ankita Paul;Shuvadeep Kundu;Jongwook Woo
    • Asia pacific journal of information systems / v.31 no.2 / pp.185-196 / 2021
  • This paper aims to analyze and predict liquor sales in the state of Iowa by applying machine learning algorithms to build predictive models. We use Azure ML and Spark ML for the predictive analysis, representing a legacy machine learning (ML) system and a Big Data ML system, respectively. We work with the Iowa liquor sales dataset, which comprises records from 2012 to 2019 in 24 columns and approximately 1.8 million rows. We conclude by comparing the models built with different algorithms, and their accuracy in predicting sales, across Azure ML and Spark ML. We find that the Linear Regression model has the highest precision and Decision Forest Regression has the fastest computing time on the sample data set using the legacy Azure ML system. The Decision Tree Regression model in Spark ML has the highest accuracy and the quickest computing time on the entire data set using the Big Data Spark system.

Tree-Ring Growth Characteristics of Zelkova serrata Makino after Replanting on the Reclaimed Land from the sea in Gwangyang Bay

  • Kim Do-Gyun
    • Journal of the Korean Institute of Landscape Architecture / v.33 no.6 s.113 / pp.40-50 / 2006
  • This study examined the tree-ring growth characteristics of Zelkova serrata Makino after replanting, with the aim of informing built-up planting grounds that keep landscape trees stable on land reclaimed from the sea. The main factors affecting the growth of Zelkova serrata Makino were replanting stress and drought. Growth reduction due to replanting and drought occurred in the replanting year and the following year. The mean sensitivity (year-to-year variation) and the coefficient of variation (tree-to-tree variation in a given year) of the tree rings of Zelkova serrata Makino were higher at the poor soil sites than at the favourable ones. The poor soil sites were the ground filled with improved soil, the ground covered with improved soil, and the top of large mounds, rather than the mounded ground sites; soil hardness, alkaline soil, high $Na^+$ and $K^+$, low $Ca^{2+}$ and $Mg^{2+}$, and T-C were the most critical factors. We suggest developing techniques for built-up planting grounds that provide stability on land reclaimed from the sea. Such planting grounds should take into account the use of soil with suitable physical and chemical properties, a high-level planting foundation, and the prevention of soil disturbance.

Prefetch R-tree: A Disk and Cache Optimized Multidimensional Index Structure

  • Park Myung-Sun
    • The KIPS Transactions: Part D / v.13D no.4 s.107 / pp.463-476 / 2006
  • R-trees have traditionally been optimized for I/O performance, with the disk page as the tree node. Recently, researchers have proposed cache-conscious variations of R-trees optimized for CPU cache performance in main memory environments, where the node size is several cache lines wide and more entries are packed into a node by compressing MBR keys. However, because the node sizes of the two types of R-trees differ greatly, disk-optimized R-trees show poor cache performance while cache-optimized R-trees exhibit poor disk performance. In this paper, we propose a cache and disk optimized R-tree, called the PR-tree (Prefetching R-tree). For cache performance, the node size of the PR-tree is wider than a cache line, and the prefetch instruction is used to reduce the number of cache misses. For I/O performance, the nodes of the PR-tree are fitted into one disk page. We present a detailed analysis of cache misses for range queries, and enumerate all the reasonable in-page leaf and nonleaf node sizes and heights of in-page trees to determine the tree parameters that give the best cache and I/O performance. The proposed PR-tree achieves better cache performance than the disk-optimized R-tree: an improvement by a factor of 3.5-15.1 for one-by-one insertions, 6.5-15.1 for deletions, 1.3-1.9 for range queries, and 2.7-9.7 for k-nearest neighbor queries. None of the experimental results shows a notable decline in I/O performance.

Trends of Several Air Pollutants and the Effects of Ozone on the Plant Antioxidant system in Platanus occidentalis in Korea

  • Woo, Su-Young
    • Journal of Korean Society of Forest Science / v.95 no.2 / pp.183-187 / 2006
  • This study investigated the concentrations of several air pollutants and compared antioxidative enzyme activities in Platanus occidentalis, because this species is one of the most widespread street trees in Korea and has been exposed to ambient air pollutants during its growing periods. The purpose of the study was to identify the relationship between air pollution and antioxidant enzyme activities in the trees. $O_3$, $NO_2$, CO and $SO_2$ concentrations in several cities in Korea were compared over the last decade. Among the air pollutants, $O_3$ and $NO_2$ concentrations in six big cities in Korea showed similar increasing trends during this period. In contrast, $SO_2$ and CO concentrations in the same cities decreased dramatically between 1994 and 2005. Platanus occidentalis trees were examined for ascorbate peroxidase (APX) and glutathione reductase (GR) activity. Ozone exposure generally increased the APX and GR activities of tree seedlings, which is a typical compensatory strategy of stressed trees.

Study on the Application of Decision Trees for Personalization based on e-CRM

  • 양정희;한서정
    • Journal of the Korea Safety Management & Science / v.5 no.3 / pp.107-119 / 2003
  • Expectations of and interest in e-CRM are rising as a means of more efficient online customer management, including in electronic commerce. Decision trees are useful as a data mining technique for e-CRM. In this paper, the representative decision tree techniques CART, C4.5, and CHAID are compared from a personalization point of view through an experiment on actual customer data. Based on this analysis, a new decision tree system with a major advantage in personalization is proposed. The new system offers the following advantages. First, it can build a qualitatively superior personalization model by adding a weight value for each individual. Second, it can supply more personalized information to customers. Third, it can achieve higher customer loyalty than other sites in similar types of business. Fourth, it can reduce marketing and decision-making costs. Fifth, smooth communication with customers who use the personalized service makes it possible to learn what they want and to raise the value of goods and services, starting from their quality.

Query Optimization on Large Scale Nested Data with Service Tree and Frequent Trajectory

  • Wang, Li;Wang, Guodong
    • Journal of Information Processing Systems / v.17 no.1 / pp.37-50 / 2021
  • Query applications based on nested data, the most commonly used form of data representation on the web, are becoming more widely used, especially for precise queries. MapReduce, a distributed architecture with parallel computing power, provides a good solution for big data processing. In practice, however, query requests are usually concurrent, which causes bottlenecks in server processing. To solve this problem, this paper first combines a column storage structure and an inverted index to build an index for nested data on MapReduce. On this basis, it puts forward an optimization strategy that combines a query execution service tree with frequent sub-query trajectories to reduce the response time of frequent queries and further improve the efficiency of multi-user concurrent queries on large-scale nested data. Experiments show that this method greatly improves the efficiency of nested data queries.

Management Improvement of Big and Old Trees in the Byeol-seo Scenic Sites

  • Lee, Jong-Bum;Lee, Chang-Hun;Choi, Byoung-Jae;Lee, Jae-Keun
    • Journal of the Korean Institute of Traditional Landscape Architecture / v.31 no.1 / pp.98-107 / 2013
  • Big and old trees in scenic spots with the attributes of remote villas are vulnerable to man-made damage and very sensitive to the external environment, such as soil conditions, so corresponding management plans are required. This study was therefore conducted to survey the big and old trees in scenic remote villas and to suggest suitable management plans. The results can be summarized as follows. First, regarding tree health above the ground, tree deformation, branch death, and bark death are closely related to tree vigor. In particular, areas receiving many visitors require prompt countermeasures for dried and dead trees above the ground and for the areas where they occur, as well as safety measures for visitors and facilities. Second, regarding the soil environment, visitor traffic is closely related to tree vigor. In 15 remote villa gardens, 64% of trees are exposed to heavy traffic, and tree vigor has declined as the number of visitors has increased. Positive consideration should therefore be given to installing complementary facilities and planting herbaceous plants in congested areas, to form a ground surface that can tolerate heavy visitor traffic. Third, remote gardens are generally located adjacent to ponds and mountain streams, so trees in waterfront areas require prompt countermeasures against growth decline caused by excess moisture in the soil. Further, blockage of the sewage system by heavy rain dampens the surrounding soil, which results in lethal damage to the trees. The waterfront areas and sewage system therefore need maintenance before and after the rainy season.
In addition, medium- to long-term management policies need to be established by recognizing the importance of the main trees of remote villa gardens in scenic spots, and a tree management manual should be prepared according to the attributes of each area. We strongly suggest producing manuals for systematic management, carrying out extensive PR activities and education for the long-term preservation of the trees, and securing the budget and manpower for the research and development of a systematic management system.

Measurement and estimation of transpiration from an evergreen broad-leaved forest in Japan

  • Hirose, Shigeki;Humagai, Tomo′omi;Kumi, Atsushi;Takeuchi, Shin′ichi;Otsuki, Kyoichi;Ogawa, Shigeru
    • Proceedings of the Korea Water Resources Association Conference / 2001.05a / pp.52-59 / 2001
  • Methods to measure and estimate the transpiration of a forest composed of evergreen broad-leaved trees (Pasania edulis Makino) are studied. Heat pulse velocity has been measured along with soil moisture and micrometeorological factors at the Fukuoka Experimental Forest, the Research Institute of Kyushu University Forests in Fukuoka, Japan (33$^{\circ}$38'N, 130$^{\circ}$31'E, alt. 75 m). A tree cutting measurement was conducted to convert the heat pulse velocity into sap flow and transpiration. A big leaf model for calculating transpiration and interception loss is examined, and the estimated values are compared with the measured values obtained from the heat pulse measurement. The results show that 1) Pasania edulis Makino, which possesses a radial pore structure, had relatively high water content and high heat pulse velocity even within the central part of the stem near the pith, 2) the heat pulse velocity corresponded well to the water uptake in the tree cutting measurement, 3) the estimation of sap flow based on the heat pulse velocity is accurate, and 4) the big leaf model, using parameters obtained from one day of summer measurements with a portable photosynthesis system, gives a reasonable estimate of transpiration independent of season and weather.
