• Title/Summary/Keyword: Tree Modeling

Search Result 337, Processing Time 0.021 seconds

A Case Study on Machine Learning Applications and Performance Improvement in Learning Algorithm (기계학습 응용 및 학습 알고리즘 성능 개선방안 사례연구)

  • Lee, Hohyun;Chung, Seung-Hyun;Choi, Eun-Jung
    • Journal of Digital Convergence
    • /
    • v.14 no.2
    • /
    • pp.245-258
    • /
    • 2016
  • This paper aims to present the way to bring about significant results through performance improvement of learning algorithm in the research applying to machine learning. Research papers showing the results from machine learning methods were collected as data for this case study. In addition, suitable machine learning methods for each field were selected and suggested in this paper. As a result, SVM for engineering, decision-making tree algorithm for medical science, and SVM for other fields showed their efficiency in terms of their frequent use cases and classification/prediction. By analyzing cases of machine learning application, general characterization of application plans is drawn. Machine learning application has three steps: (1) data collection; (2) data learning through algorithm; and (3) significance test on algorithm. Performance is improved in each step by combining algorithm. Ways of performance improvement are classified as multiple machine learning structure modeling, $+{\alpha}$ machine learning structure modeling, and so forth.

An Approach for Integrated Modeling of Protein Data using a Fact Constellation Schema and a Tree based XML Model (Fact constellation 스키마와 트리 기반 XML 모델을 적용한 실험실 레벨의 단백질 데이터 통합 기법)

  • Park, Sung-Hee;Li, Rong-Hua;Ryu, Keun-Ho
    • The KIPS Transactions:PartD
    • /
    • v.11D no.3
    • /
    • pp.519-532
    • /
    • 2004
  • With the explosion of bioinformatics data such proteins and genes, biologists need a integrated system to analyze and organize large datasets that interact with heterogeneous types of biological data. In this paper, we propose a integration system based on a mediated data warehouse architecture using a XML model in order to combine protein related data at biology laboratories. A fact constellation model in this system is used at a common model for integration and an integrated schema it translated to a XML schema. In addition, to track source changes and provenance of data in an integrated database employ incremental update and management of sequence version. This paper shows modeling of integration for protein structures, sequences and classification of structures using the proposed system.

A Novel Test Scheduling Algorithm Considering Variations of Power Consumption in Embedded Cores of SoCs (시스템 온 칩(system-on-a-chip) 내부 코어들의 전력소모 변화를 고려한 새로운 테스트 스케쥴링 알고리듬 설계)

  • Lee, Jae-Min;Lee, Ho-Jin;Park, Jin-Sung
    • Journal of Digital Contents Society
    • /
    • v.9 no.3
    • /
    • pp.471-481
    • /
    • 2008
  • Test scheduling considering power dissipation is an effective technique to reduce the testing time of complex SoCs and to enhance fault coverage under limitation of allowed maximum power dissipation. In this paper, a modeling technique of test resources and a test scheduling algorithm for efficient test procedures are proposed and confirmed. For test resources modeling, two methods are described. One is to use the maximum point and next maximum point of power dissipation in test resources, the other one is to model test resources by partitioning of them. A novel heuristic test scheduling algorithm, using the extended-tree-growing-graph for generation of maximum embedded cores usable simultaneously by using relations between test resources and cores and power-dissipation-changing-graph for power optimization, is presented and compared with conventional algorithms to verify its efficiency.

  • PDF

Modelling Analysis of Climate and Soil Depth Effects on Pine Tree Dieback in Korea Using BIOME-BGC (BIOME-BGC 모형을 이용한 국내 소나무 고사의 기후 및 토심 영향 분석)

  • Kang, Sinkyu;Lim, Jong-Hwan;Kim, Eun-Sook;Cho, Nanghyun
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.18 no.4
    • /
    • pp.242-252
    • /
    • 2016
  • A process-based ecosystem model, BIOME-BGC, was applied to simulate seasonal and inter-annual dynamics of carbon and water processes for potential evergreen needleleaf forest (ENF) biome in Korea. Two simulation sites, Milyang and Unljin, were selected to reflect warm-and-dry and cool-and-wet climate regimes, where massive diebacks of pines including Pinus densiflora, P. koraiensis and P thunbergii, were observed in 2009 and 2014, respectively. Standard Precipitation Index (SPI) showed periodic drought occurrence at every 5 years or so for both sites. Since mid-2000s, droughts occurred with hotter climate condition. Among many model variables, Cpool (i.e., a temporary carbon pool reserving photosynthetic compounds before allocations for new tissue production) was identified as a useful proxy variable of tree carbon starvation caused by reduction of gross primary production (GPP) and/or increase of maintenance respiration (Rm). Temporal Cpool variation agreed well with timings of pine tree diebacks for both sites. Though water stress was important, winter- and spring-time warmer temperature also played critical roles in reduction of Cpool, especially for the cool-and-wet Uljin. Shallow soil depth intensified the drought effect, which was, however, marginal for soil depth shallower than 0.5 m. Our modeling analysis implicates seasonal drought and warmer climate can intensify vulnerability of ENF dieback in Korea, especially for shallower soils, in which multi-year continued stress is of concern more than short-term episodic stress.

Exploration of CHAID Algorithm by Sampling Proportion

  • Park, Hee-Chang;Cho, Kwang-Hyun
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2003.10a
    • /
    • pp.215-228
    • /
    • 2003
  • Decision tree algorithms are used extensively for data mining in many domains such as retail target marketing, fraud dection, data reduction and variable screening, interaction effect identification, category merging and discretizing continuous variable, etc. CHAID(Chi-square Automatic Interaction Detector), is an exploratory method used to study the relationship between a dependent variable and a series of predictor variables. CHAID modeling selects a set of predictors and their interactions that optimally predict the dependent measure. In this paper we explore CHAID algorithm in view of accuracy and speed by sampling proportion.

  • PDF

Environmental Consciousness Data Modeling by Association Rules

  • Park, Hee-Chang;Cho, Kwang-Hyun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.16 no.3
    • /
    • pp.529-538
    • /
    • 2005
  • Data mining is the method to find useful information for large amounts of data in database. It is used to find hidden knowledge by massive data, unexpectedly pattern, relation to new rule. The methods of data mining are association rules, decision tree, clustering, neural network and so on. Association rule mining searches for interesting relationships among items in a riven large data set. Association rules are frequently used by retail stores to assist in marketing, advertising, floor placement, and inventory control. There are three primary quality measures for association rule, support and confidence and lift. We analyze Gyeongnam social indicator survey data using association rule technique for environmental information discovery. We can use to environmental preservation and environmental improvement by association rule outputs.

  • PDF

A Web GPS based Logistics Vehicle Control Management System using MVC Design Patterns (MVC 디자인 패턴을 활용한 Web GPS 기반의 물류차량 출하 관제 시스템)

  • Sim, Choon Bo;Kim, Kyoung Jong
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.6 no.1
    • /
    • pp.131-142
    • /
    • 2010
  • In this paper, we propose a web GPS based logistics vehicle control management system using MVC design patterns. The proposed system is designed by applying design patterns of object oriented modeling called mini-architecture to enhance reliability of software as well as promote stability of overall system design. In addition, we can get a position information by means of the GPS embedded in PDA and communicate between client and monitoring server using CDMA network so that the position of client can be identified directly by the map service. The system provides an moving object indexing technique which extends the existing TB-tree to manage and retrieve a transporting trajectory of logistics efficiently. Finally, with development of the logistics vehicle control service called WG-LOGICS system, we can verify the usefulness of our system which is able for monitoring a vehicle preparation, allocating registration, loading a burden, transfer path, and destination arrival in real world.

Multi-Resolution Representation of Solid Models using the Selective Boolean Operations (선택적 불리안 연산자를 이용한 솔리드 모델의 다중해상도 구현)

  • 이상헌;이강수;박상근
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 2002.05a
    • /
    • pp.833-835
    • /
    • 2002
  • In this paper, we propose multi-resolutional representation of B-rep solid models using the selective Boolean operations on non-manifold geometric models. Since the union and subtraction operations of the selective Boolean operations are commutative, the integrity of the model is guaranteed for reordering design features. A multi-resolution representation is established using a non-manifold merged set model and a feature modeling tree reordered according to some criterion of level of detail (LOD). Then, a solid model for a specified LOD can be extracted from this multi-resolution model using the selective Boolean operations.

  • PDF

AUTOMATIC IDENTIFICATION OF ROOF TYPES AND ROOF MODELING USING LIDAR

  • Kim, Heung-Sik;Chang, Hwi-Jeong;Cho, Woo-Sug
    • Proceedings of the KSRS Conference
    • /
    • 2005.10a
    • /
    • pp.83-86
    • /
    • 2005
  • This paper presents a method for point-based 3D building reconstruction using LiDAR data and digital map. The proposed method consists of three processes: extraction of building roof points, identification of roof types, and 3D building reconstruction. After extracting points inside the polygon of building, the ground surface, wall and tree points among the extracted points are removed through the filtering process. The filtered points are then fitted into the flat plane using ODR(Orthogonal Distance Regression). If the fitting error is within the predefined threshold, the surface is classified as a flat roof. Otherwise, the surface is fitted and classified into a gable or arch roof through RMSE analysis. Based on the roof types identified in automated fashion, the 3D building reconstruction is performed. Experimental results showed that the proposed method classified successfully three different types of roof and that the fusion of LiDAR data and digital map could be a feasible method of modelling 3D building reconstruction.

  • PDF

DEVELOPING THE REFORESTRATION SIMULATION SYSTEM USING 3D GIS

  • Jo Myung-Hee;Jo Yun-Won
    • Proceedings of the KSRS Conference
    • /
    • 2005.10a
    • /
    • pp.721-724
    • /
    • 2005
  • In this study the spatial distribution characters of forest in forest damaged area were first considered by analyzing spatial data and monitoring forest landscape. Then suitable tree species on each site were selected through the weighted score analysis of GIS analysis methods. Finally, the best forest stand arrangement method could be presented on the 3D based simulation system for the advanced reforestation technology in Korea. For this purpose, the virtual reforestation system was implemented by using the concept of virtual GIS and CBD (Component Based Development) method. By use of this system the change offorest landscape of burnt forest area some years after reforestation practice could be detected and monitored by applying the site index and 3D modeling method.

  • PDF