• Title/Summary/Keyword: 의사결정나무 분석

Search Result 405, Processing Time 0.029 seconds

Interesting Node Finding Criteria for Regression Trees (회귀의사결정나무에서의 관심노드 찾는 분류 기준법)

  • 이영섭
    • The Korean Journal of Applied Statistics
    • /
    • v.16 no.1
    • /
    • pp.45-53
    • /
    • 2003
  • One of decision tree method is regression trees which are used to predict a continuous response. The general splitting criteria in tree growing are based on a compromise in the impurity between the left and the right child node. By picking or the more interesting subsets and ignoring the other, the proposed new splitting criteria in this paper do not split based on a compromise of child nodes anymore. The tree structure by the new criteria might be unbalanced but plausible. It can find a interesting subset as early as possible and express it by a simple clause. As a result, it is very interpretable by sacrificing a little bit of accuracy.

Eojeol Syntactic Tag Prediction of Korean Text using Entropy Guided CRF (엔트로피 지도 CRF를 이용한 한국어 어절 구문태그 예측)

  • Oh, Jin-Young;Cha, Jeong-Won
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.15 no.5
    • /
    • pp.395-399
    • /
    • 2009
  • In this work, we describe the syntactic tag prediction system for Korean using the decision tree and CRFs. Generally they select features by their intuition. It depends on their prior knowledge. In this works, we combine features systematically using the decision tree. We also analyze errors and optimize features for the best performance. From the result of experiments, we can see that the proposed method is effective for the syntactic tag estimation and will be helpful for the syntactic analysis.

A study on removal of unnecessary input variables using multiple external association rule (다중외적연관성규칙을 이용한 불필요한 입력변수 제거에 관한 연구)

  • Cho, Kwang-Hyun;Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.22 no.5
    • /
    • pp.877-884
    • /
    • 2011
  • The decision tree is a representative algorithm of data mining and used in many domains such as retail target marketing, fraud detection, data reduction, variable screening, category merging, etc. This method is most useful in classification problems, and to make predictions for a target group after dividing it into several small groups. When we create a model of decision tree with a large number of input variables, we suffer difficulties in exploration and analysis of the model because of complex trees. And we can often find some association exist between input variables by external variables despite of no intrinsic association. In this paper, we study on the removal method of unnecessary input variables using multiple external association rules. And then we apply the removal method to actual data for its efficiencies.

Prediction Model of Construction Safety Accidents using Decision Tree Technique (의사결정나무기법을 이용한 건설재해 사전 예측모델 개발)

  • Cho, Yerim;Kim, Yeon-Choel;Shin, Yoonseok
    • Journal of the Korea Institute of Building Construction
    • /
    • v.17 no.3
    • /
    • pp.295-303
    • /
    • 2017
  • Over the past 7 years, the number of victims of construction disasters has been gradually increasing. Compared with projects in other industries, construction projects are highly exposed to safety risks. For this reason, the research methods of predicting and managing the risk of construction disasters are urgently needed that can be applied to a construction site. This study aims to propose a prediction model for a construction disaster using the decision tree technique. The developed the model is reviewed the applicability by evaluating its accuracy based on disaster data. The top three of the prediction values obtained from the proposed model were enumerated, and then the cumulative accuracy were also calculated. The prediction accuracy was 40 percent for the first value, but the cumulative accuracy was 80 percent. Thus, as more disaster data was accumulated, the cumulative accuracy appeared to be higher. If utilized in construction sites, the model proposed in this study would contribute to a reduction in the rate of construction disasters.

An Analysis of the Characteristics of Companies introducing Smart Factory System Using Data Mining Technique (데이터 마이닝 기법을 활용한 스마트팩토리 도입 기업의 특성 분석)

  • Oh, Jeong-yoon;Choi, Sang-hyun
    • Journal of the Korea Convergence Society
    • /
    • v.9 no.5
    • /
    • pp.179-189
    • /
    • 2018
  • Currently, research on smart factories is steadily being carried out in terms of implementation strategies and considerations in construction. Various studies have not been conducted on companies that introduced smart factories. This study conducted a questionnaire survey for SMEs applying the basic stage of smart factory. And the cluster analysis was conducted to examine the characteristics of the company. In addition, we conducted Decision Tree and Naive Bay to examine how the characteristics of a company are derived and compare the results. As a result of the cluster analysis, it was confirmed that the group was divided into the high satisfaction group and the low satisfaction group. The decision tree and the Naive Bay analysis showed that the higher satisfaction group has high productivity.

Determinants of Satisfaction, Revisit Intention, and Recommendation Intention Using Decision Tree Analysis - Foreign Tourists Visiting Korea during the COVID-19 Pandemic - (의사결정나무분석을 활용한 방문 만족도, 재방문 의사, 타인 권유 의사 결정요인 분석 - 코로나19 상황에서의 한국 방문 외래관광객을 대상으로 -)

  • Won-Sik Kim
    • The Journal of the Convergence on Culture Technology
    • /
    • v.9 no.3
    • /
    • pp.129-136
    • /
    • 2023
  • The study aims to examine the determinants that affect satisfaction, revisit intention, and recommendation intention with foreign tourists who visited Korea despite the threat of COVID-19. This study employs the survey data collected by the Korea Tourism Organization from 8,135 foreign tourists who visited Korea in 2020. As the survey data contains a mixture of continuous and categorical variables, decision tree analysis can ensure analytical validity for the research. According to the analytical results, the determinants affecting satisfaction are the purpose of the visit and acceptance of self-quarantine during their stay. The factors influencing revisit intention are the purpose of the visit, frequency of the visit, and acceptance of self-quarantine during their stay. The determinants affecting recommendation intention are the purpose of the visit, length of stay, and gender. Based on the results of this analysis, this study not only explains the relationship between these determinants and tourism satisfaction, revisit intention, and recommendation intention, but also suggests implications for revitalizing tourism activities.

Data Mining-Based Performance Prediction Technology of Geothermal Heat Pump System (지열 히트펌프 시스템의 데이터 마이닝 기반 성능 예측 기술)

  • Hwang, Min Hye;Park, Myung Kyu;Jun, In Ki;Sohn, Byonghu
    • Transactions of the KSME C: Technology and Education
    • /
    • v.4 no.1
    • /
    • pp.27-34
    • /
    • 2016
  • This preliminary study investigated data mining-based methods to assess and predict the performance of geothermal heat pump(GHP) system. Data mining is a key process of the knowledge discovery in database (KDD), which includes five steps: 1) Selection; 2) Pre-processing; 3) Transformation; 4) Analysis(data mining); and 5) Interpretation/Evaluation. We used two analysis models, categorical and numerical decision tree models to ascertain the patterns of performance(COP) and electrical consumption of the GHP system. Prior to applying the decision tree models, we statistically analyzed measurement database to determine the effect of sampling intervals on the system performance. Analysis results showed that 10-min sampling data for the performance analysis had highest accuracy of 97.7% over the actual dataset of the GHP system.

Design Analysis of Current Density in Lithium Secondary Battery Using Data Mining Techniques (데이터 마이닝을 이용한 리튬 이차전지의 전류밀도 영향인자 분석)

  • Jeong, Dong Ho;Lee, Jongsoo;Choi, Ha-Young
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.38 no.6
    • /
    • pp.677-682
    • /
    • 2014
  • In the present study, a decision tree and artificial neural network were used to determine critical design parameters for lithium ion batteries and compare their performances. First, a design method that used a decision tree-artificial neural network model was used to determine the major design factors among early pole plate design factors that showed nonlinearity. Then, the artificial neural network was used to implement a weighted value analysis of the importance of the design factors and their effect on the current density. The second method involved the use of an artificial neural network model to construct artificial networks without separate determinations of the major early design factors to analyze the connections and weighted values related to the current density.

A Case Study on segmentation of Department Store using Decision Tree Analysis (의사결정나무 기법을 활용한 백화점의 고객세분화 사례연구)

  • Chae, Kyung-Hee;Kim, Sang-Cheol
    • Journal of Distribution Science
    • /
    • v.8 no.1
    • /
    • pp.13-19
    • /
    • 2010
  • Segmentation, targeting, and positioning are marketing tools used by a company to gain competitive advantage in the market. For an accurate segmentation, various statistics models or datamining techniques are used. Especially, datamining techniques are introduced in the beginning of the 1980s and solved several marketing problems effectively. In this paper, we research about datamining technique for segmentation and analyze customer's transaction data of Department Store using Decision Tree Analysis, one of the dataming technique. After that, we discuss effects and advantages of segmentation using Decision Tree.

  • PDF

Decision Tree Algorithm with Improved Entropy Using an Expert Opinion (전문가 의견을 반영하는 향상된 의사결정나무의 엔트로피 기법)

  • Bak, Sun-Bin;Kim, Dong-Moon;Yoon, Tae-Bok;Lee, Jee-Hyong
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2007.11a
    • /
    • pp.239-242
    • /
    • 2007
  • 최근 데이터의 양이 많아지고 다양해짐에 따라서 데이터를 활용하기 위한 데이터 마이닝에 관한 관심이 중대되고 있다. 데이터 분석을 위한 수집 데이터에는 수집 과정에서 분석가가 원치 않은 데이터 잡음이 발생하는 경우가 있고 그 데이터가 다른 데이터들과 같은 가중치로 데이터 마이닝에 반영되는 경우 예상과 다른 결과를 얻을 수 있다. 따라서 데이터 분석 시 데이터와 전문가 의견이 고려된 데이터 엔트로피(Entropy)를 사용하여 잡음 데이터를 다를 필요가 있다. 본 논문에서는 전문가의견을 이용한 전문가 의견 목록을 만들고 이를 데이터와 비교하여 유사한 정도에 따라 각 데이터에 가중치를 부여한다. 그리고 이 데이터를 활용한 의사결정나무(Decision Tree)를 사용하여 기존 데이터를 이용한 의사결정나무 보다 데이터 잡음의 영향을 줄이는 방법을 제안한다. 제안한 방법은 학습자의 학습 활동에서 수집된 학습 행위 데이터를 사용하여 실험하였다.

  • PDF