• Title/Summary/Keyword: 의사결정나무 분석

Search Result 405, Processing Time 0.028 seconds

연관분석을 이용한 데이터마이닝 기법에 관한 사례연구

  • Ryu, Gwi-Yeol;Mun, Yeong-Su;Choi, Seung-Du
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2006.04a
    • /
    • pp.109-120
    • /
    • 2006
  • Huge information has been made due to the current computing environment and could not be acceptable. People want the information which they can understand and accept easily. They may want not only simple information but also knowledge. That is why data mining becomes a center of information. We use RFM analysis in order to create customer score. Customers are classified into five groups(most oxcellenrexcellenycommoflowerilowest) for a various marketing activities. We can found the significant patterns in each group, and classify customers from loyal customers to leaving customers in the near future by the indirect data mining(e.g. association analysis) and the direct data mining(e.g. decision tree, logistic regression analysis, etc.), which are named in this study. Our research focuses on the advanced models by applying the association rules in data mining. Our results indicate that the indirect data mining and the direct data mining seem to have same outputs, but the former shows more clear pattern then the latter one.

  • PDF

Inflow and outflow analysis of double majors using social network analysis (사회 연결망 분석을 이용한 복수전공 유입 및 유출 분석)

  • Cho, Jang-Sik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.4
    • /
    • pp.693-701
    • /
    • 2012
  • Recently, the number of students who get double majors has tended to increase in many universities. As results, many problems occur because immoderate inflow of double-major students is concentrated in a specific popular department. In this paper, we study the characteristic of inflow and outflow of double majors using social network analysis and decision tree analysis. According to the results, SAT score affected the inflow of double majors the most. Additionally, department category, course evaluation score, employment rate also affected the inflow of double majors in the order named. On the other hand, department category affected the outflow of double majors the most. Additionally, SAT score, employment rate, course evaluation score also affected the outflow of double majors in the order named.

A Study on Walking Analysis and Disease Prediction with Decision Tree (의사결정나무를 통한 걸음걸이 분석 및 질병 예측에 관한 연구)

  • Kim, Young-Jae;Yoo, Kwan-Hee;Nasridinov, Aziz
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2017.04a
    • /
    • pp.822-825
    • /
    • 2017
  • 본 연구는 키넥트를 통해 사람의 걸음걸이를 측정하고 의사결정트리(Decision Tree)를 통해 분석함으로써 현재의 걸음걸이를 통해 측정자의 허리 또는 무릎에서 발생할 가능성이 높은 문제 또는 질병들을 예측하고 해당결과를 측정자에게 알린다. 본 연구를 진행하며 첫 번째 단계에서는 관련 논문이나 병원 자료 결과들을 통해 판별할 속성들을 정하였다. 두 번째 단계에서는 키넥트를 통해 측정한 실제 데이터를 적용하기에 앞서 첫 번째 단계에서 정한 속성들이 측정자의 문제 또는 질병들을 판단해내는 연관 정도가 높은지 테스트 데이터를 이용하였고 의사결정나무를 통해 분석하였다. 그 결과 7개의 속성 중 6개로 약 85.7%정도의 연관이 있었다. 마지막 세 번째 단계에서는 판별식을 세우고 실제 데이터들을 쌓아나가며 69명의 측정한 데이터를 분석한 결과 6개의 속성 중 5개의 속성이 허리와 연관정도가 높았고 이는 두 번째 단계에서 나왔던 결과인 약85.7%에 가까운 약83%의 결과가 도출되었다. 이를 기반으로 시스템을 개발해 나가며 판별 정확도를 향상시키기 위해 계속 측정해 데이터를 쌓아가고 관련된 식들의 문제점을 보완하며 또한 어떤 환경에서 키넥트의 측정값의 정확도가 올라가는지 연구할 예정이다.

Predictors of Protective Factors for Depression in Adolescent using Decision Making Tree Analysis (의사결정나무분석을 이용한 청소년 우울의 보호요인 예측모형)

  • Kim, Bo-Young
    • The Journal of the Korea Contents Association
    • /
    • v.15 no.5
    • /
    • pp.375-385
    • /
    • 2015
  • The study is to develop specific strategies to prevent adolescents' depression, early detection and intervention services. This study was a descriptive research study with the purpose of predictors of protective factors for depression in adolescent using decision making tree analysis. The subjects for the study were 485 student in G city. This study collected data between September 23, 2013 and September 26, 2013 and analyzed them with frequency analysis, percentage, the mean and standard deviation, ${\chi}^2$-test, t-test, and a decision-making tree by using SPSS 20.0 program. From the data analysis, the predictive model for protective factors related to depression in adolescent with 4 pathways, 12 nodes. The common predicting variables of depression in adolescent were characteristics, family cohesion, parent adolescent communication, peer communication. The specialty of training data and test data was 76.0% and 65.4%. The sensitivity of training data was 78.2% and 63.7%. As for the classification accuracy, training data and test data explained 70.1% and 69.7%. Parent adolescent communication and peer communication to decrease depression of Korean middle and high school students are necessary. This study should contribute as baseline data for intervention strategies and planning ability of depression prevention in adolescents.

Changes in Export Gateways and the Spatial Patterns of Korean Agri-Food Exports: A Classification and Regression Tree Analysis Approach (수출 관문의 변화와 한국 농식품 수출의 공간적 패턴 분석: 의사결정나무 분석의 적용)

  • Hyun, Kisoon
    • Journal of the Economic Geographical Society of Korea
    • /
    • v.21 no.2
    • /
    • pp.90-106
    • /
    • 2018
  • This study suggests a gateway strategy for transporting agri-food exports to expand exports after examining the patterns of Korean agri-food exports by commodities and the role of export gateways. Korean agri-food exports have increased, but processed food exports have increased significantly compared to fresh agricultural products during the last 17 years. More importantly, Busan port is the main agri-food export hub in Korea. Under these circumstances, this paper examines the determinants of processed cereal-based food (HS 19) exports through Busan port using classification and regression tree (CART) analysis. As a result, the main factors that help to predict the real value of Korean exports are the GDP of the export destination countries, their distances from Korea and their GNI per capita. The destinations of Korean agri-food exports are finally classified into eight groups, which reveals the characteristics of clusters and provides useful insights for the strategies to expand agri-food exports.

Mother's Perceived Infant Smartphone Over-immersion Prediction Model: Data Mining Decision Tree Analysis (어머니가 지각한 유아의 스마트폰 과의존 예측모형 탐색: 데이터마이닝 의사결정나무 분석 활용)

  • Jung, Ji-Sook;Oh, Jung-A
    • Journal of the Korea Convergence Society
    • /
    • v.11 no.5
    • /
    • pp.97-105
    • /
    • 2020
  • This study was to identify the main predictors of smartphone overdependece of infants perceived by mothers and to provide basic data useful for education and practice. For this, data-mining decision tree analysis was performed using SPSS program, and the fianl 410 researches were used for analysis. The results. In the case of the whole infants, the most important predictor of smartphone dependence was father's leisure activity parenting participation. For boys, their father's leisure activity was the most dependent on their smartphone dependence. However, even if father's participation in leisure activities was high, smartphone overdependence increased again when mother's overprotection and permissive attitude were high. Finally, For girls, the most influential variable on smartphone dependence was warmth and encouragement among mothers' parenting attitudes.

Estimating the determinants of victory and defeat through analyzing records of Korean pro-basketball (한국남자프로농구 경기기록 분석을 통한 승패결정요인 추정: 2010-2011시즌, 2011-2012시즌 정규리그 기록 적용)

  • Kim, Sae-Hyung;Lee, Jun-Woo;Lee, Mi-Sook
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.5
    • /
    • pp.993-1003
    • /
    • 2012
  • The purpose of this study was to estimate the determinants of victory and defeat through analyzing records of Korean men pro-basketball. Statistical models of victory and defeat were established by collecting present basketball records (2010-2011, 2011-2012 season). Korea Basketball League (KBL) informs records of every pro-basketball game data. The six offence variables (2P%, 3P%, FT%, OR, AS, TO), and the four defense variables (DR, ST, GD, BS) were used in this study. PASW program was used for logistic regression and Answer Tree program was used for the decision tree. All significance levels were set at .05. Major results were as follows. In the logistic regression, 2P%, 3P%, and TO were three offense variables significantly affecting victory and defeat, and DR, ST, and BS were three significant defense variables. Offensive variables 2P%, 3P%, TO, and AS are used in constructing the decision tree. The highest percentage of victory was 80.85% when 2P% was in 51%-58%, 3P% was more than 31 percent, and TO was less than 11 times. In the decision tree of the defence variables, the highest percentage of victory was 94.12% when DR was more than 24, ST was more than six, and BS was more than two times.

Improving the Performance of Supervised Learning Models using Error Pattern Modeling (오차패턴 모델링을 이용한 지도학습 모형에서의 성능 향상)

  • Heo, Jun;Kim, Jong-U
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 2005.05a
    • /
    • pp.280-286
    • /
    • 2005
  • 본 논문은 이분형 목적변수를 가지는 데이터에서, 의사결정나무나 신경망과 같은 지도 학습(Supervised Learning)의 훈련을 통한 각종 예측 및 분류 정확도를 향상시키기 위해서 오차 패턴을 이용한 새로운 Hybrid 데이터 마이닝 기법을 제안한다. 오차 패턴을 이용한 Hybrid 기법이란 데이터 마이닝의 서로 다른 기법을 각 데이터에 적용한 다음 기법간의 불일치되는 부분만을 다시 패턴화 하여, 이를 최종 모형에 적용하여, 기존에 1개의 방법만을 사용하였을 경우보다, 더욱 좋은 정확도를 가질 수 있도록 하는 방법이다. 본 기법의 검증을 위하여, 10개의 실제 검증용 자료를 사용하였으며, 분석 결과 신경망과 의사결정나무 분석과 같은 기존의 방법보다 전체적으로 예측력이 향상됨을 보였다.

  • PDF

An Application of Data-Mining Tool in Fraud Pension Payment Prediction (데이터마이닝을 이용한 국민연금 부정수급 예측모형 개발 - 손해배상금 불성실 신고를 대상으로 -)

  • Cha, Kyung-Yup
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.1
    • /
    • pp.1-8
    • /
    • 2010
  • This study tested the applicability of a Data mining tool in the analysis of massive National Pension data for the purpose of developing fraud pension payment prediction model. This study is identified significant variables for fraud pension payment through the statistical analysis process and developed prediction models using data mining methodology.

데이터 마이닝에서 배깅과 부스팅 알고리즘 비교 분석

  • Lee, Yeong-Seop;O, Hyeon-Jeong
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2003.05a
    • /
    • pp.97-102
    • /
    • 2003
  • 데이터 마이닝의 여러 기법중 모형의 변동성을 줄이고 정확도가 높은 분류자를 형성하기 위하여 다양한 앙상블 기법이 연구되고 있다. 그 중에서 배깅과 부스팅 방법이 가장 널리 알려져 있다. 여러 가지 데이터에 이 두 방법을 적용하여 오분류율을 구하여 비교한 후 각 데이터 특성을 입력변수로 하고 배깅과 부스팅 중 더 낮은 오분류율을 갖는 알고리즘을 목표변수로 하여 의사결정나무를 형성하였다. 이를 통해서 배깅과 부스팅 알고리즘이 어떠한 데이터 특성의 패턴이 존재하는지 분석한 결과 부스팅 알고리즘은 관측치, 입력변수, 목표변수 수가 큰 것이 적합하고 반면에 배깅 알고리즘은 관측치, 입력변수, 목표변수 수의크기가 작은 것이 적합함을 알 수 있었다.

  • PDF