• Title/Summary/Keyword: 의사결정나무 분석

Search Result 407, Processing Time 0.029 seconds

A Study on the Analysis Effect Factors of Illegal Parking Using Data Mining Techniques (데이터마이닝 기법을 활용한 불법주차 영향요인 분석)

  • Lee, Chang-Hee;Kim, Myung-Soo;Seo, So-Min
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.13 no.4
    • /
    • pp.63-72
    • /
    • 2014
  • With the rapid development in the economy and other fields as well, the standard of living in South Korea has been improved, and consequently, the demand of automobiles has quickly increased. It leads to various traffic issues such as traffic congestion, traffic accident, and parking problem. In particular, this illegal parking caused by the increase in the number of automobiles has been considered one of the main reasons to bring about traffic congestion as intensifying any dispute between neighbors in relation to a parking space, which has been also coming to the fore as a social issue. Therefore, this study looked into Daejeon Metropolitan City, the city that is understood to have the highest automobile sharing rate in South Korea but with relatively few cases of illegal parking crackdowns. In order to investigate the theoretical problems of the illegal parking, this study conducted a decision-making tree model-based Exhaustive CHAID analysis to figure out not only what makes drivers park illegally when they try to park vehicles but also those factors that would tempt the drivers into the illegal parking. The study, then, comes up with solutions to the problem. According to the analysis, in terms of the influential factors that encourage the drivers to park at some illegal areas, it was learned that these factors, the distance, a driver's experience of getting caught, the occupation and the use time in order, have an effect on the drivers' deciding to park illegally. After working on the prediction model, four nodes were finally extracted. Given the analysis result, as a solution to the illegal parking, it is necessary to establish public parking lots additionally and first secure the parking space for the vehicles used for living and working, and to activate the campaign for enhancing illegal parking crackdown and encouraging civic consciousness.

A Study on the Turbidity Estimation Model Using Data Mining Techniques in the Water Supply System (데이터마이닝 기법을 이용한 상수도 시스템 내의 탁도 예측모형 개발에 관한 연구)

  • Park, No-Suk;Kim, Soonho;Lee, Young Joo;Yoon, Sukmin
    • Journal of Korean Society of Environmental Engineers
    • /
    • v.38 no.2
    • /
    • pp.87-95
    • /
    • 2016
  • Turbidity is a key indicator to the user that the 'Discolored Water' phenomenon known to be caused by corrosion of the pipeline in the water supply system. 'Discolored Water' is defined as a state with a turbidity of the degree to which the user visually be able to recognize water. Therefore, this study used data mining techniques in order to estimate turbidity changes in water supply system. Decision tree analysis was applied in data mining techniques to develop estimation models for turbidity changes in the water supply system. The pH and residual chlorine dataset was used as variables of the turbidity estimation model. As a result, the case of applying both variables(pH and residual chlorine) were shown more reasonable estimation results than models only using each variable. However, the estimation model developed in this study were shown to have underestimated predictions for the peak observed values. To overcome this disadvantage, a high-pass filter method was introduced as a pretreatment of estimation model. Modified model using high-pass filter method showed more exactly predictions for the peak observed values as well as improved prediction performance than the conventional model.

Convergence Study in Development of Severity Adjustment Method for Death with Acute Myocardial Infarction Patients using Machine Learning (머신러닝을 이용한 급성심근경색증 환자의 퇴원 시 사망 중증도 보정 방법 개발에 대한 융복합 연구)

  • Baek, Seol-Kyung;Park, Hye-Jin;Kang, Sung-Hong;Choi, Joon-Young;Park, Jong-Ho
    • Journal of Digital Convergence
    • /
    • v.17 no.2
    • /
    • pp.217-230
    • /
    • 2019
  • This study was conducted to develop a customized severity-adjustment method and to evaluate their validity for acute myocardial infarction(AMI) patients to complement the limitations of the existing severity-adjustment method for comorbidities. For this purpose, the subjects of KCD-7 code I20.0 ~ I20.9, which is the main diagnosis of acute myocardial infarction were extracted using the Korean National Hospital Discharge In-depth Injury survey data from 2006 to 2015. Three tools were used for severity-adjustment method of comorbidities : CCI (charlson comorbidity index), ECI (Elixhauser comorbidity index) and the newly proposed CCS (Clinical Classification Software). The results showed that CCS was the best tool for the severity correction, and that support vector machine model was the most predictable. Therefore, we propose the use of the customized method of severity correction and machine learning techniques from this study for the future research on severity adjustment such as assessment of results of medical service.

Machine-learning Approaches with Multi-temporal Remotely Sensed Data for Estimation of Forest Biomass and Forest Reference Emission Levels (시계열 위성영상과 머신러닝 기법을 이용한 산림 바이오매스 및 배출기준선 추정)

  • Yong-Kyu, Lee;Jung-Soo, Lee
    • Journal of Korean Society of Forest Science
    • /
    • v.111 no.4
    • /
    • pp.603-612
    • /
    • 2022
  • The study aims were to evaluate a machine-learning, algorithm-based, forest biomass-estimation model to estimate subnational forest biomass and to comparatively analyze REDD+ forest reference emission levels. Time-series Landsat satellite imagery and ESA Biomass Climate Change Initiative information were used to build a machine-learning-based biomass estimation model. The k-nearest neighbors algorithm (kNN), which is a non-parametric learning model, and the tree-based random forest (RF) model were applied to the machine-learning algorithm, and the estimated biomasses were compared with the forest reference emission levels (FREL) data, which was provided by the Paraguayan government. The root mean square error (RMSE), which was the optimum parameter of the kNN model, was 35.9, and the RMSE of the RF model was lower at 34.41, showing that the RF model was superior. As a result of separately using the FREL, kNN, and RF methods to set the reference emission levels, the gradient was set to approximately -33,000 tons, -253,000 tons, and -92,000 tons, respectively. These results showed that the machine learning-based estimation model was more suitable than the existing methods for setting reference emission levels.

Factors Associated with Successful Aging of Korean Older People Living in a City (일 도시 노인의 성공적인 노화 관련 요인)

  • Shin, Younghee;Lee, Hyejung
    • 한국노년학
    • /
    • v.29 no.4
    • /
    • pp.1327-1340
    • /
    • 2009
  • The purposes of the study were (1) to identify the level of successful aging of older people living in a city, (2) to identify associated factors with successful aging, and (3) to identify a risk group for successful aging using classification and regression trees (CART) analysis. One hundred eighty seven older people (>65years) participated in the cross-sectional survey. Trained interviewers collected data with a structured questionnaire on demographic information, Korean geriatric depression score, activity of daily living(ADL), instrumental activity of daily living(IADL), and Young's successful aging instrument in subject's home. A CART analysis split subjects into ten homogeneous small groups based on five determinant factors. Older people who are male, with higher education, living with family, and not receiving Medicaid showed better scores in successful aging than their counter parts. Depression was a strong primary determinant for successful aging. A risk group for successful aging of older people was identified by depression and IADL. An intervention to prevent and manage depression and to improve physical function of older people can be developed to promote successful aging of older people. It is suggested to consider an assessment of depression to develop the policies for older people welfare.

Verification Test of High-Stability SMEs Using Technology Appraisal Items (기술력 평가항목을 이용한 고안정성 중소기업 판별력 검증)

  • Jun-won Lee
    • Information Systems Review
    • /
    • v.20 no.4
    • /
    • pp.79-96
    • /
    • 2018
  • This study started by focusing on the internalization of the technology appraisal model into the credit rating model to increase the discriminative power of the credit rating model not only for SMEs but also for all companies, reflecting the items related to the financial stability of the enterprises among the technology appraisal items. Therefore, it is aimed to verify whether the technology appraisal model can be applied to identify high-stability SMEs in advance. We classified companies into industries (manufacturing vs. non-manufacturing) and the age of company (initial vs. non-initial), and defined as a high-stability company that has achieved an average debt ratio less than 1/2 of the group for three years. The C5.0 was applied to verify the discriminant power of the model. As a result of the analysis, there is a difference in importance according to the type of industry and the age of company at the sub-item level, but in the mid-item level the R&D capability was a key variable for discriminating high-stability SMEs. In the early stage of establishment, the funding capacity (diversification of funding methods, capital structure and capital cost which taking into account profitability) is an important variable in financial stability. However, we concluded that technology development infrastructure, which enables continuous performance as the age of company increase, becomes an important variable affecting financial stability. The classification accuracy of the model according to the age of company and industry is 71~91%, and it is confirmed that it is possible to identify high-stability SMEs by using technology appraisal items.

Determination of Fire Severity and Deduction of Influence Factors Through Landsat-8 Satellite Image Analysis - A Case Study of Gangneung and Donghae Forest Fires - (Landsat-8 위성영상 분석을 통한 산불피해 심각도 판정 및 영향 인자 도출 - 강릉, 동해 산불을 사례로 -)

  • Soo-Dong Lee;Gyoung-Sik Park;Chung-Hyeon Oh;Bong-Gyo Cho;Byeong-Hyeok Yu
    • Korean Journal of Environment and Ecology
    • /
    • v.38 no.3
    • /
    • pp.277-292
    • /
    • 2024
  • In order to manage large-scale forest fires concentrated in Gangwon-do and Gyeongsangbuk-do with severe topographical heterogeneity, a decision-making process through efficient and rapid damage assessment using satellite images is essential. Accordingly, this study targets a large-scale forest fire that ignited in Gangneung and the Donghae, Gangwon-do on March 5, 2022, and was extinguished around 19:00 on March 8, to estimate the fire severity using dNBR and derive environmental factors that affect the grade. As environmental factors, we quantified the regular vegetation index representing vegetation or fuel type, the forest index that classifies tree species, the regular moisture index representing moisture content, and DEM in relation to topography, and then analyzed the correlation with the fire severity. In terms of fire severity, the widest range was 'Unbured' at 52.4%, followed by low severity at 42.9%, medium-low severity at 4.3%, and medium-high severity at 0.4%. Environmental factors showed a negative correlation with dNDVI and dNDWI, and a positive correlation with slope. Regarding vegetation, the differences between coniferous, broad-leaved, and other groups in dNDVI, dNIWI, and slope, which were analyzed to affect the fire severity, were analyzed to be significant with p-value < 2.2e-16. In particular, the difference between coniferous and broad-leaved forests was clear, and it was confirmed that coniferous forest suffered more damage than broad-leaved forest due to the higher fire severity in the Gangwon-do region, including Pinus densiflora, which are dominant species, as well as P. koraiensis, P. rigida and P. thunbergii.

A Study on Strategy for success of tourism e-marketplace (관광 e-마켓플레이스의 성공전략에 관한 연구)

  • Hong, Ji-Whan;Kim, Keun-Hyung
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2006.11a
    • /
    • pp.333-336
    • /
    • 2006
  • E-marketplace is a kind of B2B e-Business system that supports business transactions among companies. If e-marketplace is revitalized, we expect not only the development of related industry but also decrease of transaction cost among companies. It is necessary for the introduction and revitalization of e-marketplace in tourist industry from this point of view. Participants of tour e-marketplace are tour-related companies(travel agencies, lodging enterprises, shipping enterprises, etc.). Also tourists want to search a variety of tour products or contents. So tour e-marketplace has characteristics of B2C e-Business systems as well as B2B e-Business systems at once. The purpose of this study is to classify success factors that determine characteristics of tour e-marketplace through statistics survey from e-marketplace factors related tourism websites. First of all, we analyze success factors of B2B and B2C e-marketplace. Then we will set up influence factors of tour e-marketplace and conduct a survey about success factors of tour e-marketplace. Therefore, we could expect to find these good attributes in tour e-marketplace success through logistic regression and decision tree analysis from source data.

  • PDF

Development of Sentiment Analysis Model for the hot topic detection of online stock forums (온라인 주식 포럼의 핫토픽 탐지를 위한 감성분석 모형의 개발)

  • Hong, Taeho;Lee, Taewon;Li, Jingjing
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.1
    • /
    • pp.187-204
    • /
    • 2016
  • Document classification based on emotional polarity has become a welcomed emerging task owing to the great explosion of data on the Web. In the big data age, there are too many information sources to refer to when making decisions. For example, when considering travel to a city, a person may search reviews from a search engine such as Google or social networking services (SNSs) such as blogs, Twitter, and Facebook. The emotional polarity of positive and negative reviews helps a user decide on whether or not to make a trip. Sentiment analysis of customer reviews has become an important research topic as datamining technology is widely accepted for text mining of the Web. Sentiment analysis has been used to classify documents through machine learning techniques, such as the decision tree, neural networks, and support vector machines (SVMs). is used to determine the attitude, position, and sensibility of people who write articles about various topics that are published on the Web. Regardless of the polarity of customer reviews, emotional reviews are very helpful materials for analyzing the opinions of customers through their reviews. Sentiment analysis helps with understanding what customers really want instantly through the help of automated text mining techniques. Sensitivity analysis utilizes text mining techniques on text on the Web to extract subjective information in the text for text analysis. Sensitivity analysis is utilized to determine the attitudes or positions of the person who wrote the article and presented their opinion about a particular topic. In this study, we developed a model that selects a hot topic from user posts at China's online stock forum by using the k-means algorithm and self-organizing map (SOM). In addition, we developed a detecting model to predict a hot topic by using machine learning techniques such as logit, the decision tree, and SVM. We employed sensitivity analysis to develop our model for the selection and detection of hot topics from China's online stock forum. The sensitivity analysis calculates a sentimental value from a document based on contrast and classification according to the polarity sentimental dictionary (positive or negative). The online stock forum was an attractive site because of its information about stock investment. Users post numerous texts about stock movement by analyzing the market according to government policy announcements, market reports, reports from research institutes on the economy, and even rumors. We divided the online forum's topics into 21 categories to utilize sentiment analysis. One hundred forty-four topics were selected among 21 categories at online forums about stock. The posts were crawled to build a positive and negative text database. We ultimately obtained 21,141 posts on 88 topics by preprocessing the text from March 2013 to February 2015. The interest index was defined to select the hot topics, and the k-means algorithm and SOM presented equivalent results with this data. We developed a decision tree model to detect hot topics with three algorithms: CHAID, CART, and C4.5. The results of CHAID were subpar compared to the others. We also employed SVM to detect the hot topics from negative data. The SVM models were trained with the radial basis function (RBF) kernel function by a grid search to detect the hot topics. The detection of hot topics by using sentiment analysis provides the latest trends and hot topics in the stock forum for investors so that they no longer need to search the vast amounts of information on the Web. Our proposed model is also helpful to rapidly determine customers' signals or attitudes towards government policy and firms' products and services.

Development of Needs Assessment tool and Extraction Algorithm Fitting for Individuals in Care Management for the disabled in Home (재가장애인 사례관리의 욕구사정 정확도 향상을 위한 사정도구 개발과 욕구추출 알고리즘 과정 연구 - 데이터 마이닝 분석기법을 활용하여 -)

  • Kim, Young-Sook;Jung, Kook-In
    • Korean Journal of Social Welfare
    • /
    • v.60 no.2
    • /
    • pp.155-173
    • /
    • 2008
  • The study aims to develop a assessment tool to provide the in-home disabled in a local community with appropriate services in consideration of physical, emotional, social and environmental circumstances. After collection of assesment data of 200 in-home disabled through use of the tool, a desire-extracting algorithm was developed to provide a service to real needs through the use of decision tree analysis on data mining. The study was conducted for Five months from June 2006 through October 2006, and it is divided into development of an assessment tool and extraction of real needs through the use of the tool. The basic framework of the development of the tool was established through the examination of related literature, the subjective satisfaction of the assessment tool and items were developed through the use of a focus group and experts, and verification was implemented through the use of statistics to confirm the validity of the tool. As a result of the verification, the tool secured following validity and credibility as seen in

    and
    . In addition, real needs-extraction algorithm was established through the use of the assessment tool, and the algorithm according each desire was suggested as seen in . The assessment tool and algorithm suggested as a result of the study can be used as data to conduct systematic management of examples through the confirmation of objective desire of in-home disabled.

  • PDF

  • (34141) Korea Institute of Science and Technology Information, 245, Daehak-ro, Yuseong-gu, Daejeon
    Copyright (C) KISTI. All Rights Reserved.