• Title/Summary/Keyword: 의사결정트리 알고리즘

Search Result 80, Processing Time 0.036 seconds

P2P Traffic Classification using Advanced Heuristic Rules and Analysis of Decision Tree Algorithms (개선된 휴리스틱 규칙 및 의사 결정 트리 분석을 이용한 P2P 트래픽 분류 기법)

  • Ye, Wujian;Cho, Kyungsan
    • Journal of the Korea Society of Computer and Information
    • /
    • v.19 no.3
    • /
    • pp.45-54
    • /
    • 2014
  • In this paper, an improved two-step P2P traffic classification scheme is proposed to overcome the limitations of the existing methods. The first step is a signature-based classifier at the packet-level. The second step consists of pattern heuristic rules and a statistics-based classifier at the flow-level. With pattern heuristic rules, the accuracy can be improved and the amount of traffic to be classified by statistics-based classifier can be reduced. Based on the analysis of different decision tree algorithms, the statistics-based classifier is implemented with REPTree. In addition, the ensemble algorithm is used to improve the performance of statistics-based classifier Through the verification with the real datasets, it is shown that our hybrid scheme provides higher accuracy and lower overhead compared to other existing schemes.

A Spam Mail Classification Using Link Structure Analysis (링크구조분석을 이용한 스팸메일 분류)

  • Rhee, Shin-Young;Khil, A-Ra;Kim, Myung-Won
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.1
    • /
    • pp.30-39
    • /
    • 2007
  • The existing content-based spam mail filtering algorithms have difficulties in filtering spam mails when e-mails contain images but little text. In this thesis we propose an efficient spam mail classification algorithm that utilizes the link structure of e-mails. We compute the number of hyperlinks in an e-mail and the in-link frequencies of the web pages hyperlinked in the e-mail. Using these two features we classify spam mails and legitimate mails based on the decision tree trained for spam mail classification. We also suggest a hybrid system combining three different algorithms by majority voting: the link structure analysis algorithm, a modified link structure analysis algorithm, in which only the host part of the hyperlinked pages of an e-mail is used for link structure analysis, and the content-based method using SVM (support vector machines). The experimental results show that the link structure analysis algorithm slightly outperforms the existing content-based method with the accuracy of 94.8%. Moreover, the hybrid system achieves the accuracy of 97.0%, which is a significant performance improvement over the existing method.

A Study of Extraction of Variables Affecting the Adolescents' Computer Use Type with Decision Tree (의사결정트리 기반의 분석을 통한 청소년의 컴퓨터 사용 유형별 관련 변수 추출)

  • Lee, Hye-Joo;Jung, Eui-Hyun
    • The Journal of Korean Association of Computer Education
    • /
    • v.15 no.2
    • /
    • pp.9-18
    • /
    • 2012
  • This study investigated the extraction algorithm fitting for variables of adolescents' computer use type with the sample from KYPS data (3409 in the second grade of the junior high school; 1704 boys and 1705 girls). The results of the decision tree model revealed that : (1) Gender, computer use time, misdeed friends, parent supervision, other agreement of misdeed, parent study expectation, self-control, teacher attachment, and sibling relation were significant for entertainment type. (2) Gender, cyberclub, computer use time, self-belief, online misdeed were significant for relation type. (3) Study enthusiasm, personal study time, optimistic disposition, study and spare time, cyberclub, self-belief, and other people criticism were significant for information type. These results suggest that adolescents' diverse conditions should be considered for using computer more efficiently.

  • PDF

Adaptive Decision Tree Algorithm for Machine Diagnosis (기계 진단을 위한 적응형 의사결정 트리 알고리즘)

  • 백준걸;김강호;김창욱;김성식
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 2000.04a
    • /
    • pp.235-238
    • /
    • 2000
  • This article presents an adaptive decision tree algorithm for dynamically reasoning machine failure cause out of real-time, large-scale machine status database. On the basis of experiment using semiconductor etching machine, it has been verified that our model outperforms previously proposed decision tree models.

  • PDF

Design of a Hopeful Career Forecasting Program for the Career Education (진로교육을 위한 희망진로 예측프로그램 설계)

  • Kim, Geun-Ho;Kim, Eui-Jeong
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.22 no.8
    • /
    • pp.1055-1060
    • /
    • 2018
  • In the wake of the 4th Industrial Revolution, the problem of career education in schools has become a big issue. While various studies are being conducted on services or technologies to effectively handle artificial intelligence and big data, in the field of education, data on students is simply processed. Therefore, in this paper, we are going to design and present career prediction programs for students using artificial intelligence and big data. Using observational data from students at the institute, the decision tree is constructed with the C4.5 algorithm known to be most intelligent and effective in the decision tree and is used to predict students' path of hope. As a result, the coefficient of kappa exceeded 0.7 and showed a fairly low average error of 0.1 degrees. As shown in this study, a number of studies and data will be deployed to help guide students in their consultation and to provide them with classroom attitudes and directions.

Sequence Mining based Manufacturing Process using Decision Model in Cognitive Factory (스마트 공장에서 의사결정 모델을 이용한 순차 마이닝 기반 제조공정)

  • Kim, Joo-Chang;Jung, Hoill;Yoo, Hyun;Chung, Kyungyong
    • Journal of the Korea Convergence Society
    • /
    • v.9 no.3
    • /
    • pp.53-59
    • /
    • 2018
  • In this paper, we propose a sequence mining based manufacturing process using a decision model in cognitive factory. The proposed model is a method to increase the production efficiency by applying the sequence mining decision model in a small scale production process. The data appearing in the production process is composed of the input variables. And the output variable is composed the production rate and the defect rate per hour. We use the GSP algorithm and the REPTree algorithm to generate rules and models using the variables with high significance level through t-test. As a result, the defect rate are improved by 0.38% and the average hourly production rate was increased by 1.89. This has a meaning results for improving the production efficiency through data mining analysis in the small scale production of the cognitive factory.

Pattern Analysis of Traffic Accident data and Prediction of Victim Injury Severity Using Hybrid Model (교통사고 데이터의 패턴 분석과 Hybrid Model을 이용한 피해자 상해 심각도 예측)

  • Ju, Yeong Ji;Hong, Taek Eun;Shin, Ju Hyun
    • Smart Media Journal
    • /
    • v.5 no.4
    • /
    • pp.75-82
    • /
    • 2016
  • Although Korea's economic and domestic automobile market through the change of road environment are growth, the traffic accident rate has also increased, and the casualties is at a serious level. For this reason, the government is establishing and promoting policies to open traffic accident data and solve problems. In this paper, describe the method of predicting traffic accidents by eliminating the class imbalance using the traffic accident data and constructing the Hybrid Model. Using the original traffic accident data and the sampled data as learning data which use FP-Growth algorithm it learn patterns associated with traffic accident injury severity. Accordingly, In this paper purpose a method for predicting the severity of a victim of a traffic accident by analyzing the association patterns of two learning data, we can extract the same related patterns, when a decision tree and multinomial logistic regression analysis are performed, a hybrid model is constructed by assigning weights to related attributes.

A preliminary study on the determination of drought stages at the local level (지역 단위 가뭄단계 판단규칙 개발에 관한 연구)

  • Lee, Jongso;Jeon, Daeun;Yoon, Hyeoncheol;Kam, Jonghun;Lee, Sangeun
    • Journal of Korea Water Resources Association
    • /
    • v.56 no.12
    • /
    • pp.929-937
    • /
    • 2023
  • This study aims to develop rules for the Determination of Drought Stages at the Local Level based on the drought cases in Gwangju and Jeollanam-do in 2022-2023. Among the eight drought indicators provided, six indicators (Agricultural drought stage (for paddy), Residential & industrial drought stage, SPI-12, Relative agricultural water storage, Residential water consumption change (for domestic use), Residential water consumption change (for non-domestic use) were confirmed to have statistical correlations with the perceptions of local government officials and experts. Additionally, this drought indicator was applied to a decision tree algorithm to develop rules for determining the severity of drought. Although it presented results similar to those of the existing method presented in previous studies, it showed a significant comparative advantage in explaining the temporal and spatial patterns of drought in the Gwangju and Jeollanam-do.

Smart Farm Expert System for Paprika using Decision Tree Technique (의사결정트리 기법을 이용한 파프리카용 스마트팜 전문가 시스템)

  • Jeong, Hye-sun;Lee, In-yong;Lim, Joong-seon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2018.10a
    • /
    • pp.373-376
    • /
    • 2018
  • Traditional paprika smart farm systems are often harmful to paprika growth because they are set to follow the values of several sensors to the reference value, so the system is often unable to make optimal judgement. Using decision tree techniques, the expert system for the paprika smart farm is designed to create a control system with a decision-making structure similar to that of farmers using data generated by factors that depend on their surroundings. With the current smart farm control system, it is essential for farmers to intervene in the surrounding environment because it is designed to follow sensor values to the reference values set by the farmer. To solve this problem even slightly, it is going to obtain environmental data and design controllers that apply decision tree method. The expert system is established for complex control by selecting the most influential environmental factors before controlling the paprika smart farm equipment, including criteria for selecting decisions by farmers. The study predicts that each environmental element will be a standard when creating smart farms for professionals because of the interrelationships of data, and more surrounding environmental factors affecting growth.

  • PDF

Development of a model to analyze the relationship between smart pig-farm environmental data and daily weight increase based on decision tree (의사결정트리를 이용한 돈사 환경데이터와 일당증체 간의 연관성 분석 모델 개발)

  • Han, KangHwi;Lee, Woongsup;Sung, Kil-Young
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.20 no.12
    • /
    • pp.2348-2354
    • /
    • 2016
  • In recent days, IoT (Internet of Things) technology has been widely used in the field of agriculture, which enables the collection of environmental data and biometric data into the database. The availability of big data on agriculture results in the increase of the machine learning based analysis. Through the analysis, it is possible to forecast agricultural production and the diseases of livestock, thus helping the efficient decision making in the management of smart farm. Herein, we use the environmental and biometric data of Smart Pig farm to derive the accurate relationship model between the environmental information and the daily weight increase of swine and verify the accuracy of the derived model. To this end, we applied the M5P tree algorithm of machine learning which reveals that the wind speed is the major factor which affects the daily weight increase of swine.