• Title/Summary/Keyword: Tree mining

Search Result 566, Processing Time 0.024 seconds

Protein Disorder/Order Region Classification Using EPs-TFP Mining Method (EPs-TFP 마이닝 기법을 이용한 단백질 Disorder/Order 지역 분류)

  • Lee, Heon Gyu;Shin, Yong Ho
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.17 no.6
    • /
    • pp.59-72
    • /
    • 2012
  • Since a protein displays its specific functions when disorder region of protein sequence transits to order region with provoking a biological reaction, the separation of disorder region and order region from the sequence data is urgently necessary for predicting three dimensional structure and characteristics of the protein. To classify the disorder and order region efficiently, this paper proposes a classification/prediction method using sequence data while acquiring a non-biased result on a specific characteristics of protein and improving the classification speed. The emerging patterns based EPs-TFP methods utilizes only the essential emerging pattern in which the redundant emerging patterns are removed. This classification method finds the sequence patterns of disorder region, such sequence patterns are frequently shown in disorder region but relatively not frequently in the order region. We expand P-tree and T-tree conceptualized TFP method into a classification/prediction method in order to improve the performance of the proposed algorithm. We used Disprot 4.9 and CASP 7 data to evaluate EPs-TFP technique, the results of order/disorder classification show sensitivity 73.6, specificity 69.51 and accuracy 74.2.

Comparative analysis of Lecture Evaluation using Decision Tree: Ways to Improve University Classes after COVID-19

  • Bok-Ju Jung;Sang-Chul Lee
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.4
    • /
    • pp.197-208
    • /
    • 2023
  • In this study, we attempted to examine the changing ways of thinking about lecture evaluation before and after COVID-19. To this end, decision tree analysis(Decision Tree) was used among data mining techniques based on lecture evaluation data for liberal arts and major classes conducted before and after COVID-19 for A university. According to the results of the study, liberal arts changed from 'method' to 'content', and 'knowledge improvement' was an important factor both before and after majors. In particular, 'Assignment' was found to be an important factor after the COVID-19 in common in the evaluation of lectures in the liberal arts department, which means that in the future, professors will be provided with appropriate teaching methods during class, interaction with students, and feedback on assignments or test results, indicates the need for competence. Based on the results of this study, a plan to improve communication with students and activation of blended learning was suggested.

Enhancing Workers' Job Tenure Using Directions Derived from Data Mining Techniques (데이터 마이닝 기법을 활용한 근로자의 고용유지 강화 방안 개발)

  • An, Minuk;Kim, Taeun;Yoo, Donghee
    • The Journal of the Korea Contents Association
    • /
    • v.18 no.5
    • /
    • pp.265-279
    • /
    • 2018
  • This study conducted an experiment using data mining techniques to develop prediction models of worker job turnover. The experiment used data from the '2015 Graduate Occupational Mobility Survey' by the Korea Employment Information Service. We developed the prediction models using a decision tree, Bayes net, and artificial neural network. We found that the decision tree-based prediction model reported the best accuracy. We also found that the six influential factors affecting employees' turnover intention are type of working time, job status, full-time or not full-time, regular working hours per week, regular working days per week, and personal development opportunities. From the decision tree-based prediction model, we derived 12 rules of employee turnover for all job types. Using the derived rules, we proposed helpful directions for enhancing workers' job tenure. In addition, we analyzed the influential factors affecting employees' job turnover intention according to four job types and derived rules for each: office (ten rules), culture and art (nine rules), construction (four rules), and information technology (six rules). Using the derived rules, we proposed customized directions for improving the job tenure for each group.

A Spatial Entropy based Decision Tree Method Considering Distribution of Spatial Data (공간 데이터의 분포를 고려한 공간 엔트로피 기반의 의사결정 트리 기법)

  • Jang, Youn-Kyung;You, Byeong-Seob;Lee, Dong-Wook;Cho, Sook-Kyung;Bae, Hae-Young
    • The KIPS Transactions:PartB
    • /
    • v.13B no.7 s.110
    • /
    • pp.643-652
    • /
    • 2006
  • Decision trees are mainly used for the classification and prediction in data mining. The distribution of spatial data and relationships with their neighborhoods are very important when conducting classification for spatial data mining in the real world. Spatial decision trees in previous works have been designed for reflecting spatial data characteristic by rating Euclidean distance. But it only explains the distance of objects in spatial dimension so that it is hard to represent the distribution of spatial data and their relationships. This paper proposes a decision tree based on spatial entropy that represents the distribution of spatial data with the dispersion and dissimilarity. The dispersion presents the distribution of spatial objects within the belonged class. And dissimilarity indicates the distribution and its relationship with other classes. The rate of dispersion by dissimilarity presents that how related spatial distribution and classified data with non-spatial attributes we. Our experiment evaluates accuracy and building time of a decision tree as compared to previous methods. We achieve an improvement in performance by about 18%, 11%, respectively.

Development of a convergence inpatient medical service patient experience management model using data mining (데이터마이닝을 이용한 융복합 입원 의료서비스 환자경험 관리모형 개발)

  • Yoo, Jin-Yeong
    • Journal of Digital Convergence
    • /
    • v.18 no.6
    • /
    • pp.401-409
    • /
    • 2020
  • The purpose of this study is to develop a convergence inpatient medical service patient experience management model(IMSPEMM) that can help in the management strategy of a medical institution to create a patient-centered medical culture. Using the original data from the 2018 Medical Service Experience Survey, 593 people with medical services inpatient(MSI) over the age of 15 were analyzed. By using the decision tree model, we developed a prediction model for overall satisfaction(OS) with the inpatient medical service experience(IMSE) and the intention to recommend patient experience(RI), and were classified into 4 and 7 types. The accuracy of the model was 68.9% and 78.3%. The OS level of IMSE was the nurse area and the hospital room noise management area, and the RI decision factor was the nurse area. It is significant that the IMSPEMM for MSI was presented and confirmed that the nurse area and the noise management area of the hospital room are important factors for the inpatient experience. It is considered that further research is needed to generalize the IMSPEMM.

A Study on Predictors of Academic Achievement in College Students : Focused on J University (대학생의 학업성취도 예측요인 연구 : J 대학을 중심으로)

  • Son, Yo-Han;Kim, In-Gyu
    • The Journal of the Korea Contents Association
    • /
    • v.20 no.1
    • /
    • pp.519-529
    • /
    • 2020
  • The purpose of this study is to establish a model for predicting academic achievement of college students and to reveal the interrelationship and relative influence of each factor. For this, we surveyed the personal factors and learning strategy factors of 1,310 learners at J University, and analyzed the discriminant factors and patterns of the predictors of academic achievement through the decision tree analysis, a data mining method, and examined the relative effects of each factor. Binary logistic regression analysis was performed for viewing. As a result, the most important factor for predicting academic achievement was efficacy, and other factors such as motivation, time management, and depression were predictive of academic achievement. The patterns of factors predicting academic achievement were found to be high in efficacy and time management, and high in motivation for learning even if the efficacy was moderate. Low efficacy and learning motivation, and high depression have been shown to decrease academic achievement. Based on these results, the study suggested the efficacy and motivation to improve academic achievement of college students, strengthening time management education, and managing negative emotions.

Development of the Fraud Detection Model for Injury in National Health Insurance using Data Mining -Focusing on Injury Claims of Self-employed Insured of National Health Insurance (데이터마이닝을 이용한 건강보험 상해요인 조사 대상 선정 모형 개발 -건강보험 지역가입자 상해상병 진료건을 중심으로-)

  • Park, Il-Su;Park, So-Jeong;Han, Jun-Tae;Kang, Sung-Hong
    • Journal of Digital Convergence
    • /
    • v.11 no.10
    • /
    • pp.593-608
    • /
    • 2013
  • According to increasing number of injury claims, the challenge is reducing investigation of cases of injuries by selecting them more delicately, while also increasing the redemption rates and the amount of restitution. In this regards, we developed the fraud detection model for injury claims of self-employed insured by using decision tree after collecting medical claim data from 2006 to 2011 of the National Health Insurance in Korea. As a result of this model, subject types were classified into 18 types. If applying these types to the actual survey compared with if not applying, the redumption collecting rate will be increasing by 12.8%. Also, the effectiveness of this model will be maximize when the number of claims handlers considering their survey volume and management plans are examined thoroughly.

Predicting Factors on the Increase in Computer Entertainment Behavior with Data Mining (데이터마이닝을 이용한 컴퓨터 오락추구 행동 상승의 예측요인)

  • Lee, Hyejoo;Jung, Euihyun
    • The Journal of Korean Association of Computer Education
    • /
    • v.20 no.2
    • /
    • pp.47-55
    • /
    • 2017
  • The purpose of this study is to investigate the predicting factors on the increase in computer entertainment behavior with the sample from KYPS data. The results of the Decision Tree model revealed that: (1) Neighbor supervision, self-belief, parent attachment, life satisfaction, and peer attachment were significant for the increase in computer entertainment behavior. (2) Neighbor supervision, class participation and leisure satisfaction were significant for male students' increase in computer entertainment behavior. (3) Optimistic disposition, teacher attachment, and peer attachment were significant for female students' increase in computer entertainment behavior. These results suggest that meaningful factors and their divers interactions should be considered in methods and programs for regulating and preventing the increase in computer entertainment behavior.

An Efficient Method for Mining Frequent Patterns based on Weighted Support over Data Streams (데이터 스트림에서 가중치 지지도 기반 빈발 패턴 추출 방법)

  • Kim, Young-Hee;Kim, Won-Young;Kim, Ung-Mo
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.10 no.8
    • /
    • pp.1998-2004
    • /
    • 2009
  • Recently, due to technical developments of various storage devices and networks, the amount of data increases rapidly. The large volume of data streams poses unique space and time constraints on the data mining process. The continuous characteristic of streaming data necessitates the use of algorithms that require only one scan over the stream for knowledge discovery. Most of the researches based on the support are concerned with the frequent itemsets, but ignore the infrequent itemsets even if it is crucial. In this paper, we propose an efficient method WSFI-Mine(Weighted Support Frequent Itemsets Mine) to mine all frequent itemsets by one scan from the data stream. This method can discover the closed frequent itemsets using DCT(Data Stream Closed Pattern Tree). We compare the performance of our algorithm with DSM-FI and THUI-Mine, under different minimum supports. As results show that WSFI-Mine not only run significant faster, but also consume less memory.

A recommendation system for assisting devices in long-term care insurance (의사결정나무기법을 활용한 장기요양 복지용구 권고모형 개발)

  • Han, Eun-Jeong;Park, Sanghee;Lee, JungSuk;Kim, Dong-Geon
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.6
    • /
    • pp.693-706
    • /
    • 2018
  • It is very important to support the elderly with disability ageing in place. Assisting devices can help them to live independently in their community; however, they have to be used appropriately to meet care needs. This study develops an assisting device recommendation system for the beneficiaries of long-term care insurance that include algorithms to decide the most appropriate type of assisting device for beneficiaries. We used long-term care (LTC) insurance data for grade assessment including 8,084 beneficiaries from July 2015 to June 2016. In addition, we collected standard care plans for assisting devices, that power-assessors made, considering their performance and ability that could subsequently be matched with grade assessment data. We used a decision-tree model in data-mining to develop the model. Finally, we developed 15 algorithms for recommending assisting devices. The findings might be useful in evidence-based care planning for assisting devices and can contribute to enhancing independence and safety in LTC.