• Title/Summary/Keyword: ROC곡선

Search Result 159, Processing Time 0.027 seconds

Prediction of Safety Grade of Bridges Using the Classification Models of Decision Tree and Random Forest (의사결정나무 및 랜덤포레스트 분류 모델을 이용한 교량 안전등급 예측)

  • Hong, Jisu;Jeon, Se-Jin
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.43 no.3
    • /
    • pp.397-411
    • /
    • 2023
  • The number of deteriorated bridges with a service period of more than 30 years has been rapidly increasing in Korea. Accordingly, the importance of advanced maintenance technologies through the predictions of age-induced deterioration degree, condition, and performance of bridges is more and more noticed. The prediction method of the safety grade of bridges was proposed in this study using the classification models of the Decision Tree and the Random Forest based on machine learning. As a result of analyzing these models for the 8,850 bridges located in national roads with various evaluation indexes such as confusion matrix, balanced accuracy, recall, ROC curve, and AUC, the Random Forest largely showed better predictive performance than that of the Decision Tree. In particular, random under-sampling in the Random Forest showed higher predictive performance than that of other sampling techniques for the C and D grade bridges, with the recall of 83.4%, which need more attention to maintenance because of the significant deterioration degree. The proposed model can be usefully applied to rapidly identify the safety grade and to establish an efficient and economical maintenance plan of bridges that have not recently been inspected.

The cutoff criterion and the accuracy of the polygraph test for crime investigation (범죄수사를 위한 거짓말탐지 검사(polygraph test)의 판정기준과 정확성)

  • Yu Hwa Han ;Kwangbai Park
    • Korean Journal of Culture and Social Issue
    • /
    • v.14 no.4
    • /
    • pp.103-117
    • /
    • 2008
  • The polygraph test administered by the Korean Prosecutors Office for crime investigations customarily uses the score of -12 as the cutoff point separating the subjects who lie from those who tell the truth. The criterion used by the KPO is different from the one (-13) suggested by Backster (1963) who invented the particular method for lie detection. Based on the signal detection theory applied to the real polygraph test data obtained from real crime suspects by the KPO, the present study identified the score of -8 as an optimal criterion resulting in the highest overall accuracy of the polygraph test. The classification of the subjects with the score of -8 as the criterion resulted in the highest accuracy (83.17%) compared with the accuracies of classifications with the Backster's criterion (76.24%) and the KPO's criterion (80.20%). However, the new criterion was also found to result in more false-positive cases. Based on the results from the present study, it was recommended to use the score of -8 as the criterion when the overall accuracy is important but the score of -12 or -13 when avoiding false-positive is more important than securing the overall accuracy.

  • PDF

Deep Learning in Thyroid Ultrasonography to Predict Tumor Recurrence in Thyroid Cancers (인공지능 딥러닝을 이용한 갑상선 초음파에서의 갑상선암의 재발 예측)

  • Jieun Kil;Kwang Gi Kim;Young Jae Kim;Hye Ryoung Koo;Jeong Seon Park
    • Journal of the Korean Society of Radiology
    • /
    • v.81 no.5
    • /
    • pp.1164-1174
    • /
    • 2020
  • Purpose To evaluate a deep learning model to predict recurrence of thyroid tumor using preoperative ultrasonography (US). Materials and Methods We included representative images from 229 US-based patients (male:female = 42:187; mean age, 49.6 years) who had been diagnosed with thyroid cancer on preoperative US and subsequently underwent thyroid surgery. After selecting each representative transverse or longitudinal US image, we created a data set from the resulting database of 898 images after augmentation. The Python 2.7.6 and Keras 2.1.5 framework for neural networks were used for deep learning with a convolutional neural network. We compared the clinical and histological features between patients with and without recurrence. The predictive performance of the deep learning model between groups was evaluated using receiver operating characteristic (ROC) analysis, and the area under the ROC curve served as a summary of the prognostic performance of the deep learning model to predict recurrent thyroid cancer. Results Tumor recurrence was noted in 49 (21.4%) among the 229 patients. Tumor size and multifocality varied significantly between the groups with and without recurrence (p < 0.05). The overall mean area under the curve (AUC) value of the deep learning model for prediction of recurrent thyroid cancer was 0.9 ± 0.06. The mean AUC value was 0.87 ± 0.03 in macrocarcinoma and 0.79 ± 0.16 in microcarcinoma. Conclusion A deep learning model for analysis of US images of thyroid cancer showed the possibility of predicting recurrence of thyroid cancer.

Validation of Instruments to Classify the Frailty of the Elderly in Community (지역사회 거주 노인의 허약선별도구 타당도 평가)

  • Lee, In-Sook;Park, Young-Im;Park, Eun-Ok;Lee, Soon-Hee;Jeong, Ihn-Sook
    • Research in Community and Public Health Nursing
    • /
    • v.22 no.3
    • /
    • pp.302-314
    • /
    • 2011
  • Purpose: This study aimed to validate instruments to classify the frailty of Korean elderly people in community. Methods: For this study, 632 elders were selected from community-based elderly houses and home visiting registries, and data on frailty were collected using three instruments during November, 2008. The Korean Frail Scale (KFS) was composed of 10 domains with the maximum score of 20. The Edmonton Frail Scale (EFS) had 10 domains with the maximum score of 17. The 25_Japan Frail Scale (25_JFS) was composed of 6 domains with the maximum score of 25. Internal consistency was measured with Cronbach's ${\alpha}$. Sensitivity, specificity and area under the curve (AUC) of ROC were measured to see validity with long.term care insurance grade as a gold standard. Results: The Cronbach's ${\alpha}$ was .72 for KFS, .55 for EFS, and .80 for 25_JFS. Sensitivity, specificity, and AUC were 70.0%, 83.2%, and .83, respectively, at cutting point 10.5 for the KFS, 50.0%, 80.9%, and .66, respectively, at 8.5 for EFS, and 80.0%, 85.9%, and .86, respectively, at 12.5 for 25_JFS. Conclusion: KFS and three JFS showed favorable internal consistency and predictive validity. Further longitudinal studies are recommended to confirm predictive validity.

Parameter estimation of linear function using VUS and HUM maximization (VUS와 HUM 최적화를 이용한 선형함수의 모수추정)

  • Hong, Chong Sun;Won, Chi Hwan;Jeong, Dong Gil
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.6
    • /
    • pp.1305-1315
    • /
    • 2015
  • Consider the risk score which is a function of a linear score for the classification models. The AUC optimization method can be applied to estimate the coefficients of linear score. These estimates obtained by this AUC approach method are shown to be better than the maximum likelihood estimators using logistic models under the general situation which does not fit the logistic assumptions. In this work, the VUS and HUM approach methods are suggested by extending AUC approach method for more realistic discrimination and prediction worlds. Some simulation results are obtained with both various distributions of thresholds and three kinds of link functions such as logit, complementary log-log and modified logit functions. It is found that coefficient prediction results by using the VUS and HUM approach methods for multiple categorical classification are equivalent to or better than those by using logistic models with some link functions.

Validation of Critical Care Non-verbal Pain Scale for Critically Ill Patients (중환자 통증사정 도구의 타당성 평가)

  • Choi, Eun Hee;Kim, Jin Hee;Ko, Mi Suk;Kim, Ji Yang;Kwon, Eun Ok;Jang, In Sun
    • Journal of Korean Clinical Nursing Research
    • /
    • v.19 no.2
    • /
    • pp.245-254
    • /
    • 2013
  • Purpose: This study was done to examine predictive validity of Critical Care Non-verbal Pain Scale (CNPS) and develop criteria for pain assessment using CNPS with critically ill patients who have communication problems. Methods: Data were collected from intensive care units at three major general hospitals in Seoul and Kyunggi province. During each observation, a nurse assessed pain severity using CNPS ratings (range 0-9) at four treatment stages: at rest, during central catheter dressing change (nonpainful procedure), position change and suctioning (routine painful procedures). Patients also assessed their pain using a self-report 4-point VRS-4. Results: There were significant differences between the four treatment stages except between "at rest" and "nonpainful procedure". Strong correlations were found between CNPS and VRS-4 for "at rest" (r=.552, p<.001), central catheter dressing change (r=.505, p<.001), position change (r=.709, p<.001), and suctioning (r=.662, p<.001). ROC curve analysis of CNPS based on 3 point on VRS-4 showed the cutoff point was 3 for CNPS, the starting point for pain management with 73% sensitivity, 92.2% specificity, 73% positive predictive value, and 92.8% negative predictive value. Conclusion: Results indicate that CNPS is a valid tool for measuring pain in critically ill patients with communication problems and 3 point should be the standardized pain treatment point.

Classification of the Diagnosis of Diabetes based on Mixture of Expert Model (Mixture of Expert 모형에 기반한 당뇨병 진단 분류)

  • Lee, Hong-Ki;Myoung, Sung-Min
    • Journal of the Korea Society of Computer and Information
    • /
    • v.19 no.11
    • /
    • pp.149-157
    • /
    • 2014
  • Diabetes is a chronic disease that requires continuous medical care and patient-self management education to prevent acute complications and reduce the risk of long-term complications. The worldwide prevalence and incidence of diabetes mellitus are reached epidemic proportions in most populations. Early detection of diabetes could help to prevent its onset by taking appropriate preventive measures and managing lifestyle. The major objective of this research is to develop an automated decision support system for detection of diabetes using mixture of experts model. The performance of the classification algorithms was compared on the Pima Indians diabetes dataset. The result of this study demonstrated that the mixture of expert model achieved diagnostic accuracies were higher than the other automated diagnostic systems.

Nomogram building to predict dyslipidemia using a naïve Bayesian classifier model (순수 베이지안 분류기 모델을 사용하여 이상지질혈증을 예측하는 노모 그램 구축)

  • Kim, Min-Ho;Seo, Ju-Hyun;Lee, Jea-Young
    • The Korean Journal of Applied Statistics
    • /
    • v.32 no.4
    • /
    • pp.619-630
    • /
    • 2019
  • Dyslipidemia is a representative chronic disease affecting Koreans that requires continuous management. It is also a known risk factor for cardiovascular disease such as hypertension and diabetes. However, it is difficult to diagnose vascular disease without a medical examination. This study identifies risk factors for the recognition and prevention of dyslipidemia. By integrating them, we construct a statistical instrumental nomogram that can predict the incidence rate while visualizing. Data were from the Korean National Health and Nutrition Examination Survey (KNHANES) for 2013-2016. First, a chi-squared test identified twelve risk factors of dyslipidemia. We used a naïve Bayesian classifier model to construct a nomogram for the dyslipidemia. The constructed nomogram was verified using a receiver operating characteristics curve and calibration plot. Finally, we compared the logistic nomogram previously presented with the Bayesian nomogram proposed in this study.

Life Risk Assessment of Landslide Disaster in Jinbu Area Using Logistic Regression Model (로지스틱 회귀분석모델을 활용한 평창군 진부 지역의 산사태 재해의 인명 위험 평가)

  • Rahnuma, Bintae Rashid Urmi;Al, Mamun;Jang, Dong-Ho
    • Journal of The Geomorphological Association of Korea
    • /
    • v.27 no.2
    • /
    • pp.65-80
    • /
    • 2020
  • This paper deals with risk assessment of life in a landslide-prone area by a GIS-based modeling method. Landslide susceptibility maps can provide a probability of landslide prone areas to mitigate or proper control this problems and to take any development plan and disaster management. A landslide inventory map of the study area was prepared based on past historical information and aerial photography analysis. A total of 550 landslides have been counted at the whole study area. The extracted landslides were randomly selected and divided into two different groups, 50% of the landslides were used for model calibration and the other were used for validation purpose. Eleven causative factors (continuous and thematic) such as slope, aspect, curvature, topographic wetness index, elevation, forest type, forest crown density, geology, land-use, soil drainage, and soil texture were used in hazard analysis. The correlation between landslides and these factors, pixels were divided into several classes and frequency ratio was also extracted. Eventually, a landslide susceptibility map was constructed using a logistic regression model based on entire events. Moreover, the landslide susceptibility map was plotted with a receiver operating characteristic (ROC) curve and calculated the area under the curve (AUC) and tried to extract a success rate curve. Based on the results, logistic regression produced an 85.18% accuracy, so we believed that the model was reliable and acceptable for the landslide susceptibility analysis on the study area. In addition, for risk assessment, vulnerability scale were added for social thematic data layer. The study area predictive landslide affected pixels 2,000 and 5,000 were also calculated for making a probability table. In final calculation, the 2,000 predictive landslide affected pixels were assumed to run. The total population causalities were estimated as 7.75 person that was relatively close to the actual number published in Korean Annual Disaster Report, 2006.

Validity of the New Caries Activity Test using Real-Time Polymerase Chain Reaction (실시간 중합효소연쇄반응 방법을 이용한 새로운 치아우식 활성 검사법의 유효성)

  • Kwon, Doyoun;Kim, Heejin;Nam, Okhyung;Kim, Misun;Choi, Sungchul;Kim, Kwangchul;Lee, Hyoseol
    • Journal of the korean academy of Pediatric Dentistry
    • /
    • v.45 no.3
    • /
    • pp.354-362
    • /
    • 2018
  • Periogen is a new caries activity test using real-time polymerase chain reaction. The aim of this study was to assess the validity of Periogen by evaluating the correlation with dmft, dmfts indices and comparing with Cariview and caries risk assessment tool (CAT). 83 children under 6 participated in this study. Dmft, dmfts indices and CAT were collected through an examination of oral health status. Plaque samples for Periogen and Cariview were collected and manipulated according to the manufactures' instructions. The correlation coefficient of Periogen, Cariview and CAT with the dmfts index were 0.38, 0.56 and 0.66 in each (p < 0.01). The sensitivity of Periogen, Cariview and CAT were 43%, 76% and 95% and specificity were 80%, 72% and 74% respectively. Area under curve under the receiver operating characteristic curves in each method indicated 0.69, 0.81 and 0.85. CAT and Cariview were more effective in evaluation the risk of dental caries than Periogen so far. To be used Periogen clinically, more improvements for higher validity were needed.