• Title/Summary/Keyword: Cross - Validation

Search Result 1,017, Processing Time 0.03 seconds

Development of a Classification Model for Driver's Drowsiness and Waking Status Using Heart Rate Variability and Respiratory Features

  • Kim, Sungho;Choi, Booyong;Cho, Taehwan;Lee, Yongkyun;Koo, Hyojin;Kim, Dongsoo
    • Journal of the Ergonomics Society of Korea
    • /
    • v.35 no.5
    • /
    • pp.371-381
    • /
    • 2016
  • Objective:This study aims to evaluate the features of heart rate variability (HRV) and respiratory signals as indices for a driver's drowsiness and waking status in order to develop the classification model for a driver's drowsiness and waking status using those features. Background: Driver's drowsiness is one of the major causal factors for traffic accidents. This study hypothesized that the application of combined bio-signals to monitor the alertness level of drivers would improve the effectiveness of the classification techniques of driver's drowsiness. Method: The features of three heart rate variability (HRV) measurements including low frequency (LF), high frequency (HF), and LF/HF ratio and two respiratory measurements including peak and rate were acquired by the monotonous car driving simulation experiments using the photoplethysmogram (PPG) and respiration sensors. The experiments were repeated a total of 50 times on five healthy male participants in their 20s to 50s. The classification model was developed by selecting the optimal measurements, applying a binary logistic regression method and performing 3-fold cross validation. Results: The power of LF, HF, and LF/HF ratio, and the respiration peak of drowsiness status were reduced by 38%, 22%, 31%, and 7%, compared to those of waking status, while respiration rate was increased by 3%. The classification sensitivity of the model using both HRV and respiratory features (91.4%) was improved, compared to that of the model using only HRV feature (89.8%) and that using only respiratory feature (83.6%). Conclusion: This study suggests that the classification of driver's drowsiness and waking status may be improved by utilizing a combination of HRV and respiratory features. Application: The results of this study can be applied to the development of driver's drowsiness prevention systems.

Cross-cultural Validation Test and Application of LSS-short Form (여가만족척도(LSS-short form)의 타당도 검증과 적용)

  • Kim, Mi-Lyang;Lee, Yeon-Ju;Hwang, Sun-Hwan
    • The Journal of the Korea Contents Association
    • /
    • v.10 no.11
    • /
    • pp.435-445
    • /
    • 2010
  • The purpose of the current study was to examine the validity of the Leisure Satisfaction Scale(LSS) and confirm the relations between the LSS and subjective well-being. The LSS developed by Beard and Ragheb[9] has been used for about 20 years without the examination of the validity since it was utilized in Lee[2]'s study in Korea. First, all items of the original LSS and the operational definition of factors were translated and then inversely translated, and reviewed by experts in leisure studies. A total of 515 respondents participated in the study. To achieve the goals of this study, item-total correlation, correlation analysis, exploratory factor analysis, reliability analysis, multiple regression and confirmatory factor analysis were conducted by SPSS 14.0 and AMOS 5.0 programs. Main findings are as follows: First, the factor names(psychological, educational, social, physical, esthetic, and relaxation) and items were modified. Second, modified operational definition of factors were presented. The reliability and validity of the LSS consisted of 24 items and five factors were very high. Psychological and relaxation factors of LSS affected subjective well-being positively. The new LSS would be useful for the future studies associated with leisure satisfaction.

Calpain-10 SNP43 and SNP19 Polymorphisms and Colorectal Cancer: a Matched Case-control Study

  • Hu, Xiao-Qin;Yuan, Ping;Luan, Rong-Sheng;Li, Xiao-Ling;Liu, Wen-Hui;Feng, Fei;Yan, Jin;Yang, Yan-Fang
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.14 no.11
    • /
    • pp.6673-6680
    • /
    • 2013
  • Objective: Insulin resistance (IR) is an established risk factor for colorectal cancer (CRC). Given that CRC and IR physiologically overlap and the calpain-10 gene (CAPN10) is a candidate for IR, we explored the association between CAPN10 and CRC risk. Methods: Blood samples of 400 case-control pairs were genotyped, and the lifestyle and dietary habits of these pairs were recorded and collected. Unconditional logistic regression (LR) was used to assess the effects of CAPN10 SNP43 and SNP19, and environmental factors. Both generalized multifactor dimensionality reduction (GMDR) and the classification and regression tree (CART) were used to test gene-environment interactions for CRC risk. Results: The GA+AA genotype of SNP43 and the Del/Ins+Ins/Ins genotype of SNP19 were marginally related to CRC risk (GA+AA: OR = 1.35, 95% CI = 0.92-1.99; Del/Ins+Ins/Ins: OR = 1.31, 95% CI = 0.84-2.04). Notably, a high-order interaction was consistently identified by GMDR and CART analyses. In GMDR, the four-factor interaction model of SNP43, SNP19, red meat consumption, and smoked meat consumption was the best model, with a maximum cross-validation consistency of 10/10 and testing balance accuracy of 0.61 (P < 0.01). In LR, subjects with high red and smoked meat consumption and two risk genotypes had a 6.17-fold CRC risk (95% CI = 2.44-15.6) relative to that of subjects with low red and smoked meat consumption and null risk genotypes. In CART, individuals with high smoked and red meat consumption, SNP19 Del/Ins+Ins/Ins, and SNP43 GA+AA had higher CRC risk (OR = 4.56, 95%CI = 1.94-10.75) than those with low smoked and red meat consumption. Conclusions: Though the single loci of CAPN10 SNP43 and SNP19 are not enough to significantly increase the CRC susceptibility, the combination of SNP43, SNP19, red meat consumption, and smoked meat consumption is associated with elevated risk.

A Study on Estimation of the Greenhouse Gas Emission from the Road Transportation Infrastructure Using the Geostatistical Analysis -A Case of the Daegu- (공간통계기법을 이용한 도로교통기반의 온실가스 관한 연구 -대구광역시를 대상으로-)

  • Lee, Sang Woo;Lee, Seung Wook;Lee, Seung Yeob;Hong, Won Hwa
    • Spatial Information Research
    • /
    • v.22 no.1
    • /
    • pp.9-17
    • /
    • 2014
  • This study was intended to reliably predict the traffic green house gas emission in Daegu with the use of spatial statistical technique and calculate the traffic green house gas emission of each administrative district on the basis of the accurately predicted emission. First, with the use of the traffic actually surveyed at a traffic observation point, and traffic green house gas emission was calculated. Secondly, on the basis of the calculation, and with the use of Universal Kriging technique, this researcher set a suitable variogram modeling to accurately and reliably predict the green house gas emission at non-observation point suitable through spatial correlation, and then performed cross validation to prove the validity of the proper variogram modeling and Kriging technique. Thirdly, with the use of the validated kriging technique, traffic green gas emission was visualized, and its distribution features were analyzed to predict and calculate the traffic green house gas emission of each administrative district. As a result, regarding the traffic green house gas emission of each administration, it was found that Bukgu had the highest green house gas emission of $291,878,020kgCO_2eq/yr$.

Calibration of Portable Particulate Mattere-Monitoring Device using Web Query and Machine Learning

  • Loh, Byoung Gook;Choi, Gi Heung
    • Safety and Health at Work
    • /
    • v.10 no.4
    • /
    • pp.452-460
    • /
    • 2019
  • Background: Monitoring and control of PM2.5 are being recognized as key to address health issues attributed to PM2.5. Availability of low-cost PM2.5 sensors made it possible to introduce a number of portable PM2.5 monitors based on light scattering to the consumer market at an affordable price. Accuracy of light scatteringe-based PM2.5 monitors significantly depends on the method of calibration. Static calibration curve is used as the most popular calibration method for low-cost PM2.5 sensors particularly because of ease of application. Drawback in this approach is, however, the lack of accuracy. Methods: This study discussed the calibration of a low-cost PM2.5-monitoring device (PMD) to improve the accuracy and reliability for practical use. The proposed method is based on construction of the PM2.5 sensor network using Message Queuing Telemetry Transport (MQTT) protocol and web query of reference measurement data available at government-authorized PM monitoring station (GAMS) in the republic of Korea. Four machine learning (ML) algorithms such as support vector machine, k-nearest neighbors, random forest, and extreme gradient boosting were used as regression models to calibrate the PMD measurements of PM2.5. Performance of each ML algorithm was evaluated using stratified K-fold cross-validation, and a linear regression model was used as a reference. Results: Based on the performance of ML algorithms used, regression of the output of the PMD to PM2.5 concentrations data available from the GAMS through web query was effective. The extreme gradient boosting algorithm showed the best performance with a mean coefficient of determination (R2) of 0.78 and standard error of 5.0 ㎍/㎥, corresponding to 8% increase in R2 and 12% decrease in root mean square error in comparison with the linear regression model. Minimum 100 hours of calibration period was found required to calibrate the PMD to its full capacity. Calibration method proposed poses a limitation on the location of the PMD being in the vicinity of the GAMS. As the number of the PMD participating in the sensor network increases, however, calibrated PMDs can be used as reference devices to nearby PMDs that require calibration, forming a calibration chain through MQTT protocol. Conclusions: Calibration of a low-cost PMD, which is based on construction of PM2.5 sensor network using MQTT protocol and web query of reference measurement data available at a GAMS, significantly improves the accuracy and reliability of a PMD, thereby making practical use of the low-cost PMD possible.

The Sub Authentication Method For Driver Using Driving Patterns (운전 패턴을 이용한 운전자 보조 인증방법)

  • Jeong, Jong-Myoung;Kang, Hyung Chul;Jo, Hyo Jin;Yoon, Ji Won;Lee, Dong Hoon
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.23 no.5
    • /
    • pp.919-929
    • /
    • 2013
  • Recently, a variety of IT technologies are applied to the vehicle. However, some vehicle-IT technologies without security considerations may cause security problems. Specially, some researches about a smart key system applied to automobiles for authentication show that the system is insecure from replay attacks and modification attacks using a wireless signal of the smart key. Thus, in this paper, we propose an authentication method for the driver by using driving patterns. Nowadays, we can obtain driving patterns using the In-vehicle network data. In our authentication model, we make driving ppatterns of car owner using standard normal distribution and apply these patterns to driver authentication. To validate our model, we perform an k-fold cross validation test using In-vehicle network data and obtain the result(true positive rate 0.7/false positive rate is 0.35). Considering to our result, it turns out that our model is more secure than existing 'what you have' authentication models such as the smart key if the authentication result is sent to the car owner through mobile networks.

Prediction Models for Solitary Pulmonary Nodules Based on Curvelet Textural Features and Clinical Parameters

  • Wang, Jing-Jing;Wu, Hai-Feng;Sun, Tao;Li, Xia;Wang, Wei;Tao, Li-Xin;Huo, Da;Lv, Ping-Xin;He, Wen;Guo, Xiu-Hua
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.14 no.10
    • /
    • pp.6019-6023
    • /
    • 2013
  • Lung cancer, one of the leading causes of cancer-related deaths, usually appears as solitary pulmonary nodules (SPNs) which are hard to diagnose using the naked eye. In this paper, curvelet-based textural features and clinical parameters are used with three prediction models [a multilevel model, a least absolute shrinkage and selection operator (LASSO) regression method, and a support vector machine (SVM)] to improve the diagnosis of benign and malignant SPNs. Dimensionality reduction of the original curvelet-based textural features was achieved using principal component analysis. In addition, non-conditional logistical regression was used to find clinical predictors among demographic parameters and morphological features. The results showed that, combined with 11 clinical predictors, the accuracy rates using 12 principal components were higher than those using the original curvelet-based textural features. To evaluate the models, 10-fold cross validation and back substitution were applied. The results obtained, respectively, were 0.8549 and 0.9221 for the LASSO method, 0.9443 and 0.9831 for SVM, and 0.8722 and 0.9722 for the multilevel model. All in all, it was found that using curvelet-based textural features after dimensionality reduction and using clinical predictors, the highest accuracy rate was achieved with SVM. The method may be used as an auxiliary tool to differentiate between benign and malignant SPNs in CT images.

Patient-Specific Quality Assurance in a Multileaf Collimator-Based CyberKnife System Using the Planar Ion Chamber Array

  • Yoon, Jeongmin;Lee, Eungman;Park, Kwangwoo;Kim, Jin Sung;Kim, Yong Bae;Lee, Ho
    • Progress in Medical Physics
    • /
    • v.29 no.2
    • /
    • pp.59-65
    • /
    • 2018
  • This paper describes the clinical use of the dose verification of multileaf collimator (MLC)-based CyberKnife plans by combining the Octavius 1000SRS detector and water-equivalent RW3 slab phantom. The slab phantom consists of 14 plates, each with a thickness of 10 mm. One plate was modified to support tracking by inserting 14 custom-made fiducials on surface holes positioned at the outer region of $10{\times}10cm^2$. The fiducial-inserted plate was placed on the 1000SRS detector and three plates were additionally stacked up to build the reference depth. Below the detector, 10 plates were placed to avoid longer delivery times caused by proximity detection program alerts. The cross-calibration factor prior to phantom delivery was obtained by performing with 200 monitor units (MU) on the field size of $95{\times}92.5mm^2$. After irradiation, the measured dose distribution of the coronal plane was compared with the dose distribution calculated by the MultiPlan treatment planning system. The results were assessed by comparing the absolute dose at the center point of 1000SRS and the 3-D Gamma (${\gamma}$) index using 220 patient-specific quality assurance (QA). The discrepancy between measured and calculated doses at the center point of 1000SRS detector ranged from -3.9% to 8.2%. In the dosimetric comparison using 3-D ${\gamma}$-function (3%/3 mm criteria), the mean passing rates with ${\gamma}$-parameter ${\leq}1$ were $97.4%{\pm}2.4%$. The combination of the 1000SRS detector and RW3 slab phantom can be utilized for dosimetry validation of patient-specific QA in the CyberKnife MLC system, which made it possible to measure absolute dose distributions regardless of tracking mode.

Application of Artificial Neural Network for estimation of daily maximum snow depth in Korea (우리나라에서 일최심신적설의 추정을 위한 인공신경망모형의 활용)

  • Lee, Geon;Lee, Dongryul;Kim, Dongkyun
    • Journal of Korea Water Resources Association
    • /
    • v.50 no.10
    • /
    • pp.681-690
    • /
    • 2017
  • This study estimated the daily maximum snow depth using the Artificial Neural Network (ANN) model in Korean Peninsula. First, the optimal ANN model structure was determined through the trial-and-error approach. As a result, daily precipitation, daily mean temperature, and daily minimum temperature were chosen as the input data of the ANN. The number of hidden layer was set to 1 and the number of nodes in the hidden layer was set to 10. In case of using the observed value as the input data of the ANN model, the cross validation correlation coefficient was 0.87, which is higher than that of the case in which the daily maximum snow depth was spatially interpolated using the Ordinary Kriging method (0.40). In order to investigate the performance of the ANN model for estimating the daily maximum snow depth of the ungauged area, the input data of the ANN model was spatially interpolated using Ordinary Kriging. In this case, the correlation coefficient of 0.49 was obtained. The performance of the ANN model in mountainous areas above 200m above sea level was found to be somewhat lower than that in the rest of the study area. This result of this study implies that the ANN model can be used effectively for the accurate and immediate estimation of the maximum snow depth over the whole country.

An Electric Load Forecasting Scheme with High Time Resolution Based on Artificial Neural Network (인공 신경망 기반의 고시간 해상도를 갖는 전력수요 예측기법)

  • Park, Jinwoong;Moon, Jihoon;Hwang, Eenjun
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.6 no.11
    • /
    • pp.527-536
    • /
    • 2017
  • With the recent development of smart grid industry, the necessity for efficient EMS(Energy Management System) has been increased. In particular, in order to reduce electric load and energy cost, sophisticated electric load forecasting and efficient smart grid operation strategy are required. In this paper, for more accurate electric load forecasting, we extend the data collected at demand time into high time resolution and construct an artificial neural network-based forecasting model appropriate for the high time resolution data. Furthermore, to improve the accuracy of electric load forecasting, time series data of sequence form are transformed into continuous data of two-dimensional space to solve that problem that machine learning methods cannot reflect the periodicity of time series data. In addition, to consider external factors such as temperature and humidity in accordance with the time resolution, we estimate their value at the time resolution using linear interpolation method. Finally, we apply the PCA(Principal Component Analysis) algorithm to the feature vector composed of external factors to remove data which have little correlation with the power data. Finally, we perform the evaluation of our model through 5-fold cross-validation. The results show that forecasting based on higher time resolution improve the accuracy and the best error rate of 3.71% was achieved at the 3-min resolution.