• Title/Summary/Keyword: Cross - Validation

Search Result 994, Processing Time 0.03 seconds

Severity-Adjusted Mortality Rates : The Case of CABG Surgery (관상동맥우회술 수술환자의 수술 후 사망률 예측모형의 개발)

  • Park, Hyeung-Keun;Kwon, Young-Dae;Shin, You-Cheol;Lee, Jin-Seok;Kim, Hae-Joon;Sohn, Moon-Jun;Ahn, Hyeong-Sik
    • Journal of Preventive Medicine and Public Health
    • /
    • v.34 no.1
    • /
    • pp.21-27
    • /
    • 2001
  • Objectives : To develop a model that will predict the mortality of patients undergoing Coronary Artery Bypass Graft (CABG) and evaluate the perfermance of hospitals. Methods : Data from 564 CABGs peformed in six general hospitals were collected through medical record abstraction by registered nurses. Variables studied involved risk factors determined by severity measures. Risk modeling was performed through logistic repression and validated with cross-validation. The statistical performance of the developed model was evaluated using c-statistic, $R^2$, and Hosmer-Lemeshow statistic. Hospital performance was assessed by severity-adjusted mortalities. Results : The developed model included age, sex, BUN, EKG rhythm, Congestive Heart Failure at admission. acute mental change within 24 hours, and previous angina pectoris history. The c-statistic and $R^2$ were 0.791 and 0.001, respectively. Hosmer-Lemeshow statistic was 10.3(p value=0.2415). One hospital had a significantly higher mortality rate than the average mortality rate, while others were net significantly different. Conclusion : Comparing the quality of service by severity adjusted mortality rates, there were significant differences in hospital performance. The severity adjusted mortality rate of CABG surgery may He an indicator for evaluating hospital performance in Korea.

  • PDF

Performance Analysis of SAR System Using Radar Target Simulation Equipment (표적모의장치를 이용한 SAR 장비의 성능 분석)

  • Kweon, Soon-Koo;Yeo, Hwan-Yong;Park, Sung-Min;Han, Ji-Hoon;Jung, Chang-Sik;Kim, Ki-Wan;Shin, Hyun-Ik
    • The Journal of Korean Institute of Electromagnetic Engineering and Science
    • /
    • v.29 no.2
    • /
    • pp.118-127
    • /
    • 2018
  • In this work, we have designed and manufactured radar target simulation equipment for the performance analysis of synthetic aperture radar(SAR) systems. First, we have explained the function and performance specification of the target simulation equipment and point target scenario generation for validation of the SAR system. In addition, we have developed a simple and accurate calibration method for the time delay of the SAR system using the manufactured target simulation equipment. We have analyzed the point target impulse response function of the SAR image acquired using the SAR system and the target simulation equipment. It was observed that the measured peak to side lobe ratio(=-13.25 dB) and resolution(=0.49 m) are in good agreement with the corresponding theoretical values.

A Fundamental Study on Detection of Weeds in Paddy Field using Spectrophotometric Analysis (분광특성 분석에 의한 논 잡초 검출의 기초연구)

  • 서규현;서상룡;성제훈
    • Journal of Biosystems Engineering
    • /
    • v.27 no.2
    • /
    • pp.133-142
    • /
    • 2002
  • This is a fundamental study to develop a sensor to detect weeds in paddy field using machine vision adopted spectralphotometric technique in order to use the sensor to spread herbicide selectively. A set of spectral reflectance data was collected from dry and wet soil and leaves of rice and 6 kinds of weed to select desirable wavelengths to classify soil, rice and weeds. Stepwise variable selection method of discriminant analysis was applied to the data set and wavelengths of 680 and 802 m were selected to distinguish plants (including rice and weeds) from dry and wet soil, respectively. And wavelengths of 580 and 680 nm were selected to classify rice and weeds by the same method. Validity of the wavelengths to distinguish the plants from soil was tested by cross-validation test with built discriminant function to prove that all of soil and plants were classified correctly without any failure. Validity of the wavelengths for classification of rice and weeds was tested by the same method and the test resulted that 98% of rice and 83% of weeds were classified correctly. Feasibility of CCD color camera to detect weeds in paddy field was tested with the spectral reflectance data by the same statistical method as above. Central wavelengths of RGB frame of color camera were tried as tile effective wavelengths to distingush plants from soil and weeds from plants. The trial resulted that 100% and 94% of plants in dry soil and wet soil, respectively, were classified correctly by the central wavelength or R frame only, and 95% of rice and 85% of weeds were classified correctly by the central wavelengths of RGB frames. As a result, it was concluded that CCD color camera has good potential to be used to detect weeds in paddy field.

Extraction Method of Significant Clinical Tests Based on Data Discretization and Rough Set Approximation Techniques: Application to Differential Diagnosis of Cholecystitis and Cholelithiasis Diseases (데이터 이산화와 러프 근사화 기술에 기반한 중요 임상검사항목의 추출방법: 담낭 및 담석증 질환의 감별진단에의 응용)

  • Son, Chang-Sik;Kim, Min-Soo;Seo, Suk-Tae;Cho, Yun-Kyeong;Kim, Yoon-Nyun
    • Journal of Biomedical Engineering Research
    • /
    • v.32 no.2
    • /
    • pp.134-143
    • /
    • 2011
  • The selection of meaningful clinical tests and its reference values from a high-dimensional clinical data with imbalanced class distribution, one class is represented by a large number of examples while the other is represented by only a few, is an important issue for differential diagnosis between similar diseases, but difficult. For this purpose, this study introduces methods based on the concepts of both discernibility matrix and function in rough set theory (RST) with two discretization approaches, equal width and frequency discretization. Here these discretization approaches are used to define the reference values for clinical tests, and the discernibility matrix and function are used to extract a subset of significant clinical tests from the translated nominal attribute values. To show its applicability in the differential diagnosis problem, we have applied it to extract the significant clinical tests and its reference values between normal (N = 351) and abnormal group (N = 101) with either cholecystitis or cholelithiasis disease. In addition, we investigated not only the selected significant clinical tests and the variations of its reference values, but also the average predictive accuracies on four evaluation criteria, i.e., accuracy, sensitivity, specificity, and geometric mean, during l0-fold cross validation. From the experimental results, we confirmed that two discretization approaches based rough set approximation methods with relative frequency give better results than those with absolute frequency, in the evaluation criteria (i.e., average geometric mean). Thus it shows that the prediction model using relative frequency can be used effectively in classification and prediction problems of the clinical data with imbalanced class distribution.

Improvement of Classification Accuracy of Different Finger Movements Using Surface Electromyography Based on Long Short-Term Memory (LSTM을 이용한 표면 근전도 분석을 통한 서로 다른 손가락 움직임 분류 정확도 향상)

  • Shin, Jaeyoung;Kim, Seong-Uk;Lee, Yun-Sung;Lee, Hyung-Tak;Hwang, Han-Jeong
    • Journal of Biomedical Engineering Research
    • /
    • v.40 no.6
    • /
    • pp.242-249
    • /
    • 2019
  • Forearm electromyography (EMG) generated by wrist movements has been widely used to develop an electrical prosthetic hand, but EMG generated by finger movements has been rarely used even though 20% of amputees lose fingers. The goal of this study is to improve the classification performance of different finger movements using a deep learning algorithm, and thereby contributing to the development of a high-performance finger-based prosthetic hand. Ten participants took part in this study, and they performed seven different finger movements forty times each (thumb, index, middle, ring, little, fist and rest) during which EMG was measured from the back of the right hand using four bipolar electrodes. We extracted mean absolute value (MAV), root mean square (RMS), and mean (MEAN) from the measured EMGs for each trial as features, and a 5x5-fold cross-validation was performed to estimate the classification performance of seven different finger movements. A long short-term memory (LSTM) model was used as a classifier, and linear discriminant analysis (LDA) that is a widely used classifier in previous studies was also used for comparison. The best performance of the LSTM model (sensitivity: 91.46 ± 6.72%; specificity: 91.27 ± 4.18%; accuracy: 91.26 ± 4.09%) significantly outperformed that of LDA (sensitivity: 84.55 ± 9.61%; specificity: 84.02 ± 6.00%; accuracy: 84.00 ± 5.87%). Our result demonstrates the feasibility of a deep learning algorithm (LSTM) to improve the performance of classifying different finger movements using EMG.

Korean Syntactic Rules using Composite Labels (복합 레이블을 적용한 한국어 구문 규칙)

  • 김성용;이공주;최기선
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.2
    • /
    • pp.235-244
    • /
    • 2004
  • We propose a format of a binary phrase structure grammar with composite labels. The grammar adopts binary rules so that the dependency between two sub-trees can be represented in the label of the tree. The label of a tree is composed of two attributes, each of which is extracted from each sub-tree so that it can represent the compositional information of the tree. The composite label is generated from part-of-speech tags using an automatic labeling algorithm. Since the proposed rule description scheme is binary and uses only part-of-speech information, it can readily be used in dependency grammar and be applied to other languages as well. In the best-1 context-free cross validation on 31,080 tree-tagged corpus, the labeled precision is 79.30%, which outperforms phrase structure grammar and dependency grammar by 5% and by 4%, respectively. It shows that the proposed rule description scheme is effective for parsing Korean.

Signal Sequence Prediction Based on Hydrophobicity and Substitution Matrix (소수성과 치환행렬에 기반한 신호서열 예측)

  • Chi, Sang-Mun
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.7
    • /
    • pp.595-602
    • /
    • 2007
  • This paper proposes a method that discriminates signal peptide and predicts the cleavage site of the secretory proteins cleaved by the signal peptidase I. The preprocessing stage uses hydrophobicity scales of amino acids in order to predict the presence of signal sequence and the cleavage site. The preprocessing enhances the performance of the prediction method by eliminating the non-secretory proteins in the early stage of prediction. for the effective use of support vector machine for the signal sequence prediction, the biologically relevant distance between the amino acid sequences is defined by using the hydrophobicity and substitution matrix; the hydrophobicity can be used to Predict the location of amino acid in a cell and the substitution matrix represents the evolutionary relationships of amino acids. The proposed method showed 98.9% discrimination rates from signal sequences and 88% correct rate of the cleavage site prediction on Swiss-Prot release 50 protein database using the 5-fold-cross-validation. In the comparison tests, the proposed method has performed significantly better than other prediction methods.

cmicroRNA prediction using Bayesian network with biologically relevant feature set (생물학적으로 의미 있는 특질에 기반한 베이지안 네트웍을 이용한 microRNA의 예측)

  • Nam, Jin-Wu;Park, Jong-Sun;Zhang, Byoung-Tak
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2006.10a
    • /
    • pp.53-58
    • /
    • 2006
  • MicroRNA (miRNA)는 약 22 nt의 작은 RNA 조각으로 이루어져 있으며 stem-loop 구조의 precursor 형태에서 최종적으로 만들어 진다. miRNA는 mRNA의 3‘UTR에 상보적으로 결합하여 유전자의 발현을 억제하거나 mRNA의 분해를 촉진한다. miRNA를 동정하기 위한 실험적인 방법은 조직 특이적인 발현, 적은 발현양 때문에 방법상 한계를 가지고 있다. 이러한 한계는 컴퓨터를 이용한 방법으로 어느 정도 해결될 수 있다. 하지만 miRNA의 서열상의 낮은 보존성은 homology를 기반으로 한 예측을 어렵게 한다. 또한 기계학습 방법인 support vector machine (SVM) 이나 naive bayes가 적용되었지만, 생물학적인 의미를 해석할 수 있는 generative model을 제시해 주지 못했다. 본 연구에서는 우수한 miRNA 예측을 보일 뿐만 아니라 학습된 모델로부터 생물학적인 지식을 얻을 수 있는 Bayesian network을 적용한다. 이를 위해서는 생물학적으로 의미 있는 특질들의 선택이 중요하다. 여기서는 position weighted matrix (PWM)과 Markov chain probability (MCP), Loop 크기, Bulge 수, spectrum, free energy profile 등을 특질로서 선택한 후 Information gain의 특질 선택법을 통해 예측에 기여도가 높은 특질 25개 와 27개를 최종적으로 선택하였다. 이로부터 Bayesian network을 학습한 후 miRNA의 예측 성능을 10 fold cross-validation으로 확인하였다. 그 결과 pre-/mature miRNA 각 각에 대한 예측 accuracy가 99.99% 100.00%를 보여, SVM이나 naive bayes 방법보다 높은 결과를 보였으며, 학습된 Bayesian network으로부터 이전 연구 결과와 일치하는 pre-miRNA 상의 의존관계를 분석할 수 있었다.

  • PDF

Clinical significance of APOB inactivation in hepatocellular carcinoma

  • Lee, Gena;Jeong, Yun Seong;Kim, Do Won;Kwak, Min Jun;Koh, Jiwon;Joo, Eun Wook;Lee, Ju-Seog;Kah, Susie;Sim, Yeong-Eun;Yim, Sun Young
    • Experimental and Molecular Medicine
    • /
    • v.50 no.11
    • /
    • pp.7.1-7.12
    • /
    • 2018
  • Recent findings from The Cancer Genome Atlas project have provided a comprehensive map of genomic alterations that occur in hepatocellular carcinoma (HCC), including unexpected mutations in apolipoprotein B (APOB). We aimed to determine the clinical significance of this non-oncogenetic mutation in HCC. An Apob gene signature was derived from genes that differed between control mice and mice treated with siRNA specific for Apob (1.5-fold difference; P < 0.005). Human gene expression data were collected from four independent HCC cohorts (n = 941). A prediction model was constructed using Bayesian compound covariate prediction, and the robustness of the APOB gene signature was validated in HCC cohorts. The correlation of the APOB signature with previously validated gene signatures was performed, and network analysis was conducted using ingenuity pathway analysis. APOB inactivation was associated with poor prognosis when the APOB gene signature was applied in all human HCC cohorts. Poor prognosis with APOB inactivation was consistently observed through cross-validation with previously reported gene signatures (NCIP A, HS, high-recurrence SNUR, and high RS subtypes). Knowledge-based gene network analysis using genes that differed between low-APOB and high-APOB groups in all four cohorts revealed that low-APOB activity was associated with upregulation of oncogenic and metastatic regulators, such as HGF, MTIF, ERBB2, FOXM1, and CD44, and inhibition of tumor suppressors, such as TP53 and PTEN. In conclusion, APOB inactivation is associated with poor outcome in patients with HCC, and APOB may play a role in regulating multiple genes involved in HCC development.

Comparison of Univariate Kriging Algorithms for GIS-based Thematic Mapping with Ground Survey Data (현장 조사 자료를 이용한 GIS 기반 주제도 작성을 위한 단변량 크리깅 기법의 비교)

  • Park, No-Wook
    • Korean Journal of Remote Sensing
    • /
    • v.25 no.4
    • /
    • pp.321-338
    • /
    • 2009
  • The objective of this paper is to compare spatial prediction capabilities of univariate kriging algorithms for generating GIS-based thematic maps from ground survey data with asymmetric distributions. Four univariate kriging algorithms including traditional ordinary kriging, three non-linear transform-based kriging algorithms such as log-normal kriging, multi-Gaussian kriging and indicator kriging are applied for spatial interpolation of geochemical As and Pb elements. Cross validation based on a leave-one-out approach is applied and then prediction errors are computed. The impact of the sampling density of the ground survey data on the prediction errors are also investigated. Through the case study, indicator kriging showed the smallest prediction errors and superior prediction capabilities of very low and very high values. Other non-linear transform based kriging algorithms yielded better prediction capabilities than traditional ordinary kriging. Log-normal kriging which has been widely applied, however, produced biased estimation results (overall, overestimation). It is expected that such quantitative comparison results would be effectively used for the selection of an optimal kriging algorithm for spatial interpolation of ground survey data with asymmetric distributions.