• 제목/요약/키워드: Missing Values

검색결과 441건 처리시간 0.022초

The effect of dental scaling noise during intravenous sedation on acoustic respiration rate (RRaTM)

  • Kim, Jung Ho;Chi, Seong In;Kim, Hyun Jeong;Seo, Kwang-Suk
    • Journal of Dental Anesthesia and Pain Medicine
    • /
    • 제18권2호
    • /
    • pp.97-103
    • /
    • 2018
  • Background: Respiration monitoring is necessary during sedation for dental treatment. Recently, acoustic respiration rate ($RRa^{TM}$), an acoustics-based respiration monitoring method, has been used in addition to auscultation or capnography. The accuracy of this method may be compromised in an environment with excessive noise. This study evaluated whether noise from the ultrasonic scaler affects the performance of RRa in respiratory rate measurement. Methods: We analyzed data from 49 volunteers who underwent scaling under intravenous sedation. Clinical tests were divided into preparation, sedation, and scaling periods; respiratory rate was measured at 2-s intervals for 3 min in each period. Missing values ratios of the RRa during each period were measuerd; correlation analysis and Bland-Altman analysis were performed on respiratory rates measured by RRa and capnogram. Results: Respective missing values ratio from RRa were 5.62%, 8.03%, and 23.95% in the preparation, sedation, and scaling periods, indicating an increased missing values ratio in the scaling period (P < 0.001). Correlation coefficients of the respiratory rate, measured with two different methods, were 0.692, 0.677, and 0.562 in each respective period. Mean capnography-RRa biases in Bland-Altman analyses were -0.03, -0.27, and -0.61 in each respective period (P < 0.001); limits of agreement were -4.84-4.45, -4.89-4.15, and -6.18-4.95 (P < 0.001). Conclusions: The probability of missing respiratory rate values was higher during scaling when RRa was used for measurement. Therefore, the use of RRa alone for respiration monitoring during ultrasonic scaling may not be safe.

Imputation of Medical Data Using Subspace Condition Order Degree Polynomials

  • Silachan, Klaokanlaya;Tantatsanawong, Panjai
    • Journal of Information Processing Systems
    • /
    • 제10권3호
    • /
    • pp.395-411
    • /
    • 2014
  • Temporal medical data is often collected during patient treatments that require personal analysis. Each observation recorded in the temporal medical data is associated with measurements and time treatments. A major problem in the analysis of temporal medical data are the missing values that are caused, for example, by patients dropping out of a study before completion. Therefore, the imputation of missing data is an important step during pre-processing and can provide useful information before the data is mined. For each patient and each variable, this imputation replaces the missing data with a value drawn from an estimated distribution of that variable. In this paper, we propose a new method, called Newton's finite divided difference polynomial interpolation with condition order degree, for dealing with missing values in temporal medical data related to obesity. We compared the new imputation method with three existing subspace estimation techniques, including the k-nearest neighbor, local least squares, and natural cubic spline approaches. The performance of each approach was then evaluated by using the normalized root mean square error and the statistically significant test results. The experimental results have demonstrated that the proposed method provides the best fit with the smallest error and is more accurate than the other methods.

결정트리를 이용하는 불완전한 데이터 처리기법 (Incomplete data handling technique using decision trees)

  • 이종찬
    • 한국융합학회논문지
    • /
    • 제12권8호
    • /
    • pp.39-45
    • /
    • 2021
  • 본 논문은 손실값을 포함하는 불완전한 데이터를 처리하는 방법에 대해 논한다. 손실값을 최적으로 처리한다는 것은 학습 데이터가 가지고 있는 정보들에서 본래값과 가장 근사한 추정치를 구하고, 이 값으로 손실값을 대치하는 것이다. 이것을 실현하기 위한 방안으로 분류기가 정보를 분류하는 과정에서 완성되어가는 결정트리를 이용한다. 다시말해 이 결정트리는 전체 학습 데이터 중에서 손실값을 포함하지 않는 완전한 정보만을 C4.5 분류기에 입력하여 학습하는 과정에서 얻어진다. 이 결정트리의 노드들은 분류 변수의 정보를 가지는데, 루트에 가까운 상위 노드일수록 많은 정보를 포함하게 되고 말단 노드에서는 루트로부터의 경로를 통해 분류 영역을 형성하게 된다. 또한 각 영역에는 분류된 데이터 사건들의 평균이 기록된다. 손실값을 포함하는 사건들은 이러한 결정트리에 입력되어 각 노드의 정보에 따라 순회과정을 통해 사건과 가장 근접한 영역을 찾아가게 된다. 이 영역에 기록된 평균값을 손실값의 추정치로 간주하고, 보상 과정은 완성된다.

결측이 있는 이산형 공변량에 대한 Cox비례위험모형의 패턴-혼합 모델 (Pattern-Mixture Model of the Cox Proportional Hazards Model with Missing Binary Covariates)

  • 육태미;송주원
    • 응용통계연구
    • /
    • 제25권2호
    • /
    • pp.279-291
    • /
    • 2012
  • 공변량에 결측이 발생한 Cox 비례위험 모형을 적합할 때, 결측이 발생하는 개체를 모두 제거한 후 분석을 실시한다면 정보 손실에 의해 비효율적이고 결측의 발생 메커니즘이 완전 임의 결측(missing completely at random; MCAR)이 아니라면 모수의 추정값에 편향이 발생할 수 있다. Cox 비례위험 회귀모형의 공변량에 결측이 있는 경우 적용할 수 있는 여러 가지 방법들이 제안되어져 왔으나 이 분석들은 선택모델(selection model)에 기반하고 있다. 본 연구에서는 Little (1993)이 제안한 패턴-혼합 모델(pattern-mixture model)을 사용하여 Cox 비례위험 회귀모형에서 생존시간과 결측 메커니즘의 결합분포를 모델화 하고, 여러 가지 제약에 근거한 생존 분석의 결과를 비교하였다. 모의실험을 통해서 패턴-혼합 모델의 제약(restrictions)에 따른 모수 추정의 민감도를 확인하였고 결측을 무시한 채 분석한 결과 및 선택모형에 근거한 분석결과와 비교하였다. 패턴-혼합 모델의 제약에 따라 공변량의 결측으로 인한 모수 추정의 민감성 정도를 쥐백혈병 자료 예제를 통해 설명하였다.

화학반응 공정에서의 공정조건 및 허용차설계 (Parameter and Tolerance Designs of the Chemical Reaction Process)

  • 안종석;윤원영
    • 품질경영학회지
    • /
    • 제30권1호
    • /
    • pp.97-117
    • /
    • 2002
  • We apply Taguchi method to the process optimization problem of chemical reaction process, and some case studies are done to find out the way for cost reduction and quality improvement The parameter and tolerance designs of Taguchi mettled are done with operation data of a chemical process and we propose a procedure how to use and analyze the operation data to find the optimal process conditions and tolerance limits. In order to use the continuous values in experiment conditions, it is suggested how to determine the interval of each level by discrete values and to treat any missing values caused from discrete 4 levels.

앙상블 기법을 활용한 대학생 중도탈락 예측 모형 개발 (A Study on the Development of University Students Dropout Prediction Model Using Ensemble Technique)

  • 박상성
    • 디지털산업정보학회논문지
    • /
    • 제17권1호
    • /
    • pp.109-115
    • /
    • 2021
  • The number of freshmen at universities is decreasing due to the recent decline in the school-age population, and the survival of many universities is threatened. To overcome this situation, universities are seeking ways to use big data within the school to improve the quality of education. A study on the prediction of dropout students is a representative case of using big data in universities. The dropout prediction can prepare a systematic management plan by identifying students who will drop out of school due to reasons such as dropout or expulsion. In the case of actual on-campus data, a large number of missing values are included because it is collected and managed by various departments. For this reason, it is necessary to construct a model by effectively reflecting the missing values. In this study, we propose a university student dropout prediction model based on eXtreme Gradient Boost that can be applied to data with many missing values and shows high performance. In order to examine the practical applicability of the proposed model, an experiment was performed using data from C University in Chungbuk. As a result of the experiment, the prediction performance of the proposed model was found to be excellent. The management strategy of dropout students can be established through the prediction results of the model proposed in this paper.

Association Rule Mining Algorithm and Analysis of Missing Values

  • Lee, Jae-Wan;Bobby D. Gerardo;Kim, Gui-Tae;Jeong, Jin-Seob
    • Journal of information and communication convergence engineering
    • /
    • 제1권3호
    • /
    • pp.150-156
    • /
    • 2003
  • This paper explored the use of an algorithm for the data mining and method in handling missing data which had generated enhanced association patterns observed using the data illustrated here. The evaluations showed that more association patterns are generated in the second analysis which suggests more meaningful rules than in the first situation. It showed that the model offer more precise and important association rules that is more valuable when applied for business decision making. With the discovery of accurate association rules or business patterns, strategies could be efficiently planned out and implemented to improve marketing schemes. This investigation gives rise to a number of interesting issues that could be explored further like the effect of outliers and missing data for detecting fraud and devious database entries.

Conditional bootstrap confidence intervals for classification error rate when a block of observations is missing

  • Chung, Hie-Choon;Han, Chien-Pai
    • Journal of the Korean Data and Information Science Society
    • /
    • 제24권1호
    • /
    • pp.189-200
    • /
    • 2013
  • In this paper, it will be assumed that there are two distinct populations which are multivariate normal with equal covariance matrix. We also assume that the two populations are equally likely and the costs of misclassification are equal. The classification rule depends on the situation whether the training samples include missing values or not. We consider the conditional bootstrap confidence intervals for classification error rate when a block of observation is missing.

A New Estimation Model for Wireless Sensor Networks Based on the Spatial-Temporal Correlation Analysis

  • Ren, Xiaojun;Sug, HyonTai;Lee, HoonJae
    • Journal of information and communication convergence engineering
    • /
    • 제13권2호
    • /
    • pp.105-112
    • /
    • 2015
  • The estimation of missing sensor values is an important problem in sensor network applications, but the existing approaches have some limitations, such as the limitations of application scope and estimation accuracy. Therefore, in this paper, we propose a new estimation model based on a spatial-temporal correlation analysis (STCAM). STCAM can make full use of spatial and temporal correlations and can recognize whether the sensor parameters have a spatial correlation or a temporal correlation, and whether the missing sensor data are continuous. According to the recognition results, STCAM can choose one of the most suitable algorithms from among linear interpolation algorithm of temporal correlation analysis (TCA-LI), multiple regression algorithm of temporal correlation analysis (TCA-MR), spatial correlation analysis (SCA), spatial-temporal correlation analysis (STCA) to estimate the missing sensor data. STCAM was evaluated over Intel lab dataset and a traffic dataset, and the simulation experiment results show that STCAM has good estimation accuracy.

2×2 교차계획법에서 결측치가 있을 때의 결측치 처리 방법 비교에 관한 연구 (Comparison of Single Imputation Methods in 2×2 Cross-Over Design with Missing Observations)

  • 조보배;김동재
    • 응용통계연구
    • /
    • 제28권3호
    • /
    • pp.529-540
    • /
    • 2015
  • 의과학 분야에서 교차계획법은 임상시험을 통한 두 처리의 비교 검정에 이용되고 있으며 생물학적 동등성 시험에 자주 이용되고 있다. $2{\times}2$ 교차계획법에서 2시기에 결측치가 발생했을 때 통상적으로 결측치가 발생한 개체를 삭제하고 모수적 검정을 한다. 하지만 소표본으로 진행되는 $2{\times}2$ 교차계획법에서 일부 관측치의 삭제가 통계적 분석에 크게 영향을 미칠 수 있다. 본 논문에서는 소표본으로 이루어지는 $2{\times}2$ 교차계획법에서 2시기에 결측치가 발생했을 때 단순대체법들을 적용한 후 Hills-Armitage (1979)의 모수적 검정법과 Koch (1972)와 Kim (1999)이 제안한 비모수적 검정법들의 제 1종오류와 검정력을 몬테카를로 모의실험(Monte-Carlo simulation)을 통하여 비교하였다.