• 제목/요약/키워드: suspicious data

검색결과 105건 처리시간 0.023초

한국의 기온자료 품질관리 알고리즘의 검증 (Validation of Quality Control Algorithms for Temperature Data of the Republic of Korea)

  • 박창용;최영은
    • 대기
    • /
    • 제22권3호
    • /
    • pp.299-307
    • /
    • 2012
  • This study is aimed to validate errors for detected suspicious temperature data using various quality control procedures for 61 weather stations in the Republic of Korea. The quality control algorithms for temperature data consist of four main procedures (high-low extreme check, internal consistency check, temporal outlier check, and spatial outlier check). Errors of detected suspicious temperature data are judged by examining temperature data of nearby stations, surface weather charts, hourly temperature data, daily precipitation, and daily maximum wind direction. The number of detected errors in internal consistency check and spatial outlier check showed 4 days (3 stations) and 7 days (5 stations), respectively. Effective and objective methods for validation errors through this study will help to reduce manpower and time for conduct of quality management for temperature data.

Bayesian Outlier Detection in Regression Model

  • Younshik Chung;Kim, Hyungsoon
    • Journal of the Korean Statistical Society
    • /
    • 제28권3호
    • /
    • pp.311-324
    • /
    • 1999
  • The problem of 'outliers', observations which look suspicious in some way, has long been one of the most concern in the statistical structure to experimenters and data analysts. We propose a model for an outlier problem and also analyze it in linear regression model using a Bayesian approach. Then we use the mean-shift model and SSVS(George and McCulloch, 1993)'s idea which is based on the data augmentation method. The advantage of proposed method is to find a subset of data which is most suspicious in the given model by the posterior probability. The MCMC method(Gibbs sampler) can be used to overcome the complicated Bayesian computation. Finally, a proposed method is applied to a simulated data and a real data.

  • PDF

Clinical Value of Dividing False Positive Urine Cytology Findings into Three Categories: Atypical, Indeterminate, and Suspicious of Malignancy

  • Matsumoto, Kazumasa;Ikeda, Masaomi;Hirayama, Takahiro;Nishi, Morihiro;Fujita, Tetsuo;Hattori, Manabu;Sato, Yuichi;Ohbu, Makoto;Iwam, Masatsugu
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제15권5호
    • /
    • pp.2251-2255
    • /
    • 2014
  • Background: The aim of this study was to evaluate 10 years of false positive urine cytology records, along with follow-up histologic and cytologic data, to determine the significance of suspicious urine cytology findings. Materials and Methods: We retrospectively reviewed records of urine samples harvested between January 2002 and December 2012 from voided and catheterized urine from the bladder. Among the 21,283 urine samples obtained during this period, we located 1,090 eligible false positive findings for patients being evaluated for the purpose of confirming urothelial carcinoma (UC). These findings were divided into three categories: atypical, indeterminate, and suspicious of malignancy. Results: Of the 1,090 samples classified as false positive, 444 (40.7%) were categorized as atypical, 367 (33.7%) as indeterminate, and 279 (25.6%) as suspicious of malignancy. Patients with concomitant UC accounted for 105 (23.6%) of the atypical samples, 147 (40.1%) of the indeterminate samples, and 139 (49.8%) of the suspicious of malignancy samples (p<0.0001). The rate of subsequent diagnosis of UC during a 1-year follow-up period after harvesting of a sample with false positive urine cytology initially diagnosed as benign was significantly higher in the suspicious of malignancy category than in the other categories (p<0.001). The total numbers of UCs were 150 (33.8%) for atypical samples, 213 (58.0%) for indeterminate samples, and 199 (71.3%) for samples categorized as suspicious of malignancy. Conclusions: Urine cytology remains the most specific adjunctive method for the surveillance of UC. We demonstrated the clinical value of dividing false positive urine cytology findings into three categories, and our results may help clinicians better manage patients with suspicious findings.

A Bayesian Approach to Detecting Outliers Using Variance-Inflation Model

  • Lee, Sangjeen;Chung, Younshik
    • Communications for Statistical Applications and Methods
    • /
    • 제8권3호
    • /
    • pp.805-814
    • /
    • 2001
  • The problem of 'outliers', observations which look suspicious in some way, has long been one of the most concern in the statistical structure to experimenters and data analysts. We propose a model for outliers problem and also analyze it in linear regression model using a Bayesian approach with the variance-inflation model. We will use Geweke's(1996) ideas which is based on the data augmentation method for detecting outliers in linear regression model. The advantage of the proposed method is to find a subset of data which is most suspicious in the given model by the posterior probability The sampling based approach can be used to allow the complicated Bayesian computation. Finally, our proposed methodology is applied to a simulated and a real data.

  • PDF

Malignancy Risk Stratification of Thyroid Nodules with Macrocalcification and Rim Calcification Based on Ultrasound Patterns

  • Hwa Seon Shin;Dong Gyu Na;Wooyul Paik;So Jin Yoon;Hye Yun Gwon;Byeong-Joo Noh;Won Jun Kim
    • Korean Journal of Radiology
    • /
    • 제22권4호
    • /
    • pp.663-671
    • /
    • 2021
  • Objective: To determine the association of macrocalcification and rim calcification with malignancy and to stratify the malignancy risk of thyroid nodules with macrocalcification and rim calcification based on ultrasound (US) patterns. Materials and Methods: The study included a total of 3603 consecutive nodules (≥ 1 cm) with final diagnoses. The associations of macrocalcification and rim calcification with malignancy and malignancy risk of the nodules were assessed overall and in subgroups based on the US patterns of the nodules. The malignancy risk of the thyroid nodules was categorized as high (> 50%), intermediate (upper-intermediate: > 30%, ≤ 50%; lower-intermediate: > 10%, ≤ 30%), and low (≤ 10%). Results: Macrocalcification was independently associated with malignancy in all nodules and solid hypoechoic (SH) nodules (p < 0.001). Rim calcification was not associated with malignancy in all nodules (p = 0.802); however, it was independently associated with malignancy in partially cystic or isoechoic and hyperechoic (PCIH) nodules (p = 0.010). The malignancy risks of nodules with macrocalcification were classified as upper-intermediate and high in SH nodules, and as low and lower-intermediate in PCIH nodules based on suspicious US features. The malignancy risks of nodules with rim calcification were stratified as low and lower-intermediate based on suspicious US features. Conclusion: Macrocalcification increased the malignancy risk in all and SH nodules with or without suspicious US features, with low to high malignancy risks depending on the US patterns. Rim calcification increased the malignancy risk in PCIH nodules, with low and lower-intermediate malignancy risks based on suspicious US features. However, the role of rim calcification in risk stratification of thyroid nodules remains uncertain.

다중 필터를 이용한 실시간 악성코드 탐지 기법 (A Realtime Malware Detection Technique Using Multiple Filter)

  • 박재경
    • 한국컴퓨터정보학회논문지
    • /
    • 제19권7호
    • /
    • pp.77-85
    • /
    • 2014
  • 최근의 클라우드 환경, 빅데이터 환경 등 다양한 환경에서 악성코드나 의심 코드에 의한 피해가 늘어나고 있으며 이를 종합적으로 대응할 수 있는 시스템에 대한 연구가 활발히 이루어지고 있다. 이러한 악성행위가 내포된 의심코드는 사용자의 동의 없이도 PC에 설치되어 사용자가 인지하지 못하는 피해를 양산하고 있다. 또한 다양한 시스템으로부터 수집되는 방대한 양의 데이터를 실시간으로 처리하고 가공하는 기술뿐만 아니라 정교하게 발전하고 있는 악성코드를 탐지 분석하기 위한 대응기술 또한 고도화 되어야 한다. 최근의 악성코드를 원천적으로 탐지하기 위해서는 실행파일에 포함된 악성코드에 대한 정적, 동적 분석을 포함한 분석뿐만 아니라 평판에 의한 검증도 병행되어야 한다. 또한 대량의 데이터를 통해 유사성도 판단하여 실시간으로 대응하는 방안이 절실히 필요하다. 본 논문에서는 이러한 탐지 및 검증 기법을 다중으로 설계하고 이를 실시간으로 처리할 수 있는 방안을 제시하여 의심코드에 대한 대응을 근본적으로 할 수 있도록 연구하였다.

3차원 인체치수 조사 자료의 품질 개선을 위한 연구 (A Study for Quality Improvement of Three-dimensional Body Measurement Data)

  • 박선미;남윤자;박진우
    • 대한인간공학회지
    • /
    • 제28권4호
    • /
    • pp.117-124
    • /
    • 2009
  • To inspect the quality of data collected from a large-scale body measurement and investigation project, it is necessary to establish a proper data editing process. The three-dimensional body measurement may have measuring errors caused from measurer's proficiency or changes in the subject's posture. And it may also have errors caused in the process of algorithm expressing the information obtained from the three-dimensional scanner into numerical values, and in the course of data-processing dealing with numerous data for individuals. When those errors are found, the quality of the measured data is deteriorated, and they consequently reduce the quality of statistics which was conducted on the basis of it. Therefore this study intends to suggest a new way to improve the quality of the data collected from the three-dimensional body measurement by proposing a working procedure identifying data errors and correcting them from the whole data processing procedure-collecting, processing, and analyzing- of the 2004 Size Korea Three-dimensional Body Measurement Project. This study was carried out into three stages: Firstly, we detected erroneous data by examining of logical relations among variables under each edit rule. Secondly, we detected suspicious data through independent examination of individual variable value by sex and age. Finally, we examined scatter-plot matrix of many variables to consider the relationships among them. This simple graphical tool helps us to find out whether some suspicious data exist in the data set or not. As a result of this study, we detected some erroneous data included in the raw data. We figured out that the main errors are not because of the system errors that the three-dimensional body measurement system has but because of the subject's original three-dimensional shape data. Therefore by correcting some erroneous data, we have enhanced data quality.

Simulated Dynamic C&C Server Based Activated Evidence Aggregation of Evasive Server-Side Polymorphic Mobile Malware on Android

  • Lee, Han Seong;Lee, Hyung-Woo
    • International journal of advanced smart convergence
    • /
    • 제6권1호
    • /
    • pp.1-8
    • /
    • 2017
  • Diverse types of malicious code such as evasive Server-side Polymorphic are developed and distributed in third party open markets. The suspicious new type of polymorphic malware has the ability to actively change and morph its internal data dynamically. As a result, it is very hard to detect this type of suspicious transaction as an evidence of Server-side polymorphic mobile malware because its C&C server was shut downed or an IP address of remote controlling C&C server was changed irregularly. Therefore, we implemented Simulated C&C Server to aggregate activated events perfectly from various Server-side polymorphic mobile malware. Using proposed Simulated C&C Server, we can proof completely and classify veiled server-side polymorphic malicious code more clearly.

퍼지-다윈의 불량 신용 탐지 시스템 (Fuzzy Darwinian Detection of Credit Card Fraud)

  • ;김정원;정길호;최종욱
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2000년도 추계학술발표논문집 (상)
    • /
    • pp.277-280
    • /
    • 2000
  • Credit evaluation is one of the most important and difficult tasks fur credit card companies, mortgage companies, banks and other financial institutes. Incorrect credit judgement causes huge financial losses. This work describes the use of an evolutionary-fuzzy system capable of classifying suspicious and non-suspicious credit card transactions. The paper starts with the details of the system used in this work. A series of experiments are described, showing that the complete system is capable of attaining good accuracy and intelligibility levels for real data.

  • PDF

치매노인, 치매의심노인 및 일반노인의 우울에 영향을 미치는 요인 (A Study on the Factors Influencing Depression among Elderly People with, and without, Dementia)

  • 이금재;이신영
    • 기본간호학회지
    • /
    • 제11권2호
    • /
    • pp.166-176
    • /
    • 2004
  • Purpose: The purpose of this study was to identify the factors that affect depression among elderly people with, and without, dementia. Method: The participants were 903 people who were 65 or older and resided in Sungnam City. Data were collected from April to July 2002 using a questionnaire. The collected data were analyzed using descriptive statistics and hierarchical multiple regression aided by SPSS/PC. Result: The variables at the final step of the regression equation accounted for 28.2% of variance in the dementia group, 21.4% in the group with suspicious dementia, and 18.9% in the normal group. The multiple regression analysis revealed that ADL and instrumental support were related significantly to depression in the dementia group. Self-rated health, IADL, social activity support, and instrumental support were significantly related to depression in the group with suspicious dementia. In the normal group, education, self-rated health, and living arrangement with family were significantly related to depression. Conclusion: Social support and health condition are important to decrease depression in elderly people with dementia.

  • PDF