• Title/Summary/Keyword: Outlier analysis

Search Result 238, Processing Time 0.039 seconds

Prediction from Linear Regression Equation for Nitrogen Content Measurement in Bentgrasses leaves Using Near Infrared Reflectance Spectroscopy (근적외선 분광분석기를 이용한 잔디 생체잎의 질소 함량 측정을 위한 검량식 개발)

  • Cha, Jung-Hoon;Kim, Kyung-Duck;Park, Dae-Sup
    • Asian Journal of Turfgrass Science
    • /
    • v.23 no.1
    • /
    • pp.77-90
    • /
    • 2009
  • Near Infrared Reflectance Spectroscopy(NIRS) is a quick, accurate, and non-destructive method to measure multiple nutrient components in plant leaves. This study was to acquire a liner regression equation by evaluating the nutrient contents of 'CY2' creeping bentgrass rapidly and accurately using NIRS. In particular, nitrogen fertility is a primary element to keep maintaining good quality of turfgrass. Nitrogen, moisture, carbohydrate, and starch were assessed and analyzed from 'CY2' creeping bentgrass clippings. A linear regression equation was obtained from accessing NIRS values from NIR spectrophotometer(NIR system, Model XDS, XM-1100 series, FOSS, Sweden) programmed with WinISI III project manager v1.50e and ISIscan(R) (Infrasoft International) and calibrated with laboratory values via chemical analysis from an authorized institute. The equation was formulated as MPLS(modified partial least squares) analyzing laboratory values and mathematically pre-treated spectra. The accuracy of the acquired equation was confirmed with SEP(standard error of prediction), which indicated as correlation coefficient($r^2$) and prediction error of sample unacquainted, followed by the verification of model equation of real values and these monitoring results. As results of monitoring, $r^2$ of nitrogen, moisture, and carbohydrate in 'CY2' creeping bentgrass was 0.840, 0.904, and 0.944, respectively. SEP was 0.066, 1.868, and 0.601, respectively. After outlier treatment, $r^2$ was 0.892, 0.925, and 0.971, while SEP was 0.052, 1.577, and 0.394, respectively, which totally showed a high correlation. However, $r^2$ of starch was 0.464, which appeared a low correlation. Thereof, the verified equation appearing higher $r^2$ of nitrogen, moisture, and carbohydrate showed its higher accuracy of prediction model, which finally could be put into practical use for turf management system.

The Application State of the Sunnybrook Facial Grading System for Facial Palsy Patients : A retrospective study (안면마비 환자에 대한 Sunnybrook Facial Grading System의 적용 실태 분석 : 후향적 관찰연구)

  • Han, Ji Sun;Kwon, Min Soo;Kim, Jung Hwan;Jo, Dae Hyun;Jo, Hee Jin;Choi, Ji Eun;Kim, Ji Hye;Kim, Hyun Ho;Lee, Sang Hoon;Park, Young Jae;Park, Young Bae
    • Journal of Acupuncture Research
    • /
    • v.33 no.4
    • /
    • pp.101-108
    • /
    • 2016
  • Objectives : Among the assessment tools for evaluating facial function, the House-Brackmann scale is used as a standard tool, but it has some shortcomings. The Sunnybrook Facial Grading System can assess the after effects of facial palsy and facial movement by each part of the face. By understanding the application state of this Sunnybrook Facial Grading System, we intend to analyze the relationship between House-Brackmann scale score and Sunnybrook Facial Grading System score so that we can examine the advantages of the Sunnybrook Facial Grading System as a more accurate tool. Methods : We screened both inpatients and outpatients who visited the Facial Palsy Center at Kyung Hee University Hospital for Korean medical treatment and were evaluated with the Sunnybrook Facial Grading System from December 2015 to October 2016. A total of 159 out of 166 patients were studied, including basic characteristics and missing data. We used descriptive statistics for general features of patients and SPSS Ver.18 for statistical analysis. Results : House-Brackmann scale and Sunnybrook Facial Grading System have high negative correlation through Pearson Correlation Coefficient with a score of -0.884. Analyzing outlier data resulting from relation analysis between the House-Brackmann scale and the Sunnybrook Facial Grading System showed many outliers when the damaged state of each part of the face is different. Conclusion : Sunnybrook Facial Grading System can make up for faults of the House-Brackmann scale, which is inferior in accuracy when each damage status of each part of the face is different. Sunnybrook Facial Grading System performs a detailed assessment of facial function and sequelae of facial palsy easier than the House-Brackmann scale.

Comparison of Expression Profiling of Gastric Cancer by O1igonucleotide and cDNA Microarrays (O1igonucleotide Microarray와 cDNA Microarray를 이용한 위암조직의 대단위 유전자 발현 비교)

  • Jung, Kwang-Hwa;Kim, Jung-Kyu;Noh, Ji-Heon;Eun, Jung-Woo;Bae, Hyun-Jin;Lee, Sug-Hyung;Park, Won-Sang;Yoo, Nam-Jin;Lee, Jung-Young;Nam, Suk-Woo
    • YAKHAK HOEJI
    • /
    • v.51 no.3
    • /
    • pp.179-185
    • /
    • 2007
  • Gastric cancer is one of the most common malignancies in Korea, but the predominant molecular event underlying gastric carcinogenesis remain unknown. Recently, DNA microarray technology has enabled the comprehensive analysis of gene expression level, and as such has yielded great insight into the molecular nature of cancer, However, despite the powerful approach of this techniques, the technical artifacts and/or bias in applied array platform limited the liability of resultant tens of thousand data points from microarray experiments. Therefore, we applied two different any platforms, such as olignucleotide microarray and cDNA microarray, to identify gastric cancer related large-scale molecular signature of the same human specimens. When thirty sets of matched human gastric cancer and normal tissues subjected to oligonucleotide microarray, total 623 genes were resulted as differently expressed genes in gastric cancer compared to normal tissues, and 252 genes for cDNA microarray analysis. In addition, forty three outlier genes which reflect the characteristic expression signature of gastric cancer beyond array platform and analytical protocol was recapitulated from two different expression profile. In conclusion, we were able to identify robust large-scale molecular changes in gastric cancer by applying two different platform of DNA microarray, this may facilitate to understand molecular carcinogenesis of gastric cancer.

Pupil Data Measurement and Social Emotion Inference Technology by using Smart Glasses (스마트 글래스를 활용한 동공 데이터 수집과 사회 감성 추정 기술)

  • Lee, Dong Won;Mun, Sungchul;Park, Sangin;Kim, Hwan-jin;Whang, Mincheol
    • Journal of Broadcast Engineering
    • /
    • v.25 no.6
    • /
    • pp.973-979
    • /
    • 2020
  • This study aims to objectively and quantitatively determine the social emotion of empathy by collecting pupillary response. 52 subjects (26 men and 26 women) voluntarily participated in the experiment. After the measurement of the reference of 30 seconds, the experiment was divided into the task of imitation and spontaneously self-expression. The two subjects were interacted through facial expressions, and the pupil images were recorded. The pupil data was processed through binarization and circular edge detection algorithm, and outlier detection and removal technique was used to reject eye-blinking. The pupil size according to the empathy was confirmed for statistical significance with test of normality and independent sample t-test. Statistical analysis results, the pupil size was significantly different between empathy (M ± SD = 0.050 ± 1.817)) and non-empathy (M ± SD = 1.659 ± 1.514) condition (t(92) = -4.629, p = 0.000). The rule of empathy according to the pupil size was defined through discriminant analysis, and the rule was verified (Estimation accuracy: 75%) new 12 subjects (6 men and 6 women, mean age ± SD = 22.84 ± 1.57 years). The method proposed in this study is non-contact camera technology and is expected to be utilized in various virtual reality with smart glasses.

Comparative Analysis of Anomaly Detection Models using AE and Suggestion of Criteria for Determining Outliers

  • Kang, Gun-Ha;Sohn, Jung-Mo;Sim, Gun-Wu
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.8
    • /
    • pp.23-30
    • /
    • 2021
  • In this study, we present a comparative analysis of major autoencoder(AE)-based anomaly detection methods for quality determination in the manufacturing process and a new anomaly discrimination criterion. Due to the characteristics of manufacturing site, anomalous instances are few and their types greatly vary. These properties degrade the performance of an AI-based anomaly detection model using the dataset for both normal and anomalous cases, and incur a lot of time and costs in obtaining additional data for performance improvement. To solve this problem, the studies on AE-based models such as AE and VAE are underway, which perform anomaly detection using only normal data. In this work, based on Convolutional AE, VAE, and Dilated VAE models, statistics on residual images, MSE, and information entropy were selected as outlier discriminant criteria to compare and analyze the performance of each model. In particular, the range value applied to the Convolutional AE model showed the best performance with AUC PRC 0.9570, F1 Score 0.8812 and AUC ROC 0.9548, accuracy 87.60%. This shows a performance improvement of an accuracy about 20%P(Percentage Point) compared to MSE, which was frequently used as a standard for determining outliers, and confirmed that model performance can be improved according to the criteria for determining outliers.

A Review of Statistical Methods in the Korean Journal of Orthodontics and the American Journal of Orthodontics and Dentofacial Orthopedics (대한치과교정학회지(KJO)와 미국교정학회지(AJODO)에서 사용된 통계기법의 비교분석 및 고찰(1999-2003))

  • Lim, Hoi-Jeong
    • The korean journal of orthodontics
    • /
    • v.34 no.5 s.106
    • /
    • pp.371-379
    • /
    • 2004
  • The purpose of this study was to investigate the changes and types of statistical methods used in the Korean Journal of Orthodontics (KJO) and the American Journal of Orthodontics and Dentofacial Orthopedics (AJODO) from )999 to 2003. The frequency of use, transitions, assumption check of statistical methods and types of advanced statistical methods were examined from each journal. The study consisted of 247 articles published in the KJO and randomly chosen 50 articles per year which were original articles and used statistical methods T-test, analysis of variance(ANOVA), correlation analysis, nonparametric analysis. regression analysis chi-square test. factor analysis, were the order of statistical methods most frequently used in the KJO, while t-test. ANOVA, nonparametric analysis, correlation analysis, regression analysis, chi-square test. factor analysis. were the order of statistical methods used in the AJODO The changes of statistical methods observed in the KJO were not significant $(X^2=17.4\;p=0.5881)$ but the changes observed in the AJODO was seen to be significant $(x^2=42.4,\;p=0.0397)$ Some of the studies examined had overlooked the assumptions of the statistical methods employed. Data investigation such as outlier should be performed before analysis and alternative statistical approaches are applied for a small sample size. Types of advanced statistical methods were factor analysis and discriminant analysis in the KJO and Intention-To-Treat (ITT) analysis in clinical trials through multi-center, survival analysis and Generalized Estimating Equations (GEE) in the AJODO. Appropriate analysis approaches and interpretations should be applied for the correlated and repeated measurements of the orthodontic data set.

Precision Improvement Methodology of Geotechnical Information through Outlier Analysis (이상치 분석을 통한 3차원 지반정보 정밀도 향상 방안)

  • Lee, Boyoung;Hwang, Bumsik;Kim, Hansaem;Cho, Wanjei
    • Journal of the Korean GEO-environmental Society
    • /
    • v.19 no.2
    • /
    • pp.23-35
    • /
    • 2018
  • Recently, ground disasters such as road collapses and cavities have been frequently occurred in Seoul and downtown areas. As a result, studies on the integrated underground space map is underway as a government's solution. On the other hand, the geotechnical information underlying the integrated underground space map has been being built with more than 220 thousands borehole DB informations through the Integrated DB Center of National Geotechnical Information. To build a three-dimensional integrated underground space map based on the geotechnical information, the reliability of the geotechnical information should be verified by analyzing and evaluating the precision of the geotechnical information. Thereby, studies were conducted on the precision verification and evaluation of the constructed geotechnical information. Thereafter, it has been reviewed how to utilize geotechnical information in addition to analyzing the precision of the geotechnical information in order to visualize three dimensions in geotechnical information. As a further step to the practical DB application, a module is suggested in this study to improve the precision of geotechnical information for establishing reliable three dimensional integrated underground space maps based on the previous research results.

Analysis of Riding Quality Acceptability and Characteristics of Expressway Users and Evaluation of MRI Thresholds using Receiver Operating Characteristic curves (고속도로 이용자의 승차감 평가특성 및 만족도 분석과 ROC 곡선을 이용한 평탄성 관리기준 적정성 검토)

  • Lee, Jaehoon;Sohn, Ducksu;Ryu, SungWoo;Kim, Youngwon;Park, Junyoung
    • International Journal of Highway Engineering
    • /
    • v.20 no.2
    • /
    • pp.35-44
    • /
    • 2018
  • PURPOSES : The purpose of this research is to analyze the characteristics of panels that affect the evaluating results of riding quality and to evaluate the appropriateness of roughness management criteria based on ride comfort satisfaction. METHODS : In order to analyze the influence of panel characteristics of riding quality, 33 panels, consisting of civilians and experts, were selected. Also, considering the roughness distribution of the expressway, 35 sections with MRI ranging from 1.17 m/km to 4.65 m/km were selected. Each panel boarded a passenger car and evaluated the riding quality with grades from 0 to 10, and assessed whether it was satisfied or not. After removing outlier results using a box plot technique, 964 results were analyzed. An ANOVA was conducted to evaluate the effects of panel expertise, age, driving experience, vehicle ownership, and gender on the evaluation results. In addition, by using the receiver operating characteristics (ROC) curve, the MRI value, which can most accurately evaluate the satisfaction with riding quality, was derived. Then, the compatibility of MRI was evaluated using AUC as a criterion to assess whether the riding quality was satisfactory. RESULTS : Only the age of the panel participants were found to have an effect on the riding quality satisfaction. It was found that satisfaction with riding quality and MRI are strongly correlated. The satisfaction rate of roughness management criteria on new (MRI 1.6 m/km) and maintenance (MRI 3.0 m/km) expressways were 95% and 53%, respectively. As a result of evaluating the roughness management criteria by using the ROC curve, it was found that the accuracy of satisfaction was the highest at MRI 3.1-3.2 m/km. In addition, the AUC of the MRI was about 0.8, indicating that the MRI was an appropriate index for evaluating the riding quality satisfaction. CONCLUSIONS : Based on the results, the distribution of the panels' age should be considered when panel rating is conducted. From the results of the ROC curve, MRI of 3.0 m/km, which is a criterion of roughness management on maintenance expressways, is considered as appropriate.

Performance Enhancement of Algorithms based on Error Distributions under Impulsive Noise (충격성 잡음하에서 오차 분포에 기반한 알고리듬의 성능향상)

  • Kim, Namyong;Lee, Gyoo-yeong
    • Journal of Internet Computing and Services
    • /
    • v.19 no.3
    • /
    • pp.49-56
    • /
    • 2018
  • Euclidean distance (ED) between error distribution and Dirac delta function has been used as an efficient performance criterion in impulsive noise environmentsdue to the outlier-cutting effect of Gaussian kernel for error signal. The gradient of ED for its minimization has two components; $A_k$ for kernel function of error pairs and the other $B_k$ for kernel function of errors. In this paper, it is analyzed that the first component is to govern gathering close together error samples, and the other one $B_k$ is to conduct error-sample concentration on zero. Based upon this analysis, it is proposed to normalize $A_k$ and $B_k$ with power of inputs which are modified by kernelled error pairs or errors for the purpose of reinforcing their roles of narrowing error-gap and drawing error samples to zero. Through comparison of fluctuation of steady state MSE and value of minimum MSE in the results of simulation of multipath equalization under impulsive noise, their roles and efficiency of the proposed normalization method are verified.

Genetic signature of strong recent positive selection at interleukin-32 gene in goat

  • Asif, Akhtar Rasool;Qadri, Sumayyah;Ijaz, Nabeel;Javed, Ruheena;Ansari, Abdur Rahman;Awais, Muhammd;Younus, Muhammad;Riaz, Hasan;Du, Xiaoyong
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.30 no.7
    • /
    • pp.912-919
    • /
    • 2017
  • Objective: Identification of the candidate genes that play key roles in phenotypic variations can provide new information about evolution and positive selection. Interleukin (IL)-32 is involved in many biological processes, however, its role for the immune response against various diseases in mammals is poorly understood. Therefore, the current investigation was performed for the better understanding of the molecular evolution and the positive selection of single nucleotide polymorphisms in IL-32 gene. Methods: By using fixation index ($F_{ST}$) based method, IL-32 (9375) gene was found to be outlier and under significant positive selection with the provisional combined allocation of mean heterozygosity and $F_{ST}$. Using nucleotide sequences of 11 mammalian species from National Center for Biotechnology Information database, the evolutionary selection of IL-32 gene was determined using Maximum likelihood model method, through four models (M1a, M2a, M7, and M8) in Codeml program of phylogenetic analysis by maximum liklihood. Results: IL-32 is detected under positive selection using the $F_{ST}$ simulations method. The phylogenetic tree revealed that goat IL-32 was in close resemblance with sheep IL-32. The coding nucleotide sequences were compared among 11 species and it was found that the goat IL-32 gene shared identity with sheep (96.54%), bison (91.97%), camel (58.39%), cat (56.59%), buffalo (56.50%), human (56.13%), dog (50.97%), horse (54.04%), and rabbit (53.41%) respectively. Conclusion: This study provides evidence for IL-32 gene as under significant positive selection in goat.