• Title/Summary/Keyword: statistical methods

Understanding and Misuse Type of Quality Improvement Tools According to the Kind of Data and the Number of Population in DMAIC Process of Six Sigma (식스시그마 DMAIC 프로세스에서 모집단의 수와 데이터 종류에 따른 품질개선 기법의 오적용 유형 및 이해)

  • Choi, Sung-Woon
    • Proceedings of the Safety Management and Science Conference / 2010.04a / pp.509-517 / 2010
  • This paper presents the types of misuse of statistical quality tools according to the kind of data and the number of populations in the DMAIC process of Six Sigma. The results can be extended to the 15-step QC story used by QC circles. The study also provides improvement methods for control charts, measurement system analysis, statistical difference, and practical equivalence.

Feature Extraction and Statistical Pattern Recognition for Image Data using Wavelet Decomposition

  • Kim, Min-Soo;Baek, Jang-Sun
    • Communications for Statistical Applications and Methods / v.6 no.3 / pp.831-842 / 1999
  • We propose a wavelet decomposition feature extraction method for hand-written character recognition. We study the characteristics of the proposed method by comparing the recognition rates obtained with the original image features and with features selected by the wavelet decomposition. LDA (Linear Discriminant Analysis), QDA (Quadratic Discriminant Analysis), RDA (Regularized Discriminant Analysis), and NN (Neural Network) are used to calculate the recognition rates. The experiment uses 6000 hand-written numerals from CENPARMI at Concordia University. We found that the set of significantly selected wavelet-decomposed features yields a higher recognition rate than the original image features.

Modeling Extreme Values of Ground-Level Ozone Based on Threshold Methods for Markov Chains

  • Seokhoon Yun
    • Communications for Statistical Applications and Methods / v.3 no.2 / pp.249-273 / 1996
  • This paper reviews and develops several statistical models for extreme values, based on threshold methodology. Extreme values of a time series are modeled in terms of tails, which are defined as truncated forms of the original variables, and a Markov property is imposed on the tails. Tails of the generalized extreme value distribution and a multivariate extreme value distribution are used to model the marginal and joint distributions, respectively, of the tails of the series. These models are then applied to real ozone data series collected in the Chicago area. A major concern is detecting any possible trend in the extreme values.

Overview of frequent pattern mining

  • Jurg Ott;Taesung Park
    • Genomics & Informatics / v.20 no.4 / pp.39.1-39.9 / 2022
  • Various methods of frequent pattern mining have been applied to genetic problems, specifically to the combined association of two genotypes (a genotype pattern, or diplotype) at different DNA variants with disease. These methods can produce a selection of genotype patterns that are more common in affected than in unaffected individuals. Assessing the statistical significance of these selected patterns poses some unique problems, which are briefly outlined here.

Application of Statistical Models for Default Probability of Loans in Mortgage Companies

  • Jung, Jin-Whan
    • Communications for Statistical Applications and Methods / v.7 no.2 / pp.605-616 / 2000
  • Three primary concerns frequently raised by mortgage companies are introduced, and the corresponding statistical approaches to modeling default probability are examined. The statistical models considered in this paper are time series, logistic regression, decision tree, neural network, and discrete-time models. Use of the models is illustrated with an artificially modified data set, and the models are evaluated appropriately.

Bootstrap Bandwidth Selection Methods for Local Linear Jump Detector

  • Park, Dong-Ryeon
    • Communications for Statistical Applications and Methods / v.19 no.4 / pp.579-590 / 2012
  • Local linear jump detection in a discontinuous regression function requires a choice of bandwidth, and the performance of a local linear jump detector depends heavily on that choice. However, little attention has been paid to this important issue. In this paper we propose two fully data-adaptive bandwidth selection methods for a local linear jump detector. The performance of the proposed methods is investigated through a simulation study.

Comparison of Bootstrap Methods for LAD Estimator in AR(1) Model

  • Kang, Kee-Hoon;Shin, Key-Il
    • Communications for Statistical Applications and Methods / v.13 no.3 / pp.745-754 / 2006
  • It has been shown that LAD estimates are more efficient than LS estimates when the error distribution is double exponential in the AR(1) model. To explore the performance of LAD estimates, one can use bootstrap approaches. In this paper we consider the efficiencies of bootstrap methods when LAD estimates are applied to highly variable data. Monte Carlo simulation results are given to compare the generalized bootstrap, stationary bootstrap, and threshold bootstrap methods.
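
As a rough illustration of the estimators named in this abstract (not the authors' code), the sketch below simulates an AR(1) series with double exponential errors and computes the LAD and LS estimates of the autoregressive coefficient. It assumes a zero-mean model with no intercept, in which case the LAD estimate reduces to a weighted median of the ratios x_t / x_{t-1}. The function names, sample size, and seed are hypothetical, and the bootstrap schemes compared in the paper are not reproduced here.

```python
import random


def simulate_ar1(n, phi, seed=0):
    """Simulate x_t = phi * x_{t-1} + e_t with double exponential (Laplace) errors."""
    rng = random.Random(seed)
    x = [0.0]
    for _ in range(n):
        # The difference of two independent Exp(1) draws is a standard Laplace variate.
        e = rng.expovariate(1.0) - rng.expovariate(1.0)
        x.append(phi * x[-1] + e)
    return x


def lad_loss(x, phi):
    """Sum of absolute one-step residuals for a candidate coefficient phi."""
    return sum(abs(x[t] - phi * x[t - 1]) for t in range(1, len(x)))


def lad_ar1(x):
    """LAD estimate: weighted median of x_t / x_{t-1} with weights |x_{t-1}|."""
    pairs = sorted(
        (x[t] / x[t - 1], abs(x[t - 1]))
        for t in range(1, len(x))
        if x[t - 1] != 0.0  # terms with a zero lag do not depend on phi
    )
    half = sum(w for _, w in pairs) / 2.0
    acc = 0.0
    for ratio, w in pairs:
        acc += w
        if acc >= half:
            return ratio
    raise ValueError("series has no usable lagged values")


def ls_ar1(x):
    """Least squares estimate of the AR(1) coefficient (no intercept)."""
    num = sum(x[t] * x[t - 1] for t in range(1, len(x)))
    den = sum(x[t - 1] ** 2 for t in range(1, len(x)))
    return num / den


# Assumed settings for illustration only.
series = simulate_ar1(2000, phi=0.5, seed=1)
phi_lad = lad_ar1(series)
phi_ls = ls_ar1(series)
print("LAD estimate:", phi_lad)
print("LS estimate: ", phi_ls)
```

Repeating the simulation over many seeds and comparing the spread of the two estimates would give a simple Monte Carlo check of the efficiency claim.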

Improving Bagging Predictors

  • Kim, Hyun-Joong;Chung, Dong-Jun
    • Proceedings of the Korean Statistical Society Conference / 2005.11a / pp.141-146 / 2005
  • Ensemble methods are known as one of the most powerful classification tools for improving prediction accuracy. They can also be understood as a ‘perturb and combine’ strategy. Many studies have tried to develop ensemble methods by improving the perturbation step. In this paper, we propose two new ensemble methods that improve the combining step, based on the idea of pattern matching. In experiments with simulated data and with real datasets, the proposed ensemble methods performed better than bagging. The proposed ensemble methods gave the most accurate predictions when a pruned tree was used as the base learner.
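
For readers unfamiliar with the baseline, here is a minimal sketch of standard bagging as a 'perturb and combine' procedure: bootstrap resampling is the perturbation, and a majority vote is the combination. It does not implement the pattern-matching combiners proposed in the paper. The one-dimensional decision stump base learner, the synthetic data, and all names and parameter values are assumptions chosen for illustration.

```python
import random
from collections import Counter


def fit_stump(xs, ys):
    """Pick the threshold t that minimises training error for 'predict 1 if x > t'."""
    best_t, best_err = None, None
    for t in sorted(set(xs)):
        err = sum((x > t) != bool(y) for x, y in zip(xs, ys))
        if best_err is None or err < best_err:
            best_t, best_err = t, err
    return best_t


def bagging_fit(xs, ys, n_models=25, seed=0):
    """Perturb: fit one stump per bootstrap resample of the training data."""
    rng = random.Random(seed)
    n = len(xs)
    stumps = []
    for _ in range(n_models):
        idx = [rng.randrange(n) for _ in range(n)]  # sample n indices with replacement
        stumps.append(fit_stump([xs[i] for i in idx], [ys[i] for i in idx]))
    return stumps


def bagging_predict(stumps, x):
    """Combine: majority vote over the individual stump predictions."""
    votes = Counter(int(x > t) for t in stumps)
    return votes.most_common(1)[0][0]


# Synthetic data (assumed): the true class is 1 when x > 0.5, with 10% label noise.
rng = random.Random(42)
xs = [rng.random() for _ in range(200)]
ys = [int(x > 0.5) ^ int(rng.random() < 0.1) for x in xs]

stumps = bagging_fit(xs, ys)
print(bagging_predict(stumps, 0.9), bagging_predict(stumps, 0.1))
```

An odd number of models is used so that the two-class vote cannot tie.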

Graphical Methods for Correlation and Independence

  • Hong, Chong-Sun;Yoon, Jang-Sub
    • Communications for Statistical Applications and Methods / v.13 no.2 / pp.219-231 / 2006
  • When the correlation between two random variables is weak, the value of one variable cannot be used effectively to predict the other. Even when most of the values overlap, it is difficult to find a linear relationship. In this paper, we propose two graphical methods for representing measures of correlation and independence between two random variables. The first method represents their degree of correlation, and the other represents their independence. Both methods are based on the cumulative distribution functions defined in this work.

Classification via principal differential analysis

  • Jang, Eunseong;Lim, Yaeji
    • Communications for Statistical Applications and Methods / v.28 no.2 / pp.135-150 / 2021
  • We propose principal differential analysis based classification methods. Computations of squared multiple correlation function (RSQ) and principal differential analysis (PDA) scores are reviewed; in addition, we combine principal differential analysis results with the logistic regression for binary classification. In the numerical study, we compare the principal differential analysis based classification methods with functional principal component analysis based classification. Various scenarios are considered in a simulation study, and principal differential analysis based classification methods classify the functional data well. Gene expression data is considered for real data analysis. We observe that the PDA score based method also performs well.