• Title/Summary/Keyword: Mixture Normal Distribution

Search Result 84, Processing Time 0.026 seconds

Statistical Tests for Process Capability Index Cp Based on Mixture Normal Process (혼합 정규공정 하에서의 공정능력지수 Cp에 대한 가설검정)

  • Cho, Joong Jae;Heo, Tae-Young;Jeong, Jun Chel
    • Journal of Korean Society for Quality Management
    • /
    • v.42 no.2
    • /
    • pp.209-219
    • /
    • 2014
  • Purpose: The purpose of this study is to develop the statistical test for process capability index $C_p$ based on mixture normal process. Methods: This study uses Bootstrap method to calculate the approximate P-value for various simulation conditions under mixture normal process. Results: This study indicates that our proposed method is effective way to test for process capability index $C_p$ based on mixture normal process. Conclusion: This study finds out that statistical test for process capability index $C_p$ based on mixture normal process is useful for real application.

Estimating Suitable Probability Distribution Function for Multimodal Traffic Distribution Function

  • Yoo, Sang-Lok;Jeong, Jae-Yong;Yim, Jeong-Bin
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.21 no.3
    • /
    • pp.253-258
    • /
    • 2015
  • The purpose of this study is to find suitable probability distribution function of complex distribution data like multimodal. Normal distribution is broadly used to assume probability distribution function. However, complex distribution data like multimodal are very hard to be estimated by using normal distribution function only, and there might be errors when other distribution functions including normal distribution function are used. In this study, we experimented to find fit probability distribution function in multimodal area, by using AIS(Automatic Identification System) observation data gathered in Mokpo port for a year of 2013. By using chi-squared statistic, gaussian mixture model(GMM) is the fittest model rather than other distribution functions, such as extreme value, generalized extreme value, logistic, and normal distribution. GMM was found to the fit model regard to multimodal data of maritime traffic flow distribution. Probability density function for collision probability and traffic flow distribution will be calculated much precisely in the future.

Modeling on asymmetric circular data using wrapped skew-normal mixture (겹친왜정규혼합분포를 이용한 비대칭 원형자료의 모형화)

  • Na, Jong-Hwa;Jang, Young-Mi
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.2
    • /
    • pp.241-250
    • /
    • 2010
  • Over the past few decades, several studies have been made on the modeling of circular data. But these studies focused mainly on the symmetrical cases including von Mises distribution. Recently, many studies with skew-normal distribution have been conducted in the linear case. In this paper, we dealt the problem of fitting of non-symmetrical circular data with wrapped skew-normal distribution which can be derived by using the principle of wrapping. Wrapped skew-normal distribution is very flexible to asymmetical data as well as to symmetrical data. Multi-modal data are also fitted by using the mixture of wrapped skew-normal distributions. To estimate the parameters of mixture, we suggested the EM algorithm. Finally we verified the accuracy of the suggested algorithm through simulation studies. Application with real data is also considered.

ESTIMATION IN A MIXTURE NORMAL DISTRIBUTION

  • Jee-Seon Baik
    • Journal of applied mathematics & informatics
    • /
    • v.4 no.1
    • /
    • pp.223-234
    • /
    • 1997
  • By Stochastic simulations we discuss the fitness of a mix-ture normal distribution to observations from general mixture distribu-tions using the MLE method and the EM algorithm. We calulate the probability of misclassifying objects and estimate the optimal number of mixture components with mutual information measure.

Application of Finite Mixture to Characterise Degraded Gmelina arborea Roxb Plantation in Omo Forest Reserve, Nigeria

  • Ogana, Friday Nwabueze
    • Journal of Forest and Environmental Science
    • /
    • v.34 no.6
    • /
    • pp.451-456
    • /
    • 2018
  • The use of single component distribution to describe the irregular stand structure of degraded forest often lead to bias. Such biasness can be overcome by the application of finite mixture distribution. Therefore, in this study, finite mixture distribution was used to characterise the irregular stand structure of the Gmelina arborea plantation in Omo forest reserve. Thirty plots, ten each from the three stands established in 1984, 1990 and 2005 were used. The data were pooled per stand and fitted. Four finite mixture distributions including normal mixture, lognormal mixture, gamma mixture and Weibull mixture were considered. The method of maximum likelihood was used to fit the finite mixture distributions to the data. Model assessment was based on negative loglikelihood value ($-{\Lambda}{\Lambda}$), Akaike information criterion (AIC), Bayesian information criterion (BIC) and root mean square error (RMSE). The results showed that the mixture distributions provide accurate and precise characterisation of the irregular diameter distribution of the degraded Gmelina arborea stands. The $-{\Lambda}{\Lambda}$, AIC, BIC and RMSE values ranged from -715.233 to -348.375, 703.926 to 1433.588, 718.598 to 1451.334 and 3.003 to 7.492, respectively. Their performances were relatively the same. This approach can be used to describe other irregular forest stand structures, especially the multi-species forest.

Estimating Discriminatory Power with Non-normality and a Small Number of Defaults

  • Hong, C.S.;Kim, H.J.;Lee, J.L.
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.5
    • /
    • pp.803-811
    • /
    • 2012
  • For credit evaluation models, we extend the study of discriminatory power based on AUC obtained from a ROC curve when the number of defaults is small and distribution functions of the defaults and non-defaults are normal distributions. Since distribution functions do not satisfy normality in real world, the distribution functions of the defaults and non-defaults are assumed as normal mixture distributions based on results that the normal mixture could be better fitted than other distribution estimation methods for non-normal data. By using several AUC statistics, the discriminatory power under such a circumstance is explored and compared with those of normal distributions.

An approximate fitting for mixture of multivariate skew normal distribution via EM algorithm (EM 알고리즘에 의한 다변량 치우친 정규분포 혼합모형의 근사적 적합)

  • Kim, Seung-Gu
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.3
    • /
    • pp.513-523
    • /
    • 2016
  • Fitting a mixture of multivariate skew normal distribution (MSNMix) with multiple skewness parameter vectors via EM algorithm often requires a highly expensive computational cost to calculate the moments and probabilities of multivariate truncated normal distribution in E-step. Subsequently, it is common to fit an asymmetric data set with MSNMix with a simple skewness parameter vector since it allows us to compute them in E-step in an univariate manner that guarantees a cheap computational cost. However, the adaptation of a simple skewness parameter is unrealistic in many situations. This paper proposes an approximate estimation for the MSNMix with multiple skewness parameter vectors that also allows us to treat them in an univariate manner. We additionally provide some experiments to show its effectiveness.

Variable Selection in Clustering by Recursive Fit of Normal Distribution-based Salient Mixture Model (정규분포기반 두각 혼합모형의 순환적 적합을 이용한 군집분석에서의 변수선택)

  • Kim, Seung-Gu
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.5
    • /
    • pp.821-834
    • /
    • 2013
  • Law et al. (2004) proposed a normal distribution based salient mixture model for variable selection in clustering. However, this model has substantial problems such as the unidentifiability of components an the inaccurate selection of informative variables in the case of a small cluster size. We propose an alternative method to overcome problems and demonstrate a good performance through experiments on simulated data and real data.

Reject Inference of Incomplete Data Using a Normal Mixture Model

  • Song, Ju-Won
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.2
    • /
    • pp.425-433
    • /
    • 2011
  • Reject inference in credit scoring is a statistical approach to adjust for nonrandom sample bias due to rejected applicants. Function estimation approaches are based on the assumption that rejected applicants are not necessary to be included in the estimation, when the missing data mechanism is missing at random. On the other hand, the density estimation approach by using mixture models indicates that reject inference should include rejected applicants in the model. When mixture models are chosen for reject inference, it is often assumed that data follow a normal distribution. If data include missing values, an application of the normal mixture model to fully observed cases may cause another sample bias due to missing values. We extend reject inference by a multivariate normal mixture model to handle incomplete characteristic variables. A simulation study shows that inclusion of incomplete characteristic variables outperforms the function estimation approaches.

ROC Function Estimation (ROC 함수 추정)

  • Hong, Chong-Sun;Lin, Mei Hua;Hong, Sun-Woo
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.6
    • /
    • pp.987-994
    • /
    • 2011
  • From the point view of credit evaluation whose population is divided into the default and non-default state, two methods are considered to estimate conditional distribution functions: one is to estimate under the assumption that the data is followed the mixture normal distribution and the other is to use the kernel density estimation. The parameters of normal mixture are estimated using the EM algorithm. For the kernel density estimation, five kinds of well known kernel functions and four kinds of the bandwidths are explored. In addition, the corresponding ROC functions are obtained based on the estimated distribution functions. The goodness-of-fit of the estimated distribution functions are discussed and the performance of the ROC functions are compared. In this work, it is found that the kernel distribution functions shows better fit, and the ROC function obtained under the assumption of normal mixture shows better performance.