• 제목/요약/키워드: normality assumption

검색결과 85건 처리시간 0.018초

NORMALIZED SAMPLE LORENZ CURVE를 이용한 검정력이 높은 정규성 검정 (More Powerful Test for Normality Based on the Normalized Sample Lorenz Curve)

  • 강석복;조영석
    • 응용통계연구
    • /
    • 제15권2호
    • /
    • pp.415-421
    • /
    • 2002
  • 통계적분석에서 가장 대표적인 가정이 정규성 가정이므로 데이터의 정규성 검정은 매우 중요하다. 이 논문에서는 정규성 검정을 위해 경제학에서 소득분배의 불균형에 관한 척도로 널리 이용되는 Lorenz curve를 변형한 새로운 플롯과 검정통계량을 제시한다. 그리고 제한한 검정을 W검정 (Shapiro and Wilk (1965)), Lorenz curve를 이용한 TL검정(Kang and Cho (1999))과 몬테칼로 방법을 이용하여 검정력을 비교한다. 제안된 검정이 특별한 대립분포의 경우를 제외하고는 대부분 검정력이 높았다.

A comparison of tests for homoscedasticity using simulation and empirical data

  • Anastasios Katsileros;Nikolaos Antonetsis;Paschalis Mouzaidis;Eleni Tani;Penelope J. Bebeli;Alex Karagrigoriou
    • Communications for Statistical Applications and Methods
    • /
    • 제31권1호
    • /
    • pp.1-35
    • /
    • 2024
  • The assumption of homoscedasticity is one of the most crucial assumptions for many parametric tests used in the biological sciences. The aim of this paper is to compare the empirical probability of type I error and the power of ten parametric and two non-parametric tests for homoscedasticity with simulations under different types of distributions, number of groups, number of samples per group, variance ratio and significance levels, as well as through empirical data from an agricultural experiment. According to the findings of the simulation study, when there is no violation of the assumption of normality and the groups have equal variances and equal number of samples, the Bhandary-Dai, Cochran's C, Hartley's Fmax, Levene (trimmed mean) and Bartlett tests are considered robust. The Levene (absolute and square deviations) tests show a high probability of type I error in a small number of samples, which increases as the number of groups rises. When data groups display a nonnormal distribution, researchers should utilize the Levene (trimmed mean), O'Brien and Brown-Forsythe tests. On the other hand, if the assumption of normality is not violated but diagnostic plots indicate unequal variances between groups, researchers are advised to use the Bartlett, Z-variance, Bhandary-Dai and Levene (trimmed mean) tests. Assessing the tests being considered, the test that stands out as the most well-rounded choice is the Levene's test (trimmed mean), which provides satisfactory type I error control and relatively high power. According to the findings of the study and for the scenarios considered, the two non-parametric tests are not recommended. In conclusion, it is suggested to initially check for normality and consider the number of samples per group before choosing the most appropriate test for homoscedasticity.

Box-Cox변환을 이용한 다변량 공정능력 분석 (Analysis of Multivariate Process Capability Using Box-Cox Transformation)

  • 문혜진;정영배
    • 산업경영시스템학회지
    • /
    • 제42권2호
    • /
    • pp.18-27
    • /
    • 2019
  • The process control methods based on the statistical analysis apply the analysis method or mathematical model under the assumption that the process characteristic is normally distributed. However, the distribution of data collected by the automatic measurement system in real time is often not followed by normal distribution. As the statistical analysis tools, the process capability index (PCI) has been used a lot as a measure of process capability analysis in the production site. However, PCI has been usually used without checking the normality test for the process data. Even though the normality assumption is violated, if the analysis method under the assumption of the normal distribution is performed, this will be an incorrect result and take a wrong action. When the normality assumption is violated, we can transform the non-normal data into the normal data by using an appropriate normal transformation method. There are various methods of the normal transformation. In this paper, we consider the Box-Cox transformation among them. Hence, the purpose of the study is to expand the analysis method for the multivariate process capability index using Box-Cox transformation. This study proposes the multivariate process capability index to be able to use according to both methodologies whether data is normally distributed or not. Through the computational examples, we compare and discuss the multivariate process capability index between before and after Box-Cox transformation when the process data is not normally distributed.

Effects on Regression Estimates under Misspecified Generalized Linear Mixed Models for Counts Data

  • Jeong, Kwang Mo
    • 응용통계연구
    • /
    • 제25권6호
    • /
    • pp.1037-1047
    • /
    • 2012
  • The generalized linear mixed model(GLMM) is widely used in fitting categorical responses of clustered data. In the numerical approximation of likelihood function the normality is assumed for the random effects distribution; subsequently, the commercial statistical packages also routinely fit GLMM under this normality assumption. We may also encounter departures from the distributional assumption on the response variable. It would be interesting to investigate the impact on the estimates of parameters under misspecification of distributions; however, there has been limited researche on these topics. We study the sensitivity or robustness of the maximum likelihood estimators(MLEs) of GLMM for counts data when the true underlying distribution is normal, gamma, exponential, and a mixture of two normal distributions. We also consider the effects on the MLEs when we fit Poisson-normal GLMM whereas the outcomes are generated from the negative binomial distribution with overdispersion. Through a small scale Monte Carlo study we check the empirical coverage probabilities of parameters and biases of MLEs of GLMM.

A study on Robust Estimation of ARCH models

  • 김삼용;황선영
    • 한국통계학회:학술대회논문집
    • /
    • 한국통계학회 2002년도 추계 학술발표회 논문집
    • /
    • pp.3-9
    • /
    • 2002
  • In financial time series, the autoregressive conditional heteroscedastic (ARCH) models have been widely used for modeling conditional variances. In many cases, non-normality or heavy-tailed distributions of the data have influenced the estimation methods under normality assumption. To solve this problem, a robust function for the conditional variances of the errors is proposed and compared the relative efficiencies of the estimators with other conventional models.

  • PDF

A note on Box-Cox transformation and application in microarray data

  • Rahman, Mezbahur;Lee, Nam-Yong
    • Journal of the Korean Data and Information Science Society
    • /
    • 제22권5호
    • /
    • pp.967-976
    • /
    • 2011
  • The Box-Cox transformation is a well known family of power transformations that brings a set of data into agreement with the normality assumption of the residuals and hence the response variable of a postulated model in regression analysis. Normalization (studentization) of the regressors is a common practice in analyzing microarray data. Here, we implement Box-Cox transformation in normalizing regressors in microarray data. Pridictabilty of the model can be improved using data transformation compared to studentization.

PERIODOGRAM ANALYSIS WITH MISSING OBSERVATIONS

  • Ghazal M.A.;Elhassanein A.
    • Journal of applied mathematics & informatics
    • /
    • 제22권1_2호
    • /
    • pp.209-222
    • /
    • 2006
  • Estimation of the spectral measure, covariance and spectral density functions of a strictly stationary r-vector valued time series is considered, under the assumption that some of the observations are missed. The modified periodograms are calculated using data window. The asymptotic normality is studied.

On Computing a Cholesky Decomposition

  • Park, Jong-Tae
    • Communications for Statistical Applications and Methods
    • /
    • 제3권2호
    • /
    • pp.37-42
    • /
    • 1996
  • Maximum likelihood estimation of Cholesky decomposition is considered under normality assumption. It is shown that maximum liklihood estimation gives a Cholesky decomposition of the sample covariance matrix. The joint distribution of the maximum likelihood estimators is derived. The ussual algorithm for a Cholesky decomposition is shown to be equivalent to a maximumlikelihood estimation of a Cholesky root when the underlying distribution is a multivariate normal one.

  • PDF

분할 역회귀모형에서 차원결정을 위한 점근검정법 (Asymptotic Test for Dimensionality in Sliced Inverse Regression)

  • 박종선;곽재근
    • 응용통계연구
    • /
    • 제18권2호
    • /
    • pp.381-393
    • /
    • 2005
  • 회귀모형에서 필요한 설명변수들의 선형결합들을 탐색하기 위한 방법 중의 하나로 분할역회귀모형을 들 수 있다. 이러한 분할역회귀모형에서 모형에 필요한 설명변수들의 선형결합의 수, 즉 차원을 결정하기 위한 여러 가지의 검정법들이 소개 되었으나 설명변수들의 정규성 가정을 필요로 하거나 다른 제약이 있다. 본 논문에서는 주성분분석에 대한 확률모형을 이 용하여 정규성가정을 필요로하지 않으며 분할의 수에 로버스트한 검정법을 소개하고 모의실험과 실제자료에 대한 적용결과를 통하여 기존의 검정법과 비교하였다.

예측치 결합을 위한 PNN 접근방법 (A PNN approach for combining multiple forecasts)

  • 전덕빈;신효덕;이정진
    • 대한산업공학회지
    • /
    • 제26권3호
    • /
    • pp.193-199
    • /
    • 2000
  • In many studies, considerable attention has been focussed upon choosing a model which represents underlying process of time series and forecasting the future. In the real world, however, there may be some cases that one model can not reflect all the characteristics of original time series. Under such circumstances, we may get better performance by combining the forecasts from several models. The most popular methods for combining forecasts involve taking a weighted average of multiple forecasts. But the weights are usually unstable. In cases the assumptions of normality and unbiasedness for forecast errors are satisfied, a Bayesian method can be used for updating the weights. In the real world, however, there are many circumstances the Bayesian method is not appropriate. This paper proposes a PNN(Probabilistic Neural Net) approach as a method for combining forecasts that can be applied when the assumption of normality or unbiasedness for forecast errors is not satisfied. In this paper, PNN method, which is similar to Bayesian approach, is suggested as an updating method of the unstable weights in the combination of the forecasts. The PNN method has been usually used in the field of pattern recognition. Unlike the Bayesian approach, it requires no assumption of a specific prior distribution because it gets probabilities by using the distribution estimated from given data. Empirical results reveal that the PNN method offers superior predictive capabilities.

  • PDF