• Title/Summary/Keyword: Statistical distributions

Search Result 1,007, Processing Time 0.028 seconds

A spatial heterogeneity mixed model with skew-elliptical distributions

  • Farzammehr, Mohadeseh Alsadat;McLachlan, Geoffrey J.
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.3
    • /
    • pp.373-391
    • /
    • 2022
  • The distribution of observations in most econometric studies with spatial heterogeneity is skewed. Usually, a single transformation of the data is used to approximate normality and to model the transformed data with a normal assumption. This assumption is however not always appropriate due to the fact that panel data often exhibit non-normal characteristics. In this work, the normality assumption is relaxed in spatial mixed models, allowing for spatial heterogeneity. An inference procedure based on Bayesian mixed modeling is carried out with a multivariate skew-elliptical distribution, which includes the skew-t, skew-normal, student-t, and normal distributions as special cases. The methodology is illustrated through a simulation study and according to the empirical literature, we fit our models to non-life insurance consumption observed between 1998 and 2002 across a spatial panel of 103 Italian provinces in order to determine its determinants. Analyzing the posterior distribution of some parameters and comparing various model comparison criteria indicate the proposed model to be superior to conventional ones.

The Effective Training Method for the Statistical Classification of Remotely Sensed Imagery (위성영상의 통계적 분류를 위한 유효 트레이닝 기법에 관한 연구)

  • 이병길;김용일;어양담
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.17 no.3
    • /
    • pp.225-231
    • /
    • 1999
  • In statistical analysis of remotely sensed data, means and variances of each classes are used as the basis of statistical similarity determination. Therefore, the overall accuracy of classification is affected by the training results. It is assumed that the ideal distributions of pixel values follow normal distributions, but practically they have some aggregations and biases. non anomalies of distribution can affect the classification results greatly as well as the variances of training results. In this study, relationships between the inferential variances of the training sets and the distributions of pixel values are examined. and the resulting changes of classification results are studied. Furthermore, the training method which minimizes the effect of underestimation of variances is proposed.

  • PDF

Significant Genotype Difference in the CYP2E1 PstI Polymorphism of Indigenous Groups in Sabah, Malaysia with Asian and Non-Asian Populations

  • Goh, Lucky Poh Wah;Chong, Eric Tzyy Jiann;Chua, Kek Heng;Chuah, Jitt Aun;Lee, Ping-Chin
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.17
    • /
    • pp.7377-7381
    • /
    • 2014
  • CYP2E1 PstI polymorphism G-1259C (rs3813867) genotype distributions vary significantly among different populations and are associated with both diseases, like cancer, and adverse drug effects. To date, there have been limited genotype distributions and allele frequencies of this polymorphism reported in the three major indigenous ethnic groups (KadazanDusun, Bajau, and Rungus) in Sabah, also known as North Borneo. The aim of this study was to investigate the genotype distributions and allele frequencies of the CYP2E1 PstI polymorphism G-1259C in these three major indigenous peoples in Sabah. A total of 640 healthy individuals from the three dominant indigenous groups were recruited for this study. Polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) at G-1259C polymorphic site of CYP2E1 gene was performed using the Pst I restriction enzyme. Fragments were analyzed using agarose gel electrophoresis and confirmed by direct sequencing. Overall, the allele frequencies were 90.3% for c1 allele and 9.7% for c2 allele. The genotype frequencies for c1/c1, c1/c2 and c2/c2 were observed as 80.9%, 18.8%, and 0.3%, respectively. A highly statistical significant difference (p<0.001) was observed in the genotype distributions between indigenous groups in Sabah with all Asian and non-Asian populations. However, among these three indigenous groups, there was no statistical significant difference (p>0.001) in their genotype distributions. The three major indigenous ethnic groups in Sabah show unique genotype distributions when compared with other populations. This finding indicates the importance of establishing the genotype distributions of CYP2E1 PstI polymorphism in the indigenous populations.

A Projected Exponential Family for Modeling Semicircular Data

  • Kim, Hyoung-Moon
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.6
    • /
    • pp.1125-1145
    • /
    • 2010
  • For modeling(skewed) semicircular data, we derive a new exponential family of distributions. We extend it to the l-axial exponential family of distributions by a projection for modeling any arc of arbitrary length. It is straightforward to generate samples from the l-axial exponential family of distributions. Asymptotic result reveals that the linear exponential family of distributions can be used to approximate the l-axial exponential family of distributions. Some trigonometric moments are also derived in closed forms. The maximum likelihood estimation is adopted to estimate model parameters. Some hypotheses tests and confidence intervals are also developed. The Kolmogorov-Smirnov test is adopted for a goodness of t test of the l-axial exponential family of distributions. Samples of orientations are used to demonstrate the proposed model.

BOOTSTRAP TESTS FOR THE EQUALITY OF DISTRIBUTIONS

  • Ping, Jing
    • Journal of applied mathematics & informatics
    • /
    • v.7 no.2
    • /
    • pp.467-482
    • /
    • 2000
  • Testing equality of two and k distributions has long been an interesting issue in statistical inference. To overcome the sparseness of data points in high-dimensional space and deal with the general cases, we suggest several projection pursuit type statistics. Some results on the limiting distributions of the statistics are obtained, some properties of Bootstrap approximation are investigated. Furthermore, for computational reasons an approximation for the statistics the based on Number theoretic method is applied. Several simulation experiments are performed.

Selection of Appropriate Probability Distribution Types for Ten Days Evaporation Data (순별증발량 자료의 적정 확률분포형 선정)

  • 김선주;박재흥;강상진
    • Proceedings of the Korean Society of Agricultural Engineers Conference
    • /
    • 1998.10a
    • /
    • pp.338-343
    • /
    • 1998
  • This study is to select appropriate probability distributions for ten days evaporation data for the purpose of representing statistical characteristics of real evaporation data in Korea. Nine probability distribution functions were assumed to be underlying distributions for ten days evaporation data of 20 stations with the duration of 20 years. The parameter of each probability distribution function were estimated by the maximum likelihood approach, and appropriate probability distributions were selected from the goodness of fit test. Log Pearson type III model was selected as an appropriate probability distribution for ten days evaporation data in Korea.

  • PDF

Adaptive L-estimation for regression slope under asymmetric error distributions (비대칭 오차모형하에서의 회귀기울기에 대한 적합된 L-추정법)

  • 한상문
    • The Korean Journal of Applied Statistics
    • /
    • v.6 no.1
    • /
    • pp.79-93
    • /
    • 1993
  • We consider adaptive L-estimation of estimating slope parameter in regression model. The proposed estimator is simple extension of trimmed least squares estimator proposed by ruppert and carroll. The efficiency of the proposed estimator is especially well compared with usual least squares estimator, least absolute value estimator, and M-estimators designed for asymmetric distributions under asymmetric error distributions.

  • PDF

Variable Selection with Log-Density in Logistic Regression Model (로지스틱회귀모형에서 로그-밀도비를 이용한 변수의 선택)

  • Kahng, Myung-Wook;Shin, Eun-Young
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.1
    • /
    • pp.1-11
    • /
    • 2012
  • We present methods to study the log-density ratio of the conditional densities of the predictors given the response variable in the logistic regression model. This allows us to select which predictors are needed and how they should be included in the model. If the conditional distributions are skewed, the distributions can be considered as gamma distributions. A simulation study shows that the linear and log terms are required in general. If the conditional distributions of xjy for the two groups overlap significantly, we need both the linear and log terms; however, only the linear or log term is needed in the model if they are well separated.

Power Investigation of the Entropy-Based Test of Fit for Inverse Gaussian Distribution by the Information Discrimination Index

  • Choi, Byungjin
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.6
    • /
    • pp.837-847
    • /
    • 2012
  • Inverse Gaussian distribution is widely used in applications to analyze and model right-skewed data. To assess the appropriateness of the distribution prior to data analysis, Mudholkar and Tian (2002) proposed an entropy-based test of fit. The test is based on the entropy power fraction(EPF) index suggested by Gokhale (1983). The simulation results report that the power of the entropy-based test is superior compared to other goodness-of-fit tests; however, this observation is based on the small-scale simulation results on the standard exponential, Weibull W(1; 2) and lognormal LN(0:5; 1) distributions. A large-scale simulation should be performed against various alternative distributions to evaluate the power of the entropy-based test; however, the use of a theoretical method is more effective to investigate the powers. In this paper, utilizing the information discrimination(ID) index defined by Ehsan et al. (1995) as a mathematical tool, we scrutinize the power of the entropy-based test. The selected alternative distributions are the gamma, Weibull and lognormal distributions, which are widely used in data analysis as an alternative to inverse Gaussian distribution. The study results are provided and an illustrative example is analyzed.

Classical and Bayesian methods of estimation for power Lindley distribution with application to waiting time data

  • Sharma, Vikas Kumar;Singh, Sanjay Kumar;Singh, Umesh
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.3
    • /
    • pp.193-209
    • /
    • 2017
  • The power Lindley distribution with some of its properties is considered in this article. Maximum likelihood, least squares, maximum product spacings, and Bayes estimators are proposed to estimate all the unknown parameters of the power Lindley distribution. Lindley's approximation and Markov chain Monte Carlo techniques are utilized for Bayesian calculations since posterior distribution cannot be reduced to standard distribution. The performances of the proposed estimators are compared based on simulated samples. The waiting times of research articles to be accepted in statistical journals are fitted to the power Lindley distribution with other competing distributions. Chi-square statistic, Kolmogorov-Smirnov statistic, Akaike information criterion and Bayesian information criterion are used to access goodness-of-fit. It was found that the power Lindley distribution gives a better fit for the data than other distributions.