• Title/Summary/Keyword: Models, statistical

Search Result 3,026, Processing Time 0.027 seconds

Cumulative Sums of Residuals in GLMM and Its Implementation

  • Choi, DoYeon;Jeong, KwangMo
    • Communications for Statistical Applications and Methods
    • /
    • v.21 no.5
    • /
    • pp.423-433
    • /
    • 2014
  • Test statistics using cumulative sums of residuals have been widely used in various regression models including generalized linear models(GLM). Recently, Pan and Lin (2005) extended this testing procedure to the generalized linear mixed models(GLMM) having random effects, in which we encounter difficulties in computing the marginal likelihood that is expressed as an integral of random effects distribution. The Gaussian quadrature algorithm is commonly used to approximate the marginal likelihood. Many commercial statistical packages provide an option to apply this type of goodness-of-fit test in GLMs but available programs are very rare for GLMMs. We suggest a computational algorithm to implement the testing procedure in GLMMs by a freely accessible R package, and also illustrate through practical examples.

Dual Generalized Maximum Entropy Estimation for Panel Data Regression Models

  • Lee, Jaejun;Cheon, Sooyoung
    • Communications for Statistical Applications and Methods
    • /
    • v.21 no.5
    • /
    • pp.395-409
    • /
    • 2014
  • Data limited, partial, or incomplete are known as an ill-posed problem. If the data with ill-posed problems are analyzed by traditional statistical methods, the results obviously are not reliable and lead to erroneous interpretations. To overcome these problems, we propose a dual generalized maximum entropy (dual GME) estimator for panel data regression models based on an unconstrained dual Lagrange multiplier method. Monte Carlo simulations for panel data regression models with exogeneity, endogeneity, or/and collinearity show that the dual GME estimator outperforms several other estimators such as using least squares and instruments even in small samples. We believe that our dual GME procedure developed for the panel data regression framework will be useful to analyze ill-posed and endogenous data sets.

A comparative Study of ARIMA and Neural Network Model;Case study in Korea Corporate Bond Yields

  • Kim, Steven H.;Noh, Hyunju
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 1996.10a
    • /
    • pp.19-22
    • /
    • 1996
  • A traditional approach to the prediction of economic and financial variables takes the form of statistical models to summarize past observations and to project them into the envisioned future. Over the past decade, an increasing number of organizations has turned to the use of neural networks. To date, however, many spheres of interest still lack a systematic evaluation of the statistical and neural approaches. One of these lies in the prediction of corporate bond yields for Korea. This paper reports on a comparative evaluation of ARIMA models and neural networks in the context of interest rate prediction. An additional experiment relates to an integration of the two methods. More specifically, the statistical model serves as a filter by providing estimtes which are then used as input into the neural network models.

  • PDF

On statistical Computing via EM Algorithm in Logistic Linear Models Involving Non-ignorable Missing data

  • Jun, Yu-Na;Qian, Guoqi;Park, Jeong-Soo
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2005.11a
    • /
    • pp.181-186
    • /
    • 2005
  • Many data sets obtained from surveys or medical trials often include missing observations. When these data sets are analyzed, it is general to use only complete cases. However, it is possible to have big biases or involve inefficiency. In this paper, we consider a method for estimating parameters in logistic linear models involving non-ignorable missing data mechanism. A binomial response and normal exploratory model for the missing data are used. We fit the model using the EM algorithm. The E-step is derived by Metropolis-hastings algorithm to generate a sample for missing data and Monte-carlo technique, and the M-step is by Newton-Raphson to maximize likelihood function. Asymptotic variances of the MLE's are derived and the standard error and estimates of parameters are compared.

  • PDF

A Study on the Forecasing Modeles of Traffic Accident by Region (지역별 교통사고 예측모형에 관한 연구)

  • 박병호
    • Journal of the Korean Regional Science Association
    • /
    • v.11 no.1
    • /
    • pp.21-30
    • /
    • 1995
  • This paper deals with the forecasting models for traffic accident by region. Its objectives are to develop the appropriate model for projecting the accident and to analyze the regional characteristics of the accident model. The main results are as follow. First, the literature review, statistical tests and sensitivity analyses show that the joint model combined both PTM and Exponential functions is appropriate to project the traffic accidents by region. Second, the statistical analyses by region. Second, the statistical analyses on the regional accident models indicate that the levels of significance in terms of t-value, $R^2$ and F-value are very high. Finally, the comparative analyses among regions show that the regional differences on the accident patterns can be explained by the joint models and the accident indices (parameters, $P_{max}$, 1/b, $\eta$ etc.) of each region.

  • PDF

Truncation Parameter Selection in Binary Choice Models (이항 선택 모형에서의 절단 모수 선택)

  • Kim, Kwang-Rae;Cho, Kyu-Dong;Koo, Ja-Yong
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.6
    • /
    • pp.811-827
    • /
    • 2010
  • This paper deals with a density estimation method in binary choice models that can be regarded as a statistical inverse problem. We use an orthogonal basis to estimate density function and consider the choice of an appropriate truncation parameter to reflect the model complexity and the prediction accuracy. We propose a data-dependent rule to choose the truncation parameter in the context of binary choice models. A numerical simulation is provided to illustrate the performance of the proposed method.

Applied linear and nonlinear statistical models for evaluating strength of Geopolymer concrete

  • Prem, Prabhat Ranjan;Thirumalaiselvi, A.;Verma, Mohit
    • Computers and Concrete
    • /
    • v.24 no.1
    • /
    • pp.7-17
    • /
    • 2019
  • The complex phenomenon of the bond formation in geopolymer is not well understood and therefore, difficult to model. This paper present applied statistical models for evaluating the compressive strength of geopolymer. The applied statistical models studied are divided into three different categories - linear regression [least absolute shrinkage and selection operator (LASSO) and elastic net], tree regression [decision and bagging tree] and kernel methods (support vector regression (SVR), kernel ridge regression (KRR), Gaussian process regression (GPR), relevance vector machine (RVM)]. The performance of the methods is compared in terms of error indices, computational effort, convergence and residuals. Based on the present study, kernel based methods (GPR and KRR) are recommended for evaluating compressive strength of Geopolymer concrete.

Korean Welfare Panel Data: A Computational Bayesian Method for Ordered Probit Random Effects Models

  • Lee, Hyejin;Kyung, Minjung
    • Communications for Statistical Applications and Methods
    • /
    • v.21 no.1
    • /
    • pp.45-60
    • /
    • 2014
  • We introduce a MCMC sampling for a generalized linear normal random effects model with the ordered probit link function based on latent variables from suitable truncated normal distribution. Such models have proven useful in practice and we have observed numerically reasonable results in the estimation of fixed effects when the random effect term is provided. Applications that utilize Korean Welfare Panel Study data can be difficult to model; subsequently, we find that an ordered probit model with the random effects leads to an improved analyses with more accurate and precise inferences.

Independence Condition in the Repeated Randomized Response Models (반복시행된 확률화 응답(RRD) 모형의 독립조건)

  • Lee Kwan J.;Kook Sejeong
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2000.11a
    • /
    • pp.33-38
    • /
    • 2000
  • Krishnamoorphy and Raghavarao(1993) invented exact binomial and asymptotically normal test procedures for truthful answering in the repeated randomized response models under the assumption that two repeated response measures are independent. Under the same assumption, Lakshmi and Raghavarao(1992) suggested asymptotic chi-square test for respondents' truthful answering in the same models. In this article we detect the factors and the conditions with which two response variables might be independent, and find the condition for independence in the repeated randomized response models with considering untruthful answer. But, the condition of independence make the randomized model no meaning. Under the assumption of conditional independence between two response variables, we can apply the same logical statements on deriving the tests for truthful answering in the repeated randomized response models as in Krishnamoorphy and Raghavarao(1993).

  • PDF