• Title/Summary/Keyword: Variable Statistics

Search Result 1,333, Processing Time 0.02 seconds

Estimation on Modified Proportional Hazards Model

  • Lee, Kwang-Ho;Lee, Mi-Sook
    • Journal of the Korean Data and Information Science Society
    • /
    • v.5 no.1
    • /
    • pp.59-66
    • /
    • 1994
  • Heller and Simonoff(1990) compared several methods of estimating the regression coefficient in a modified proportional hazards model, when the response variable is subject to censoring. We give another method of estimating the parameters in the model which also allows the dependent variable to be censored and the error distribution to be unspecified. The proposed method differs from that of Miller(1976) and that of Buckely and James(1979). We also obtain the variance estimator of the coefficient estimator and compare that with the Buckely-James Variance estimator studied by Hillis(1993).

  • PDF

A VSR $\bar{X}$ Chart with Multi-state VSS and 2-state VSI Scheme

  • Lee, Jae-Heon;Park, Chang-Soon
    • Journal of Korean Society for Quality Management
    • /
    • v.32 no.4
    • /
    • pp.252-264
    • /
    • 2004
  • Variable sampling Interval (VSI) control charts vary the sampling interval according to value of the control statistic while the sample size is fixed. It is known that control charts with 2-state VSI scheme, which uses only two sampling intervals, give good statistical properties. Variable sample size (VSS) control charts vary the sample size according to value of the control statistic while the sampling interval is fixed. In the VSS scheme no optimal results are known for the number of sample sizes. It is also known that the variable sampling rate (VSR) $\bar{X}$ control chart with 2-state VSS and 2-state VSI scheme leads to large improvements In performance over the fixed sampling rate (FSR) $\bar{X}$ chart, but the optimal number of states for sample size Is not known. In this paper, the VSR Χ charts with multi-state VSS and 2-state VSI scheme are designed and compared to 2-state VSS and 2-state VSI scheme. The multi-state VSS scheme is considered to, achieve an additional improvement by switching from the 2-state VSS scheme. On the other hand, the multi-state VSI scheme is not considered because the 2-state scheme is known to be optimal. The 3-state VSS scheme improves substantially the sensitivity of the $\bar{X}$ chart especially for small and moderate mean shifts.

Bayesian test of homogenity in small areas: A discretization approach

  • Kim, Min Sup;Nandram, Balgobin;Kim, Dal Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.6
    • /
    • pp.1547-1555
    • /
    • 2017
  • This paper studies Bayesian test of homogeneity in contingency tables made by discretizing a continuous variable. Sometimes when we are considering events of interest in small area setup, we can think of discretization approaches about the continuous variable. If we properly discretize the continuous variable, we can find invisible relationships between areas (groups) and a continuous variable of interest. The proper discretization of the continuous variable can support the alternative hypothesis of the homogeneity test in contingency tables even if the null hypothesis was not rejected through k-sample tests involving one-way ANOVA. In other words, the proportions of variables with a particular level can vary from group to group by the discretization. If we discretize the the continuous variable, it can be treated as an analysis of the contingency table. In this case, the chi-squared test is the most commonly employed method. However, further discretization gives rise to more cells in the table. As a result, the count in the cells becomes smaller and the accuracy of the test becomes lower. To prevent this, we can consider the Bayesian approach and apply it to the setup of the homogeneity test.

Ordinal Variable Selection in Decision Trees (의사결정나무에서 순서형 분리변수 선택에 관한 연구)

  • Kim Hyun-Joong
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.1
    • /
    • pp.149-161
    • /
    • 2006
  • The most important component in decision tree algorithm is the rule for split variable selection. Many earlier algorithms such as CART and C4.5 use greedy search algorithm for variable selection. Recently, many methods were developed to cope with the weakness of greedy search algorithm. Most algorithms have different selection criteria depending on the type of variables: continuous or nominal. However, ordinal type variables are usually treated as continuous ones. This approach did not cause any trouble for the methods using greedy search algorithm. However, it may cause problems for the newer algorithms because they use statistical methods valid for continuous or nominal types only. In this paper, we propose a ordinal variable selection method that uses Cramer-von Mises testing procedure. We performed comparisons among CART, C4.5, QUEST, CRUISE, and the new method. It was shown that the new method has a good variable selection power for ordinal type variables.

Estimation using informative sampling technique when response rate follows exponential function of variable of interest (응답률이 관심변수의 지수함수를 따를 경우 정보적 표본설계 기법을 이용한 모수추정)

  • Chung, Hee Young;Shin, Key-Il
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.6
    • /
    • pp.993-1004
    • /
    • 2017
  • A stratified sampling method is generally used with a sample selected using the same sample weight in each stratum in order to improve the accuracy of the sampling survey estimation. However, the weight should be adjusted to reflect the response rate if the response rate is affected by the value of the variable of interest. It may be also more effective to adjust the weights by subdividing the stratum rather than using the same weight if the variable of interest has a linear relationship with the continuous auxiliary variables. In this study, we propose a method to increase the accuracy of estimation using an informative sampling design technique when the response rate is an exponential function of the variable of interest and the variable of interest has a linear relationship with the auxiliary variable. Simulation results show the superiority of the proposed method.

An Agglomerative Hierarchical Variable-Clustering Method Based on a Correlation Matrix

  • Lee, Kwangjin
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.2
    • /
    • pp.387-397
    • /
    • 2003
  • Generally, most of researches that need a variable-clustering process use an exploratory factor analysis technique or a divisive hierarchical variable-clustering method based on a correlation matrix. And some researchers apply a object-clustering method to a distance matrix transformed from a correlation matrix, though this approach is known to be improper. On this paper an agglomerative hierarchical variable-clustering method based on a correlation matrix itself is suggested. It is derived from a geometric concept by using variate-spaces and a characterizing variate.

A Study on the Bias Reduction in Split Variable Selection in CART

  • Song, Hyo-Im;Song, Eun-Tae;Song, Moon Sup
    • Communications for Statistical Applications and Methods
    • /
    • v.11 no.3
    • /
    • pp.553-562
    • /
    • 2004
  • In this short communication we discuss the bias problems of CART in split variable selection and suggest a method to reduce the variable selection bias. Penalties proportional to the number of categories or distinct values are applied to the splitting criteria of CART. The results of empirical comparisons show that the proposed modification of CART reduces the bias in variable selection.

Graphical Method for Multiple Regression Model (다중회귀모형의 그래픽적 방법)

  • Lee, W.R.;Lee, U.K.;Hong, C.S.
    • The Korean Journal of Applied Statistics
    • /
    • v.20 no.1
    • /
    • pp.195-204
    • /
    • 2007
  • In order to represent multiple regression data, an alternative graphical method, called as SSR Plot, is proposed by using geometrical description methods. This plot uses the relation that the sum of sqaures for regression (SSR) of two explanatory variables is known as the sum of the SSR of one variable and the increase in the SSR due to the addition of other variable to the model that already contains a variable. This half circle shaped SSR plot contains vectors corresponding explanatory variables. We might conclude that some explanatory variables corresponding to vectors which locate near the horisontal axis do affect the response variable. Also, for the regression model with two explanatory variables, a magnitude of the angle between two vectors can be identified for suppression.