• Title/Summary/Keyword: Ordinal Variables

Search Result 43, Processing Time 0.031 seconds

Control Charts for Ordinal Variables (순서형 변수를 위한 관리도)

  • Jang, Dae-Heung
    • Proceedings of the Korean Society for Quality Management Conference
    • /
    • 2006.04a
    • /
    • pp.330-333
    • /
    • 2006
  • Many practical problems of quality control in service management are derived from the use of ordinal variables. Ordered linguistic variables differ from measurement variables. This paper presents a new control chart of a production process based on ordinal variables.

  • PDF

Ordinal Variable Selection in Decision Trees (의사결정나무에서 순서형 분리변수 선택에 관한 연구)

  • Kim Hyun-Joong
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.1
    • /
    • pp.149-161
    • /
    • 2006
  • The most important component in decision tree algorithm is the rule for split variable selection. Many earlier algorithms such as CART and C4.5 use greedy search algorithm for variable selection. Recently, many methods were developed to cope with the weakness of greedy search algorithm. Most algorithms have different selection criteria depending on the type of variables: continuous or nominal. However, ordinal type variables are usually treated as continuous ones. This approach did not cause any trouble for the methods using greedy search algorithm. However, it may cause problems for the newer algorithms because they use statistical methods valid for continuous or nominal types only. In this paper, we propose a ordinal variable selection method that uses Cramer-von Mises testing procedure. We performed comparisons among CART, C4.5, QUEST, CRUISE, and the new method. It was shown that the new method has a good variable selection power for ordinal type variables.

Goodness-of-Fit Tests for the Ordinal Response Models with Misspecified Links

  • Jeong, Kwang-Mo;Lee, Hyun-Yung
    • Communications for Statistical Applications and Methods
    • /
    • v.16 no.4
    • /
    • pp.697-705
    • /
    • 2009
  • The Pearson chi-squared statistic or the deviance statistic is widely used in assessing the goodness-of-fit of the generalized linear models. But these statistics are not proper in the situation of continuous explanatory variables which results in the sparseness of cell frequencies. We propose a goodness-of-fit test statistic for the cumulative logit models with ordinal responses. We consider the grouping of a dataset based on the ordinal scores obtained by fitting the assumed model. We propose the Pearson chi-squared type test statistic, which is obtained from the cross-classified table formed by the subgroups of ordinal scores and the response categories. Because the limiting distribution of the chi-squared type statistic is intractable we suggest the parametric bootstrap testing procedure to approximate the distribution of the proposed test statistic.

Notes on the Goodness-of-Fit Tests for the Ordinal Response Model

  • Jeong, Kwang-Mo;Lee, Hyun-Yung
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.6
    • /
    • pp.1057-1065
    • /
    • 2010
  • In this paper we discuss some cautionary notes in using the Pearson chi-squared test statistic for the goodness-of-fit of the ordinal response model. If a model includes continuous type explanatory variables, the resulting table from the t of a model is not a regular one in the sense that the cell boundaries are not fixed but randomly determined by some other criteria. The chi-squared statistic from this kind of table does not have a limiting chi-square distribution in general and we need to be very cautious of the use of a chi-squared type goodness-of-t test. We also study the limiting distribution of the chi-squared type statistic for testing the goodness-of-t of cumulative logit models with ordinal responses. The regularity conditions necessary to the limiting distribution will be reformulated in the framework of the cumulative logit model by modifying those of Moore and Spruill (1975). Due to the complex limiting distribution, a parametric bootstrap testing procedure is a good alternative and we explained the suggested method through a practical example of an ordinal response dataset.

Optimal Process Condition for Products with Multi-Categorical Ordinal Quality Characteristic (다범주 순서형 품질특성을 갖는 제품의 최적 공정조건 결정에 관한 연구)

  • Kim Sang-Cheol;Yun Won-Young;Chun Young-Rok
    • Journal of Korean Society for Quality Management
    • /
    • v.32 no.3
    • /
    • pp.109-125
    • /
    • 2004
  • This paper deals with an optimal process control problem in production of hull structural steel plate with high defective rate. The main quality characteristic(dependent variable) is the internal quality(defect) of plates and is dependent on process parameters(independent variables). The dependent variable(quality characteristics) has three categorical ordinal data and there are 35 independent variables(29 continuous variables and 6 categorical variables). In this paper, we determine the main factors and to develop the mathematical model between internal quality predicted probabilities and the main factors. Secondly, we find out the optimal process condition of main factors through analysis of variance(ANOVA) using simulation. We consider three models to obtain the main factors and the optimal process condition: linear, quadratic, error models.

Small Sample Characteristics of Generalized Estimating Equations for Categorical Repeated Measurements (범주형 반복측정자료를 위한 일반화 추정방정식의 소표본 특성)

  • 김동욱;김재직
    • The Korean Journal of Applied Statistics
    • /
    • v.15 no.2
    • /
    • pp.297-310
    • /
    • 2002
  • Liang and Zeger proposed generalized estimating equations(GEE) for analyzing repeated data which is discrete or continuous. GEE model can be extended to model for repeated categorical data and its estimator has asymptotic multivariate normal distribution in large sample sizes. But GEE is based on large sample asymptotic theory. In this paper, we study the properties of GEE estimators for repeated ordinal data in small sample sizes. We generate ordinal repeated measurements for two groups using two methods. Through Monte Carlo simulation studies we investigate the empirical type 1 error rates, powers, relative efficiencies of the GEE estimators, the effect of unequal sample size of two groups, and the performance of variance estimators for polytomous ordinal response variables, especially in small sample sizes.

Applications of proportional odds ordinal logistic regression models and continuation ratio models in examining the association of physical inactivity with erectile dysfunction among type 2 diabetic patients

  • Mathew, Anil C.;Siby, Elbin;Tom, Amal;Kumar R, Senthil
    • Korean Journal of Exercise Nutrition
    • /
    • v.25 no.1
    • /
    • pp.30-34
    • /
    • 2021
  • [Purpose] Many studies have observed a high prevalence of erectile dysfunction among individuals performing physical activity in less leisure-time. However, this relationship in patients with type 2 diabetic patients is not well studied. In exposure outcome studies with ordinal outcome variables, investigators often try to make the outcome variable dichotomous and lose information by collapsing categories. Several statistical models have been developed to make full use of all information in ordinal response data, but they have not been widely used in public health research. In this paper, we discuss the application of two statistical models to determine the association of physical inactivity with erectile dysfunction among patients with type 2 diabetes. [Methods] A total of 204 married men aged 20-60 years with a diagnosis of type 2 diabetes at the outpatient unit of the Department of Endocrinology at PSG hospitals during the months of May and June 2019 were studied. We examined the association between physical inactivity and erectile dysfunction using proportional odds ordinal logistic regression models and continuation ratio models. [Results] The proportional odds model revealed that patients with diabetes who perform leisure time physical activity for over 40 minutes per day have reduced odds of erectile dysfunction (odds ratio=0.38) across the severity categories of erectile dysfunction after adjusting for age and duration of diabetes. [Conclusion] The present study suggests that physical inactivity has a negative impact on erectile function. We observed that the simple logistic regression model had only 75% efficiency compared to the proportional odds model used here; hence, more valid estimates were obtained here.

Distribution of Public Service and Individual Job Performance in Peruvian Municipality

  • Ramirez-ASIS, Edwin;Huerta-SOTO, Rosario;Nivin-VARGAS, Laura;Huaranga-TOLEDO, Hober;Valera-AREDO, Julio;Flores-LEIVA, Victor
    • Journal of Distribution Science
    • /
    • v.20 no.10
    • /
    • pp.11-17
    • /
    • 2022
  • Purpose: This research aims to find the link between public service Distribution and individual job performance in the provincial municipality. Research design, data, and methodology: This is a quantitative approach study with a non-experimental and correlational design. The sample consisted of 140 employees appointed and hired by the provincial municipality of Huaraz. For data collection, Two questionnaires with an ordinal Likert-type scale and the Rho Spearman correlation coefficient were used to assess the link between the research variables., For Analysis: two questionnaires with an ordinal Likert-type scale and the Rho Spearman correlation coefficient were used to determine the connection between the research variables. Results: It was determined that both variables have a high degree of correlation (0.725), indicating a direct and significant relationship between the Distribution of public service and skill performance in the provincial municipality (0.614). Conclusion: Finally, this allows us to conclude that the institutional context is essential; that is, there is a significant correlation between the PSM and contextual performance in the provincial municipality of Huaraz, which has a Rho Spearman value of 0.723.

Bayesian ordinal probit semiparametric regression models: KNHANES 2016 data analysis of the relationship between smoking behavior and coffee intake (베이지안 순서형 프로빗 준모수 회귀 모형 : 국민건강영양조사 2016 자료를 통한 흡연양태와 커피섭취 간의 관계 분석)

  • Lee, Dasom;Lee, Eunji;Jo, Seogil;Choi, Taeryeon
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.1
    • /
    • pp.25-46
    • /
    • 2020
  • This paper presents ordinal probit semiparametric regression models using Bayesian Spectral Analysis Regression (BSAR) method. Ordinal probit regression is a way of modeling ordinal responses - usually more than two categories - by connecting the probability of falling into each category explained by a combination of available covariates using a probit (an inverse function of normal cumulative distribution function) link. The Bayesian probit model facilitates posterior sampling by bringing a latent variable following normal distribution, therefore, the responses are categorized by the cut-off points according to values of latent variables. In this paper, we extend the latent variable approach to a semiparametric model for the Bayesian ordinal probit regression with nonparametric functions using a spectral representation of Gaussian processes based BSAR method. The latent variable is decomposed into a parametric component and a nonparametric component with or without a shape constraint for modeling ordinal responses and predicting outcomes more flexibly. We illustrate the proposed methods with simulation studies in comparison with existing methods and real data analysis applied to a Korean National Health and Nutrition Examination Survey (KNHANES) 2016 for investigating nonparametric relationship between smoking behavior and coffee intake.

What Exacerbates the Probability of Business Closure in the Private Sector During the COVID-19 Pandemic? Evidence from World Bank Enterprise Survey Data

  • PHAM, Thi Bich Duyen;NGUYEN, Hoang Phong
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.9 no.6
    • /
    • pp.69-79
    • /
    • 2022
  • The purpose of the study is to look into the likelihood of private sector enterprises going bankrupt due to COVID-19 pandemic-related issues. The data for this study was taken from the World Bank's Enterprise Survey, which was intended to assess the impact of the COVID-19 pandemic on the business sector. This study uses the Ordinal Logit Method to analyze the model with dependent variables having ordinal values. The determinants reflect business performance, innovation, business relationships, and government support. According to the estimation results, a lower probability of business closures, illiquidity, and payment delays are found in businesses that maintain sales growth, operating hours, temporary workers, product portfolio, consumer demand, and input supply. Meanwhile, the increase in online business activities and receiving support from financial institutions and the government do not help businesses reduce the risk. Moreover, higher survival is found in manufacturing and developing countries. This implies the fragility of businesses in the retail and service sectors, especially for mega-enterprises in developed countries. In addition, the negative impact of the COVID-19 pandemic on businesses in Europe and West Asia is less severe than in other regions. The results imply policies to support the private sector during the pandemic, such as increasing labor market flexibility or rapidly implementing supportive policies.