• Title/Summary/Keyword: Survey regression model

Search Result 1,298, Processing Time 0.032 seconds

Estimation of Hard-to-Measure Measurements in Anthropometric Surveys

  • Choi, Jong-Hoo;Kim, Ryu-Jin
    • Communications for Statistical Applications and Methods
    • /
    • v.9 no.1
    • /
    • pp.213-220
    • /
    • 2002
  • Anthropometric survey is important as a basis for human engineering fields. According to our experiences, there are difficulties in obtaining the measurements of some body parts because respondents are reluctant to expose. In order to overcome these difficulties, we propose a method for estimating such hard-to-measure measurements by using easy-to-measure measurements those are closely related to them. Multiple Regression Model, Feedforward Neural Network(FNN) Model and Projection Pursuit Regression(PPR) Model will be used as analytical tools for this purpose. The method we propose will be illustrated with real data from the 1992 Korea national anthropometric survey.

Influencing factors of using Korean Medicine services - focusing on the 2017 Korean Medicine Utilization Survey (한방의료이용 선택 요인에 관한 연구 - 2017 한방의료이용실태조사를 중심으로)

  • Lim, Jinwoong;Lee, Kee-Jae
    • The Journal of Korean Medicine
    • /
    • v.42 no.1
    • /
    • pp.12-25
    • /
    • 2021
  • Objectives: The aim of this study was to investigate influencing factors of using Korean medicine services (KMS) using the 2017 Korean Medicine Utilization Survey (KMUS). Methods: Demographic statistics of the survey were summarized and influencing factors of the KMS experience and the intention to visit KMS were analyzed using logistic regression model with complex sample design. Influencing factors were specified based on Andersen's behavioral model of health care utilization and factors associated with individual recognitions of KMS. Additionally, using the ordinary logistic regression model without complex sample design, the survey data were analyzed to compare the results. Results: In the logistic regression analysis, sex, age, health condition, presence of chronic disease, a degree of knowledge about Korean Medicine, and a view about herbal medicine safety were statistically significant both in the KMS experience, and the intention to visit KMS. Marital status was statistically significant in the KMS experience, while family income, a view about the cost of KMS were statistically significant in the intention to visit KMS. Conclusion: Individual recognitions of KMS and enabling components should be considered when establishing KMS policies. In addition, future studies analyzing KMUS need to take into account the complex sample design features of the survey to avoid statistically misleading results.

Comparison of Regression Model Approaches fined to Complex Survey Data (복합표본조사 데이터 분석을 위한 회귀모형 접근법의 비교: 소규모사업체조사 데이터 분석을 중심으로)

  • 이기재
    • Survey Research
    • /
    • v.2 no.1
    • /
    • pp.73-86
    • /
    • 2001
  • In this paper. we conducted an empirical study to investigate the design and weighting effects on descriptive and analytic statistics. We compared the regression models using the design-based approach and the generalized estimating equations (GEEs) approach with the model-based approach through the design and weighting effects analysis.

  • PDF

Modeling clustered count data with discrete weibull regression model

  • Yoo, Hanna
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.4
    • /
    • pp.413-420
    • /
    • 2022
  • In this study we adapt discrete weibull regression model for clustered count data. Discrete weibull regression model has an attractive feature that it can handle both under and over dispersion data. We analyzed the eighth Korean National Health and Nutrition Examination Survey (KNHANES VIII) from 2019 to assess the factors influencing the 1 month outpatient stay in 17 different regions. We compared the results using clustered discrete Weibull regression model with those of Poisson, negative binomial, generalized Poisson and Conway-maxwell Poisson regression models, which are widely used in count data analyses. The results show that the clustered discrete Weibull regression model using random intercept model gives the best fit. Simulation study is also held to investigate the performance of the clustered discrete weibull model under various dispersion setting and zero inflated probabilities. In this paper it is shown that using a random effect with discrete Weibull regression can flexibly model count data with various dispersion without the risk of making wrong assumptions about the data dispersion.

A Study on Estimate Model for Peak Time Congestion

  • Kim, Deug-Bong;Yoo, Sang-Lok
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.20 no.3
    • /
    • pp.285-291
    • /
    • 2014
  • This study applied regression analysis to evaluate the impact of hourly average congestion calculated by bumper model in the congested area of each passage of each port on the peak time congestion, to suggest the model formula that can predict the peak time congestion. This study conducted regression analysis of hourly average congestion and peak time congestion based on the AIS survey study of 20 ports in Korea. As a result of analysis, it was found that the hourly average congestion has a significant impact on the peak time congestion and the prediction model formula was derived. This formula($C_p=4.457C_a+29.202$) can be used to calculate the peak time congestion based on the predicted hourly average congestion.

Machine learning-based Predictive Model of Suicidal Thoughts among Korean Adolescents. (머신러닝 기반 한국 청소년의 자살 생각 예측 모델)

  • YeaJu JIN;HyunKi KIM
    • Journal of Korea Artificial Intelligence Association
    • /
    • v.1 no.1
    • /
    • pp.1-6
    • /
    • 2023
  • This study developed models using decision forest, support vector machine, and logistic regression methods to predict and prevent suicidal ideation among Korean adolescents. The study sample consisted of 51,407 individuals after removing missing data from the raw data of the 18th (2022) Youth Health Behavior Survey conducted by the Korea Centers for Disease Control and Prevention. Analysis was performed using the MS Azure program with Two-Class Decision Forest, Two-Class Support Vector Machine, and Two-Class Logistic Regression. The results of the study showed that the decision forest model achieved an accuracy of 84.8% and an F1-score of 36.7%. The support vector machine model achieved an accuracy of 86.3% and an F1-score of 24.5%. The logistic regression model achieved an accuracy of 87.2% and an F1-score of 40.1%. Applying the logistic regression model with SMOTE to address data imbalance resulted in an accuracy of 81.7% and an F1-score of 57.7%. Although the accuracy slightly decreased, the recall, precision, and F1-score improved, demonstrating excellent performance. These findings have significant implications for the development of prediction models for suicidal ideation among Korean adolescents and can contribute to the prevention and improvement of youth suicide.

Statistical analysis of KNHANES data with measurement error models

  • Hwang, Jinseub
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.3
    • /
    • pp.773-779
    • /
    • 2015
  • We study a statistical analysis about the fifth wave data of the Korea National Health and Nutrition Examination Survey based on linear regression models with measurement errors. The data is obtained from a national population-based complex survey. To demonstrate the availability of measurement error models, two results between the general linear regression model and measurement error model are compared based on the model selection criteria which are Akaike information criterion and Bayesian information criterion. For our study, we use the simulation extrapolation algorithm for measurement error model and the jackknife method for the estimation of standard errors.

Application of discrete Weibull regression model with multiple imputation

  • Yoo, Hanna
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.3
    • /
    • pp.325-336
    • /
    • 2019
  • In this article we extend the discrete Weibull regression model in the presence of missing data. Discrete Weibull regression models can be adapted to various type of dispersion data however, it is not widely used. Recently Yoo (Journal of the Korean Data and Information Science Society, 30, 11-22, 2019) adapted the discrete Weibull regression model using single imputation. We extend their studies by using multiple imputation also with several various settings and compare the results. The purpose of this study is to address the merit of using multiple imputation in the presence of missing data in discrete count data. We analyzed the seventh Korean National Health and Nutrition Examination Survey (KNHANES VII), from 2016 to assess the factors influencing the variable, 1 month hospital stay, and we compared the results using discrete Weibull regression model with those of Poisson, negative Binomial and zero-inflated Poisson regression models, which are widely used in count data analyses. The results showed that the discrete Weibull regression model using multiple imputation provided the best fit. We also performed simulation studies to show the accuracy of the discrete Weibull regression using multiple imputation given both under- and over-dispersed distribution, as well as varying missing rates and sample size. Sensitivity analysis showed the influence of mis-specification and the robustness of the discrete Weibull model. Using imputation with discrete Weibull regression to analyze discrete data will increase explanatory power and is widely applicable to various types of dispersion data with a unified model.

Design and Weighting Effects in Small Firm Server in Korea

  • Lee, Keejae;Lepkowski, James M.
    • Communications for Statistical Applications and Methods
    • /
    • v.9 no.3
    • /
    • pp.775-786
    • /
    • 2002
  • In this paper, we conducted an empirical study to investigate the design and weighting effects on descriptive and analytic statistics. The design and weighting effects were calculated for estimates produced from the 1998 small firm survey data. We considered the design and weighting effects on coefficients estimates of regression model using the design-based approach and the GEE approach.

A Study on Diagnostics Method for Categorical Data (범주형 자료의 진단방법에 관한 연구)

  • 이선규;조범석
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.18 no.33
    • /
    • pp.93-102
    • /
    • 1995
  • In this study we are concerned with the diagnostics method of cross-classified categorical data using logistic regression model of binary response models for cell proportions. under this model, we could examine the goodness-of-fit of the models using Pearson's $x^2$test statistic and likelihood ratio statistic. Under this model, these statistics are assumed that sample survey schemes are with replacement sampling model. But these statistics are often inappropriate for analysing contingency tables consists of complex sampling schemes obtained sample survey data. In this study we are examined diagnostics procedures detecting any outlying cell proportions and influential observations on design space in logistic regression modeltake account of the survey design effects.

  • PDF