• 제목/요약/키워드: Survey regression model

검색결과 1,307건 처리시간 0.027초

로지스틱 회귀모형과 의사결정 나무모형을 활용한 청소년 자살 시도 예측모형 비교: 2019 청소년 건강행태 온라인조사를 이용한 2차 자료분석 (Comparison of the Prediction Model of Adolescents' Suicide Attempt Using Logistic Regression and Decision Tree: Secondary Data Analysis of the 2019 Youth Health Risk Behavior Web-Based Survey)

  • 이윤주;김희진;이예슬;정혜선
    • 대한간호학회지
    • /
    • 제51권1호
    • /
    • pp.40-53
    • /
    • 2021
  • Purpose: The purpose of this study was to develop and compare the prediction model for suicide attempts by Korean adolescents using logistic regression and decision tree analysis. Methods: This study utilized secondary data drawn from the 2019 Youth Health Risk Behavior web-based survey. A total of 20 items were selected as the explanatory variables (5 of sociodemographic characteristics, 10 of health-related behaviors, and 5 of psychosocial characteristics). For data analysis, descriptive statistics and logistic regression with complex samples and decision tree analysis were performed using IBM SPSS ver. 25.0 and Stata ver. 16.0. Results: A total of 1,731 participants (3.0%) out of 57,303 responded that they had attempted suicide. The most significant predictors of suicide attempts as determined using the logistic regression model were experience of sadness and hopelessness, substance abuse, and violent victimization. Girls who have experience of sadness and hopelessness, and experience of substance abuse have been identified as the most vulnerable group in suicide attempts in the decision tree model. Conclusion: Experiences of sadness and hopelessness, experiences of substance abuse, and experiences of violent victimization are the common major predictors of suicide attempts in both logistic regression and decision tree models, and the predict rates of both models were similar. We suggest to provide programs considering combination of high-risk predictors for adolescents to prevent suicide attempt.

지역별 건강생활 실천율의 영향요인: 시군구 단위 접근 (Determinants of Healthy Living Practice: County Approach)

  • 정초록;김지만;박종연;신의철;최병호
    • 보건행정학회지
    • /
    • 제30권3호
    • /
    • pp.376-385
    • /
    • 2020
  • Background: The purpose of this study is to investigate the factors affecting the healthy living practice rate such as non-smoking, moderate drinking, walking, and low-salt diet by elementary municipality (so called, 'si-gun-hu'). Methods: The 2016 Korean Community Health Survey was used for the analysis. The theoretical model is founded upon the Anderson model, and both the multiple linear regression analysis and the beta regression analysis was performed for estimation. Results: As a result of the beta regression analysis, healthy living practice rate was found to be significantly higher in the areas with a less number of cigarette retailers, participating in healthy city projects, a low proportion of people who perceive their body type as obesity, a higher proportion of women, and a lower proportion of spouses. Conclusion: In order to improve healthy living practices, the regulations on health risk businesses, the spread of Healthy City project, and policy efforts awaring obesity are recommended.

Measurement Error Variance Estimation Based on Complex Survey Data with Subsample Re-Measurements

  • Heo, Sunyeong;Eltinge, John L.
    • Communications for Statistical Applications and Methods
    • /
    • 제10권2호
    • /
    • pp.553-566
    • /
    • 2003
  • In many cases, the measurement error variances may be functions of the unknown true values or related covariates. This paper considers design-based estimators of the parameters of these variance functions based on the within-unit sample variances. This paper devotes to: (1) define an error scale factor $\delta$; (2) develop estimators of the parameters of the linear measurement error variance function of the true values under large-sample and small-error conditions; (3) use propensity methods to adjust survey weights to account for possible selection effects at the replicate level. The proposed methods are applied to medical examination data from the U.S. Third National Health and Nutrition Examination Survey (NHANES III).

회귀나무 모형을 이용한 패널데이터 분석 (Panel data analysis with regression trees)

  • 장영재
    • Journal of the Korean Data and Information Science Society
    • /
    • 제25권6호
    • /
    • pp.1253-1262
    • /
    • 2014
  • 회귀나무 (regression tree)는 독립변수로 이루어진 공간을 재귀적으로 분할하고 해당 영역에서 종속변수의 최선의 예측값을 찾고자 하는 비모수적 방법론이다. 회귀나무 모형이 제안된 이래 로지스틱 회귀나무모형이나 분위수 회귀나무모형과 같이 유연하고 다양한 모형적합을 위한 연구가 진행되어 왔다. 최근에 들어서는 Sela와 Simonoff (2012)의 RE-EM 알고리즘, Loh와 Zheng (2013)의 GUIDE 등 패널데이터와 관련하여 진일보한 나무모형 알고리즘도 제안되었다. 본 논문에서는 각 알고리즘을 소개하고 특징을 살펴보는 한편, 실험 데이터를 생성하여 평균제곱오차 (mean squared error)를 바탕으로 예측력을 비교하였다. 분석결과, RE-EM 알고리즘의 예측력이 상대적으로 우수하게 나타났다. 이 알고리즘을 통해 기업경기실사지수 업종별 패널자료를 분석한 결과 최근의 업황에 가장 큰 영향을 미치는 요소는 매출 실적으로 나타났으며 매출 상위 그룹의 경우 비제조업이 제조업에 비해 업황에 대한 판단이 긍정적인 것으로 나타났다.

Hybrid Internet Business Model using Evolutionary Support Vector Regression and Web Response Survey

  • Jun, Sung-Hae
    • 한국지능시스템학회:학술대회논문집
    • /
    • 한국퍼지및지능시스템학회 2006년도 추계학술대회 학술발표 논문집 제16권 제2호
    • /
    • pp.408-411
    • /
    • 2006
  • Currently, the nano economy threatens the mass economy. This is based on the internet business models. In the nano business models based on internet, the diversely personalized services are needed. Many researches of the personalization on the web have been studied. The web usage mining using click stream data is a tool for personalization model. In this paper, we propose an internet business model using evolutionary support vector machine and web response survey as a web usage mining. After analyzing click stream data for web usage mining, a personalized service model is constructed in our work. Also, using an approach of web response survey, we improve the performance of the customers' satisfaction. From the experimental results, we verify the performance of proposed model using two data sets from KDD Cup 2000 and our web server.

  • PDF

미계측 유역 평균갈수량 산정을 위한 지역회귀모형의 개발 (Development of Regional Regression Model for Estimating Mean Low Flow in Ungauged Basins)

  • 이태희;이민호;이재응
    • 대한토목학회논문집
    • /
    • 제36권3호
    • /
    • pp.407-416
    • /
    • 2016
  • 본 연구에서는 미계측 유역의 평균갈수량 추정을 위한 지역회귀모형을 개발하고자 하였다. 12개 다목적댐과 4개의 용수댐에서 관측된 조절되지 않은 유입량 자료로부터 평균갈수량을 산정하였고, 이를 유역면적, 유역경사, 유역밀도, 연평균강수량, 유출곡선지수 등의 유역특성인자와의 상관분석을 통해 다양한 형태의 지역회귀모형을 개발하였다. 평균갈수량의 관측값과 추정값의 비교를 통해 각 회귀모형의 성능을 평가하였고, 유역면적, 연평균강수량, 유출곡선지수를 설명변량으로 하는 회귀모형이 가장 우수한 성능을 보였다. 또한 비유량법과 기존에 개발된 기존회귀모형과의 비교를 통해서 본 연구에서 개발한 모형의 적용성이 가장 우수한 것으로 분석되었다.

로지스틱 회귀분석과 퍼지 기법을 이용한 산사태 취약성 지도작성: 보은군을 대상으로 (Landslide susceptibility mapping using Logistic Regression and Fuzzy Set model at the Boeun Area, Korea)

  • 알-마문;장동호
    • 한국지형학회지
    • /
    • 제23권2호
    • /
    • pp.109-125
    • /
    • 2016
  • This study aims to identify the landslide susceptible zones of Boeun area and provide reliable landslide susceptibility maps by applying different modeling methods. Aerial photographs and field survey on the Boeun area identified landslide inventory map that consists of 388 landslide locations. A total ofseven landslide causative factors (elevation, slope angle, slope aspect, geology, soil, forest and land-use) were extracted from the database and then converted into raster. Landslide causative factors were provided to investigate about the spatial relationship between each factor and landslide occurrence by using fuzzy set and logistic regression model. Fuzzy membership value and logistic regression coefficient were employed to determine each factor's rating for landslide susceptibility mapping. Then, the landslide susceptibility maps were compared and validated by cross validation technique. In the cross validation process, 50% of observed landslides were selected randomly by Excel and two success rate curves (SRC) were generated for each landslide susceptibility map. The result demonstrates the 84.34% and 83.29% accuracy ratio for logistic regression model and fuzzy set model respectively. It means that both models were very reliable and reasonable methods for landslide susceptibility analysis.

예측소음도를 이용한 어노이언스 예측모델을 위한 로지스틱 회귀분석의 적용방법 (Application Method of Logistic Regression Analysis for Annoyance Prediction Model Based on Predicted Noise Level)

  • 손진희;이건;정태량;장서일
    • 한국소음진동공학회논문집
    • /
    • 제20권6호
    • /
    • pp.555-561
    • /
    • 2010
  • Predicted noise level has been used to assess the annoyance response since noise map was generalized and being the normal method to assess the environmental noise. Unfortunately using predicted noise level to derive the annoyance prediction curve caused some problems. The data have to be grouped manually to use the annoyance prediction curve. The aim of this paper is to propose the method to handle the predicted noise level and the survey data for annoyance prediction curve. This paper used the percentage of persons annoyed(%A) and the percentage of persons highly annoyed as the descriptor of noise annoyance in a population. The logistic regression method was used for deriving annoyance prediction curve. It is concluded that the method of dichotomizing data and logistic regression was suitable to handle the predicted noise level and survey data.

서비스 수요조사와 분류모형을 이용한 수요예측 (Mixture Model with Survey and a Statistical Model)

  • 김윤종;김용철
    • 응용통계연구
    • /
    • 제21권1호
    • /
    • pp.151-157
    • /
    • 2008
  • 수요예측은 모든 생산적 활동을 수립하기 위한 기반이 되기 때문에 수요가 어느 정도 발생할 것인가에 대한 방향성에 대하여 파악하려고 일반적으로 설문조사를 이용하지만 무응답 및 불성실한 응답으로 인하여 설문응답 자료만으로 수요를 예측하기에는 부족하다. 따라서 수요와 관련 있는 변수를 이용한 분류모형으로 설문조사의 수요예측을 보정하고자 하였다. 본 논문에서는 설문조사를 통하여 평가 할 수 있는 직접적인 수요와 통계적 모형을 이용한 간접적 수요를 혼합하여 서비스 수요를 예측하는 혼합 모형을 제시하고자 한다.

인체변수의 계층적 추정기법 개발 및 적용 (Development and application of a hierarchical estimation method for anthropometric variables)

  • 류태범;유희천
    • 대한인간공학회지
    • /
    • 제22권4호
    • /
    • pp.59-78
    • /
    • 2003
  • Most regression models of anthropometric variables use stature and/or weight as regressors; however, these 'flat' regression models result in large errors for anthropometric variables having low correlations with the regressors. To develop more accurate regression models for anthropometric variables, this study proposed a method to estimate anthropometric variables in a hierarchical manner based on the relationships among the variables and a process to develop and improve corresponding regression models. By applying the proposed approach, a hierarchical estimation structure was constructed for 59 anthropometric variables selected for the occupant package design of a passenger car and corresponding regression models were developed with the 1988 US Army anthropometric survey data. The hierarchical regression models were compared with the corresponding flat regression models in terms of accuracy. As results, the standard errors of the hierarchical regression models decreased by 28% (4.3mm) on average compared with those of the flat models.