• 제목/요약/키워드: Multiple Logistic Regression Analysis

검색결과 1,037건 처리시간 0.027초

빅데이터 통합모형 비교분석 (Comparison analysis of big data integration models)

  • 정병호;임동훈
    • Journal of the Korean Data and Information Science Society
    • /
    • 제28권4호
    • /
    • pp.755-768
    • /
    • 2017
  • 빅데이터가 4차 산업혁명의 핵심으로 자리하면서 빅데이터 기반 처리 및 분석 능력이 기업의 미래 경쟁력을 좌우할 전망이다. 빅데이터 처리 및 분석을 위한 RHadoop과 RHIPE 모형은 R과 Hadoop의 통합모형으로 지금까지 각각의 모형에 대해서는 연구가 많이 진행되어 왔으나 두 모형간 비교 연구는 거의 이루어 지지 않았다. 본 논문에서는 대용량의 실제 데이터와 모의실험 데이터에서 다중 회귀 (multiple regression)와 로지스틱 회귀 (logistic regression) 추정을 위한 머신러닝 (machine learning) 알고리즘을 MapReduce 프로그램 구현을 통해 RHadoop과 RHIPE 간의 비교 분석하고자 한다. 구축된 분산 클러스터 (distributed cluster) 하에서 두 모형간 성능 실험 결과, RHIPE은 RHadoop에 비해 대체로 빠른 처리속도를 보인 반면에 설치, 사용면에서 어려움을 보였다.

한국 성인의 당뇨병 미진단 비율 영향요인: 2차 자료 분석 연구 (Factors related to undiagnosed diabetes in Korean adults: a secondary data analysis)

  • 김보현
    • Journal of Korean Biological Nursing Science
    • /
    • 제25권4호
    • /
    • pp.295-305
    • /
    • 2023
  • Purpose: This study compared health behaviors and health-related clinical characteristics between individuals with normal glucose levels without diabetes and those with undiagnosed diabetes. Factors that were associated with undiagnosed diabetes were identified by sex. Methods: This was an observational study with a cross-sectional design based on data from the eighth Korea National Health and Nutrition Examination Survey, which used a stratified, multi-stage, cluster-sampling design to obtain a nationally representative sample. Multiple logistic regression analysis was employed to compute the odds ratios of health behaviors and clinical characteristics to identify risk factors for undiagnosed diabetes. Results: The overall prevalence of undiagnosed diabetes was 5.2% (weighted %, n = 700, p < .001). Among individuals with undiagnosed diabetes, 58.3% were men. Univariate logistic regression for undiagnosed diabetes identified sex, age, house income, educational level, and triglycerides as influencing factors. In multiple logistic regression by sex, the factors associated with undiagnosed diabetes in men were age, perceived health status, a diagnosis of angina, and triglycerides. Conclusion: Strategies should be targeted to improve health behaviors and clinical characteristics for specific age groups, men in bad perceived health status, women with high systolic blood pressure, and high triglycerides. Moreover, healthcare providers should understand the barriers to health behaviors and health-related quality of life to effectively deliver healthcare services.

인구주택 총조사 자료를 이용한 인구, 가구, 주택 특성과 빈집 현황 분석 (Analysis of the Effects of Population, Household, and Housing Characteristics on the Status of Empty Houses Using Population Housing Census Data)

  • 이지민;최원
    • 한국농공학회논문집
    • /
    • 제62권5호
    • /
    • pp.1-13
    • /
    • 2020
  • The empty houses' problem is important in the local revitalization and local sustainability, and these phenomenon caused by various factors of the region. The population and housing census data are the most effective data available to study this phenomenon by small regions. In this study, logistic regression and multiple regression analysis were performed to understand the effects of population, household, and housing characteristics on empty houses using population and housing census data. Also, the scale and direction of the effect of each characteristic in large cities, small cities, and rural areas were compared. As results, there was a slight difference between cities and province regions in the district and housing characteristic variables. In the comparison of Eup-Myeon-Dong, the affected variables were different in the Dong and Myeon areas. The significance of this study is to examine the effect of the characteristics of population and housing on the vacant houses and to confirm that the factors affecting different regions.

인공신경망을 이용한 인스턴트 메신저 선택 예측에 관한 연구 (A study on the forecasting of instant messinger's users choice using neural network)

  • 김동성;김계수
    • 한국품질경영학회:학술대회논문집
    • /
    • 한국품질경영학회 2004년도 품질경영모델을 통한 가치 창출
    • /
    • pp.597-602
    • /
    • 2004
  • This study examined the forecasting of instant messinger's users choice using neural network. We used the statistical methods which were Logistic Regression, MDA(Multiple Discriminant Analysis), and ANN(Artificial Neural Network). In the result, the forecasting performance of the ANN was better than conventional model(Logistic Regression, MDA).

  • PDF

1인가구 성인의 대사증후군 영향 요인 분석 (Analysis of Metabolic Syndrome in Korean Adult One-Person Households)

  • 안보미;손지희
    • 한국보건간호학회지
    • /
    • 제32권1호
    • /
    • pp.30-43
    • /
    • 2018
  • Purpose: This study was to conducted to investigate the prevalence and related factors of metabolic syndrome (MS) among Korean adults. Methods: We used secondary data of the sixth Korean National Health and Nutrition Examination Survey (KNHANES) from 2013 to 2015 and selected 4,939 adults 20 to 64 years old. General characteristics and health-related characteristics were included as related factors for analysis. Chi-square tests were used to compare the prevalence of MS between one-person and multiple-person households, while a multiple logistic regression model was used to identify factors to MS among one-person and multiple-person households. Results: MS of one-person households (26.4%) were significantly higher (${\chi}^2=7.81$, p=.017) than multiple-households (19.5%). Risk factors for MS were identified as walking, flexibility exercises, reading nutrition labels, and sleep hours in one-person households; and flexibility exercises and dietary control among multiple-person households using multiple logistic regression. Conclusion: Specialized health policies and programs should be provided to reduce MS prevalence in one-person households in consideration of risk factors revealed in this study.

Screening for Patients with Non-small Cell Lung Cancer Who Could Survive Long Term Chemotherapy

  • Wu, Xue-Yan;Huang, Xin-En
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제16권2호
    • /
    • pp.647-652
    • /
    • 2015
  • Background: Lung cancer was one of the most common cancers in both men and women all over the world. In this study, we aimed to clarify who could survive after long term chemotherapy in patients with advanced non-small cell lung cancer (NSCLC). Methods: We enrolled 186 patients with stage IV NSCLC after long term chemotherapy from Jun 2006 to Nov 2014 diagnosed in Jiangsu Cancer Hospital. Multiple variables like age, gender, smoking, histology of adenocarcinoma and squamous-cell cancer, number of metastatic sites, metastatic sites (e.g. lung, brain, bone, liver and pleura), hemoglobin, lymphocyte rate (LYR), Change of LYR during multiple therapies, hypertension, diabetes, chronic bronchitis, treatments (e.g.radiotherapy and targeted therapy) were selected. For consideration of factors influencing survival and response for patients with advanced NSCLC, logistic regression analysis and Cox regression analysis were used in an attempt to develop a screening module for patients with elevated survival after long term chemotherapy become possible. Results: Of the total of 186 patients enrolled, 69 survived less than 1 year (short-term group), 45 one to two years, and 72 longer than 3 years (long-term group). For logistic regression analysis, the short-term group was taken as control group and the long-term group as the case group. We found that age, histology of adenocarcinoma, metastatic site (e.g. lung and liver), treatments (e.g. targeted therapy and radiotherapy), LYR, a decreasing tendency of LYR and chronic bronchitis were individually associated with overall survival by Cox regression analysis. A multivariable Cox regression model showed that metastatic site (e.g. lung and liver), histology of adenocarcinoma, treatments (e.g. targeted therapy and radiotherapy) and chronic bronchitis were associated with overall survival. Thus metastatic site (e.g. lung and liver) and chronic bronchitis may be important risk factors for patients with advanced NSCLC. Gender, metastatic site (e.g. lung and liver), LYR and the decreasing tendency of LYR were significantly associated with long-term survival in the individual-variable logistic regression model (P<0.05). On multivariate logistic regression analysis, gender, metastatic site (e.g. lung and liver) and the decreasing tendency of LYR associated with long-term survival. Conclusions: In conclusion, female patients with stage IV adenocarcinoma of NSCLC who had decreasing tendency of LYR during the course therapy and had accepted multiple therapies e.g. more than third-line chemotherapy, radiotherapy and/or targeted therapy might be expected to live longer.

사전검사를 통한 고립성 폐결절 환자에서의 악성 확률 타당성에 대한 연구 (A Study to Validate the Pretest Probability of Malignancy in Solitary Pulmonary Nodule)

  • 장주현;박성훈;최정희;이창률;황용일;신태림;박용범;이재영;장승훈;김철홍;박상면;김동규;이명구;현인규;정기석
    • Tuberculosis and Respiratory Diseases
    • /
    • 제67권2호
    • /
    • pp.105-112
    • /
    • 2009
  • Background: Solitary pulmonary nodules (SPN) are encountered incidentally in 0.2% of patients who undergo chest X-ray or chest CT. Although SPN has malignant potential, it cannot be treated surgically by biopsy in all patients. The first stage is to determine if patients with SPN require periodic observation and biopsy or resection. An important early step in the management of patients with SPN is to estimate the clinical pretest probability of a malignancy. In every patient with SPN, it is recommended that clinicians estimate the pretest probability of a malignancy either qualitatively using clinical judgment or quantitatively using a validated model. This study examined whether Bayesian analysis or multiple logistic regression analysis is more predictive of the probability of a malignancy in SPN. Methods: From January 2005 to December 2008, this study enrolled 63 participants with SPN at the Kangnam Sacred Hospital. The accuracy of Bayesian analysis and Bayesian analysis with a FDG-PET scan, and Multiple logistic regression analysis was compared retrospectively. The accurate probability of a malignancy in a patient was compared by taking the chest CT and pathology of SPN patients with <30 mm at CXR incidentally. Results: From those participated in study, 27 people (42.9%) were classified as having a malignancy, and 36 people were benign. The result of the malignant estimation by Bayesian analysis was 0.779 (95% confidence interval [CI], 0.657 to 0.874). Using Multiple logistic regression analysis, the result was 0.684 (95% CI, 0.555 to 0.796). This suggests that Bayesian analysis provides a more accurate examination than multiple logistic regression analysis. Conclusion: Bayesian analysis is better than multiple logistic regression analysis in predicting the probability of a malignancy in solitary pulmonary nodules but the difference was not statistically significant.

병원에 근무하는 간병인의 업무적 특성과 직무스트레스가 근골격계 자각증상에 미치는 요인 (The Related Factor of Job Characteristics and Occupational Stress on Musculoskeletal Symptom for Caregiver Working in Hospital)

  • 최율정;심현보
    • 대한정형도수물리치료학회지
    • /
    • 제18권1호
    • /
    • pp.19-29
    • /
    • 2012
  • Background: This study was conducted to investigate subjective musculoskeletal symptom and the related factor of caregiver. Methods: For 300 caregiver, we used the self-administered questionnaires to examine occupational stress and subjective musculoskeletal symptom designed by KOSHA. The collected data were analyzed chi-square test, independent t-test and multiple logistic regression analysis using SPSS 12.0. Results: The multiple logistic regression analysis showed that the caregiver working in the general hospital significantly increased the subjective musculoskeletal symptom in their neck, shoulder, hand/wrist/finger, back, leg/foot. For the caregiver working in hospital showed significantly increased the subjective musculoskeletal symptom in their hand/wrist/finger and leg/foot. Conclusions: With the above results, continuous and systematic prevention program should be established, which include the ergonomics and psychosocial factor for the caregiver's musculoskeletal symptom.

  • PDF

중학생의 학교따돌림 피해경험과 건강상태, 스트레스 대처행동 (School Bullying Victimization, Health Status and Stress Coping Behavior of Middle School Students)

  • 최미경
    • 보건교육건강증진학회지
    • /
    • 제30권3호
    • /
    • pp.25-34
    • /
    • 2013
  • Objectives: The main purpose of this study was to examine factors influencing school bullying victimization of middle school students in relation to social support, self-esteem, stress coping behavior, and health status. Methods: The questionnaire survey was carried out on a convenience sample of 441 middle school students. The data analysis procedure included frequency, ${\chi}^2$-test, t-test, and multiple logistic regression. Results: It was found that 18% of the subjects were bullied by other students. Multiple logistic regression analysis revealed that the factors such as sex(OR=2.35, p=.006), aggressive coping behavior(OR=1.18, p=.028), and health status(OR=1.04, p=.002) were significant affecting factors. Conclusions: The findings suggest that to prevent middle school students' bullying victimization, it is necessary to design intervention programs that considering their health status and stress coping behavior.

일 지역 초등학교 고학년 학생의 과체중에서 비만으로 진행요인 (A Study on Overweight and Obesity in Childhood)

  • 유장학;최희정;김영미
    • 동서간호학연구지
    • /
    • 제16권2호
    • /
    • pp.156-163
    • /
    • 2010
  • Purpose: The purpose of this study was to investigate factors associated with childhood obesity. Methods: In this descriptive study, the number of participants was 78 4-6th grade students in a primary school, and the subjects were overweight or obese. Data were collected for 10 days from June 1 to 10 in 2009. The data were analyzed using descriptive statistics, independent-samples t-test, $X^2$-test, and multiple logistic regression. Results: Mother's occupation is a statistically significant factor in a bivariate analysis. In multiple logistic regression analysis, the statistically significant factors were mother's occupation (OR=0.17, 95% CI=0.04-0.70) and dietary attitudes (OR=0.08, 95% CI=0.01-0.66). Conclusion: For an effective obesity management program in childhood, it is necessary to consider a mother occupation and dietary attitudes.