• Title/Summary/Keyword: Binary Logistic Analysis

Search Result 278, Processing Time 0.022 seconds

Analyzing Intention to Use Shared E-scooters Considering Individual Travel Attitudes : The Case of Seoul Metropolitan Areas (개인 통행성향을 고려한 공유 전동킥보드 이용의향 분석: 서울시를 중심으로)

  • Lee, Yoonhee;Koo, Jahun;Choo, Sangho
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.21 no.1
    • /
    • pp.1-16
    • /
    • 2022
  • Recently, e-scooters have been attracting attention as eco-friendly modes of transportation in cities due to an increasing interest in the environment. Accordingly, various studies on usage behavior are being conducted, but studies that reflect individual travel attitudes are insufficient. Therefore, this study surveyed commuters in Seoul and analyzed respondents' traveling attitudes through factor analysis. It also built a binary logistic regression model for the intention to use shared e-scooters to determine how individual travel behaviors are affected. In particular, the model results showed that age, the main mode of transportation (car), walking time to the bus stop, and four travel attitude variables (disutility of travel, preference to self-drive, internet/smartphone friendliness, and willingness to pay extra money for services) significantly affected the intention to use shared e-scooters. This study is expected to be used as basic data, with aspect to travel behavior, for the efficient operation and use of shared e-scooters in the future.

Statistical Analysis for Risk Factors and Prediction of Hypertension based on Health Behavior Information (건강행위정보기반 고혈압 위험인자 및 예측을 위한 통계분석)

  • Heo, Byeong Mun;Kim, Sang Yeob;Ryu, Keun Ho
    • Journal of Digital Contents Society
    • /
    • v.19 no.4
    • /
    • pp.685-692
    • /
    • 2018
  • The purpose of this study is to develop a prediction model of hypertension in middle-aged adults using Statistical analysis. Statistical analysis and prediction models were developed using the National Health and Nutrition Survey (2013-2016).Binary logistic regression analysis showed statistically significant risk factors for hypertension, and a predictive model was developed using logistic regression and the Naive Bayes algorithm using Wrapper approach technique. In the statistical analysis, WHtR(p<0.0001, OR = 2.0242) in men and AGE (p<0.0001, OR = 3.9185) in women were the most related factors to hypertension. In the performance evaluation of the prediction model, the logistic regression model showed the best predictive power in men (AUC = 0.782) and women (AUC = 0.858). Our findings provide important information for developing large-scale screening tools for hypertension and can be used as the basis for hypertension research.

Development of Standardized Predictive Models for Traditional Korean Medical Diagnostic Pattern Identification in Stroke Subjects: A Hospital-based Multi-center Trial

  • Jung, Woo-Sang;Cho, Seung-Yeon;Park, Seong-Uk;Moon, Sang-Kwan;Park, Jung-Mi;Ko, Chang-Nam;Cho, Ki-Ho;Kwon, Seungwon
    • The Journal of Korean Medicine
    • /
    • v.40 no.4
    • /
    • pp.49-60
    • /
    • 2019
  • Objectives: To develop a standardized diagnostic pattern identification equation for stroke patients, our group conducted a study to derive the predictive logistic equations. However, the sample size was relatively small. In the current study, we aimed to derive new predictive logistic equations for each diagnostic pattern using an expanded number of subjects. Methods: This study was a hospital-based multi-center trial recruited stroke patients within 30 days of symptom onset. Patients' general information, and the variables related to diagnostic pattern identification were measured. The diagnostic pattern of each patient was identified independently by two Korean Medicine Doctors. To derive a predictive model for pattern identification, binary logistic regression analysis was applied. Results: Among the 1,251 patients, 385 patients (30.8%) had the Fire Heat Pattern, 460 patients (36.8%) the Phlegm Dampness Pattern, 212 patients (16.9%) the Qi Deficiency Pattern, and 194 patients (15.5%) the Yin Deficiency Pattern. After the regression analysis, the predictive logistic equations for each pattern were determined. Conclusion: The predictive equations for Fire Heat, Phlegm Dampness, Qi Deficiency, and Yin Deficiency would be useful to determine individual stroke patients' pattern identification in the clinical setting. However, further studies using objective measurements are necessary to validate these data.

Hybrid Learning Architectures for Advanced Data Mining:An Application to Binary Classification for Fraud Management (개선된 데이터마이닝을 위한 혼합 학습구조의 제시)

  • Kim, Steven H.;Shin, Sung-Woo
    • Journal of Information Technology Application
    • /
    • v.1
    • /
    • pp.173-211
    • /
    • 1999
  • The task of classification permeates all walks of life, from business and economics to science and public policy. In this context, nonlinear techniques from artificial intelligence have often proven to be more effective than the methods of classical statistics. The objective of knowledge discovery and data mining is to support decision making through the effective use of information. The automated approach to knowledge discovery is especially useful when dealing with large data sets or complex relationships. For many applications, automated software may find subtle patterns which escape the notice of manual analysis, or whose complexity exceeds the cognitive capabilities of humans. This paper explores the utility of a collaborative learning approach involving integrated models in the preprocessing and postprocessing stages. For instance, a genetic algorithm effects feature-weight optimization in a preprocessing module. Moreover, an inductive tree, artificial neural network (ANN), and k-nearest neighbor (kNN) techniques serve as postprocessing modules. More specifically, the postprocessors act as second0order classifiers which determine the best first-order classifier on a case-by-case basis. In addition to the second-order models, a voting scheme is investigated as a simple, but efficient, postprocessing model. The first-order models consist of statistical and machine learning models such as logistic regression (logit), multivariate discriminant analysis (MDA), ANN, and kNN. The genetic algorithm, inductive decision tree, and voting scheme act as kernel modules for collaborative learning. These ideas are explored against the background of a practical application relating to financial fraud management which exemplifies a binary classification problem.

  • PDF

Factors Influencing Post Traumatic Stress Disorder in Crime Scene Investigators (경찰 과학수사요원의 외상 후 스트레스 장애 발생 영향요인)

  • Nho, Seon Mi;Kim, Eun A
    • Journal of Korean Academy of Nursing
    • /
    • v.47 no.1
    • /
    • pp.39-48
    • /
    • 2017
  • Purpose: The purpose of this study was to verify the relationships among social support, resilience and post traumatic stress disorder (PTSD), and especially to identify factors influencing PTSD in police crime scene investigators. Methods: A cross-sectional design was used, with a convenience sample of 226 police crime scene investigators from 7 Metropolitan Police Agencies. Data were collected through self-report questionnaires during July and August, 2015. Data were analyzed using t-test, ${\chi}^2$-test, Fisher's exact test, and binary logistic regression analysis with SPSS/WIN 21.0 program. Results: The mean score for PTSD in police crime scene investigators was 13.69 .11 points. Of the crime scene investigators 181 (80.1%) were in the low-risk group and 45 (19.9%) in high-risk group. Social support (t=5.68, p<.001) and resilience (t=5.47, p<.001) were higher in the low-risk group compared to the high-risk group. Logistic regression analysis showed that resilience (OR=4.74, 95% CI: 1.57~14.35), and social support (OR=2.13, 95% CI: 1.23~3.69) are effect factors for PTSD low group. Conclusion: For effective improvement of PTSD in police crime scene investigators, intervention programs including social support and strategies to increase should be established.

Relationship between Increased Intracranial Pressure and Mastoid Effusion

  • Jung, Hoonkyo;Jang, Kyoung Min;Ko, Myeong Jin;Choi, Hyun Ho;Nam, Taek Kyun;Kwon, Jeong-Taik;Park, Yong-sook
    • Journal of Korean Neurosurgical Society
    • /
    • v.63 no.5
    • /
    • pp.640-648
    • /
    • 2020
  • Objective : This study aimed to assess the relationship between increased intracranial pressure (ICP) and mastoid effusions (ME). Methods : Between January 2015 and October 2018, patients who underwent intracranial surgery and had ICP monitoring catheters placed were enrolled. ICP was recorded hourly for at least 3 days. ME was determined by the emergence of opacification in mastoid air cells on follow-up brain imaging. C-reactive protein (CRP) levels, presence of endotracheal tube (ETT) and nasogastric tube (NGT), duration of intensive care unit (ICU) stay, duration of mechanical ventilator application, diagnosis, surgical modalities, and presence of sinusitis were recorded. Each factor's effect on the occurrence of ME was analyzed by binary logistic regression analyses. To analyze the independent effects of ICP as a predictor of ME a multivariable logistic regression analysis was performed. Results : Total of 61 (53%) out of 115 patients had ME. Among the patients who had unilateral brain lesions, 94% of subject (43/50) revealed the ipsilateral development of ME. ME developed at a mean of 11.1±6.2 days. The variables including mean ICP, peak ICP, age, trauma, CRP, ICU stays, application of mechanical ventilators and presence of ETT and NGT showed statistically significant difference between ME groups and non-ME groups in univariate analysis. Sex and the occurrence of sinusitis did not differ between two groups. Adding the ICP variables significantly improved the prediction of ME in multivariable logistic regression analysis. Conclusion : While multiple factors affect ME, this study demonstrates that ICP and ME are probably related. Further studies are needed to determine the mechanistic relationship between ICP and middle ear pressure.

Factors Affecting Injury Severity in Pedestrian-Vehicle Crash by Novice Driver (초보 운전자에 의한 보행자-차량 교통사고의 심각도 영향 요인 분석)

  • Choe, Sae-Ro-Na;Park, Jun-Hyeong;O, Cheol
    • Journal of Korean Society of Transportation
    • /
    • v.29 no.4
    • /
    • pp.43-51
    • /
    • 2011
  • Since a variety of factors are associated with crash occurrence, the analysis of causes of crash is a hard task for traffic researchers and engineers. Among contributing factors leading to crash, the characteristics of driver is of keen interest. This study attempted to identify factors affecting the severity of pedestrian in the collision between pedestrian and vehicle. In particular, our analyses were focused on the novice driver. A binary logistic regression technique was adopted for the analyses. The results showed that driver's age, crash location, and the frequency of violations were dominant factors for the severity. Findings are expected to be useful information for deffective policy- and education-based countermeasures.

Prediction of Rear-end Crash Potential using Vehicle Trajectory Data (차량 주행궤적을 이용한 후미추돌 가능성 예측 모형)

  • Kim, Tae-Jin;O, Cheol;Gang, Gyeong-Pyo
    • Journal of Korean Society of Transportation
    • /
    • v.29 no.3
    • /
    • pp.73-82
    • /
    • 2011
  • Recent advancement in traffic surveillance systems has allowed the researchers to obtain more detailed vehicular movement such as individual vehicle trajectory data. Understanding the characteristics of interactions between leading and following vehicles in the traffic flow stream is a backbone for designing and evaluating more sophisticated traffic and vehicle control strategies. This study proposes a methodology for estimating rear-end crash potential, as a probabilistic measure, in real-time based on the analysis of vehicular movements. The methodology presented in this study consists of three components. The first predicts vehicle position and speed every second using a Kalman filtering technique. The second estimates the probability for the vehicle's trajectory to belong to either 'changing lane' or 'going straight'. A binary logistic regression (BLR) is used to model the lane-changing decision of the subject vehicle. The other component calculates crash probability by employing an exponential decay function that uses time-to-collision (TTC) between the subject vehicle and the front vehicle. The result of this study is expected to be adapted in developing traffic control and information systems, in particular, for crash prevention.

Expression of p53 Breast Cancer in Kurdish Women in the West of Iran: a Reverse Correlation with Lymph Node Metastasis

  • Payandeh, Mehrdad;Sadeghi, Masoud;Sadeghi, Edris;Madani, Seyed-Hamid
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.17 no.3
    • /
    • pp.1261-1264
    • /
    • 2016
  • Background: In breast cancer (BC), it has been suggested that nuclear overexpression of p53 protein might be an indicator of poor prognosis. The aim of the current study was to evaluate the expression of p53 BC in Kurdish women from the West of Iran and its correlation with other clinicopathology figures. Materials and Methods: In the present retrospective study, 231 patients were investigated for estrogen receptor (ER) and progesterone receptor (PR) positivity, defined as ${\geq}10%$ positive tumor cells with nuclear staining. A binary logistic regression model was selected using Akaike Information Criteria (AIC) in stepwise selection for determination of important factors. Results: ER, PR, the human epidermal growth factor receptor 2 (HER2) and p53 were positive in 58.4%, 55.4%, 59.7% and 45% of cases, respectively. Ki67 index was divided into two groups: 54.5% had Ki67<20% and 45.5% had Ki67 ${\geq}20%$. Of 214 patients, 137(64%) had lymph node metastasis and of 186 patients, 122(65.6%) had vascular invasion. Binary logistic regression analysis showed that there was inverse significant correlation between lymph node metastasis (P=0.008, OR 0.120 and 95%CI 0.025-0.574), ER status (P=0.006, OR 0.080, 95%CI 0.014-0.477) and a direct correlation between HER2 (P=005, OR 3.047, 95%CI 1.407-6.599) with the expression of p53. Conclusions: As in a number of studies, expression of p53 had a inverse correlation with lymph node metastasis and ER status and also a direct correlation with HER2 status. Also, p53-positivity is more likely in triple negative BC compared to other subtypes.

Recommended Rice Intake Levels Based on Average Daily Dose and Urinary Excretion of Cadmium in a Cadmium-Contaminated Area of Northwestern Thailand

  • La-Up, Aroon;Wiwatanadate, Phongtape;Pruenglampoo, Sakda;Uthaikhup, Sureeporn
    • Toxicological Research
    • /
    • v.33 no.4
    • /
    • pp.291-297
    • /
    • 2017
  • This study was performed to investigate the dose-response relationship between average daily cadmium dose (ADCD) from rice and the occurrence of urinary cadmium (U-Cd) in individuals eating that rice. This was a retrospective cohort designed to compare populations from two areas with different levels of cadmium contamination. Five-hundred and sixty-seven participants aged 18 years or older were interviewed to estimate their rice intake, and were assessed for U-Cd. The sources of consumed rice were sampled for cadmium measurement, from which the ADCD was estimated. Binary logistic regression was used to examine the association between ADCD and U-Cd (cut-off point at $2{\mu}g/g$ creatinine), and a correlation between them was established. The lowest estimate was $ADCD=0.5{\mu}g/kg\;bw/day$ [odds ratio (OR) = 1.71; with a 95% confidence interval (CI) 1.02-2.87]. For comparison, the relationship in the contaminated area is expressed by $ADCD=0.7{\mu}g/kg\;bw/day$, OR = 1.84; [95 % CI, 1.06-3.19], while no relationship was found in the non-contaminated area, meaning that the highest level at which this relationship does not exist is $ADCD=0.6{\mu}g/kg\;bw/day$ [95% CI, 0.99-2.95]. Rice, as a main staple food, is the most likely source of dietary cadmium. Abstaining from or limiting rice consumption, therefore, will increase the likelihood of maintaining U-Cd within the normal range. As the recommended maximum ADCD is not to exceed $0.6{\mu}g/kg\;bw/day$, the consumption of rice grown in cadmium-contaminated areas should not be more than 246.8 g/day. However, the exclusion of many edible plants grown in the contaminated area from the analysis might result in an estimated ADCD that does not reflect the true level of cadmium exposure among local people.