• Title/Summary/Keyword: logistic regression

Search Result 5,239, Processing Time 0.154 seconds

Value Weighted Regularized Logistic Regression Model (속성값 기반의 정규화된 로지스틱 회귀분석 모델)

  • Lee, Chang-Hwan;Jung, Mina
    • Journal of KIISE
    • /
    • v.43 no.11
    • /
    • pp.1270-1274
    • /
    • 2016
  • Logistic regression is widely used for predicting and estimating the relationship among variables. We propose a new logistic regression model, the value weighted logistic regression, which comprises of a fine-grained weighting method, and assigns adapted weights to each feature value. This gradient approach obtains the optimal weights of feature values. Experiments were conducted on several data sets from the UCI machine learning repository, and the results revealed that the proposed method achieves meaningful improvement in the prediction accuracy.

Variable Selection for Logistic Regression Model Using Adjusted Coefficients of Determination (수정 결정계수를 사용한 로지스틱 회귀모형에서의 변수선택법)

  • Hong C. S.;Ham J. H.;Kim H. I.
    • The Korean Journal of Applied Statistics
    • /
    • v.18 no.2
    • /
    • pp.435-443
    • /
    • 2005
  • Coefficients of determination in logistic regression analysis are defined as various statistics, and their values are relatively smaller than those for linear regression model. These coefficients of determination are not generally used to evaluate and diagnose logistic regression model. Liao and McGee (2003) proposed two adjusted coefficients of determination which are robust at the addition of inappropriate predictors and the variation of sample size. In this work, these adjusted coefficients of determination are applied to variable selection method for logistic regression model and compared with results of other methods such as the forward selection, backward elimination, stepwise selection, and AIC statistic.

Landslide susceptibility mapping using Logistic Regression and Fuzzy Set model at the Boeun Area, Korea (로지스틱 회귀분석과 퍼지 기법을 이용한 산사태 취약성 지도작성: 보은군을 대상으로)

  • Al-Mamun, Al-Mamun;JANG, Dong-Ho
    • Journal of The Geomorphological Association of Korea
    • /
    • v.23 no.2
    • /
    • pp.109-125
    • /
    • 2016
  • This study aims to identify the landslide susceptible zones of Boeun area and provide reliable landslide susceptibility maps by applying different modeling methods. Aerial photographs and field survey on the Boeun area identified landslide inventory map that consists of 388 landslide locations. A total ofseven landslide causative factors (elevation, slope angle, slope aspect, geology, soil, forest and land-use) were extracted from the database and then converted into raster. Landslide causative factors were provided to investigate about the spatial relationship between each factor and landslide occurrence by using fuzzy set and logistic regression model. Fuzzy membership value and logistic regression coefficient were employed to determine each factor's rating for landslide susceptibility mapping. Then, the landslide susceptibility maps were compared and validated by cross validation technique. In the cross validation process, 50% of observed landslides were selected randomly by Excel and two success rate curves (SRC) were generated for each landslide susceptibility map. The result demonstrates the 84.34% and 83.29% accuracy ratio for logistic regression model and fuzzy set model respectively. It means that both models were very reliable and reasonable methods for landslide susceptibility analysis.

Power Failure Sensitivity Analysis via Grouped L1/2 Sparsity Constrained Logistic Regression

  • Li, Baoshu;Zhou, Xin;Dong, Ping
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.8
    • /
    • pp.3086-3101
    • /
    • 2021
  • To supply precise marketing and differentiated service for the electric power service department, it is very important to predict the customers with high sensitivity of electric power failure. To solve this problem, we propose a novel grouped 𝑙1/2 sparsity constrained logistic regression method for sensitivity assessment of electric power failure. Different from the 𝑙1 norm and k-support norm, the proposed grouped 𝑙1/2 sparsity constrained logistic regression method simultaneously imposes the inter-class information and tighter approximation to the nonconvex 𝑙0 sparsity to exploit multiple correlated attributions for prediction. Firstly, the attributes or factors for predicting the customer sensitivity of power failure are selected from customer sheets, such as customer information, electric consuming information, electrical bill, 95598 work sheet, power failure events, etc. Secondly, all these samples with attributes are clustered into several categories, and samples in the same category are assumed to be sharing similar properties. Then, 𝑙1/2 norm constrained logistic regression model is built to predict the customer's sensitivity of power failure. Alternating direction of multipliers (ADMM) algorithm is finally employed to solve the problem by splitting it into several sub-problems effectively. Experimental results on power electrical dataset with about one million customer data from a province validate that the proposed method has a good prediction accuracy.

Logistic Regression Classification by Principal Component Selection

  • Kim, Kiho;Lee, Seokho
    • Communications for Statistical Applications and Methods
    • /
    • v.21 no.1
    • /
    • pp.61-68
    • /
    • 2014
  • We propose binary classification methods by modifying logistic regression classification. We use variable selection procedures instead of original variables to select the principal components. We describe the resulting classifiers and discuss their properties. The performance of our proposals are illustrated numerically and compared with other existing classification methods using synthetic and real datasets.

Model-Based Prediction of the Population Proportion and Distribution Function Using a Logistic Regression

  • Park, Min-Gue
    • Communications for Statistical Applications and Methods
    • /
    • v.15 no.5
    • /
    • pp.783-791
    • /
    • 2008
  • Estimation procedure of the finite population proportion and distribution function is considered. Based on a logistic regression model, an approximately model- optimal estimator is defined and conditions for the estimator to be design-consistent are given. Simulation study shows that the model-optimal design-consistent estimator defined under a logistic regression model performs well in estimating the finite population distribution function.

Fuzzy c-Logistic Regression Model in the Presence of Noise Cluster

  • Alanzado, Arnold C.;Miyamoto, Sadaaki
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2003.09a
    • /
    • pp.431-434
    • /
    • 2003
  • In this paper we introduce a modified objective function for fuzzy c-means clustering with logistic regression model in the presence of noise cluster. The logistic regression model is commonly used to describe the effect of one or several explanatory variables on a binary response variable. In real application there is very often no sharp boundary between clusters so that fuzzy clustering is often better suited for the data.

  • PDF

Binary Forecast of Heavy Snow Using Statistical Models

  • Sohn, Keon-Tae
    • Communications for Statistical Applications and Methods
    • /
    • v.13 no.2
    • /
    • pp.369-378
    • /
    • 2006
  • This Study focuses on the binary forecast of occurrence of heavy snow in Honam area based on the MOS(model output statistic) method. For our study daily amount of snow cover at 17 stations during the cold season (November to March) in 2001 to 2005 and Corresponding 45 RDAPS outputs are used. Logistic regression model and neural networks are applied to predict the probability of occurrence of Heavy snow. Based on the distribution of estimated probabilities, optimal thresholds are determined via true shill score. According to the results of comparison the logistic regression model is recommended.

On the Logistic Regression Diagnostics

  • Kim, Choong-Rak;Jeong, Kwang-Mo
    • Journal of the Korean Statistical Society
    • /
    • v.22 no.1
    • /
    • pp.27-37
    • /
    • 1993
  • Since the analytic expression for a diagnostic in the logistic regression model is not available, one-step estimation is often used by a case-deletion point of view. In this paper, infinitesimal perturbation approach is used, and it is shown that the scale transformation of infinitesimal perturbation approach is eventually equal to the weighted perturbation of local influence approach and the replacement measure. Also, multiple cases deletion for the masking effect is considered.

  • PDF

Multicollinarity in Logistic Regression

  • Jong-Han lee;Myung-Hoe Huh
    • Communications for Statistical Applications and Methods
    • /
    • v.2 no.2
    • /
    • pp.303-309
    • /
    • 1995
  • Many measures to detect multicollinearity in linear regression have been proposed in statistics and numerical analysis literature. Among them, condition number and variance inflation factor(VIF) are most popular. In this study, we give new interpretations of condition number and VIF in linear regression, using geometry on the explanatory space. In the same line, we derive natural measures of condition number and VIF for logistic regression. These computer intensive measures can be easily extended to evaluate multicollinearity in generalized linear models.

  • PDF