• Title/Abstract/Keywords: logistic model (로지스틱모형)

A Placing-Probability Model for Horse Racing Using Logistic Regression (로지스틱 회귀를 통한 경마의 입상확률모형)

  • 유선경;박흥선
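    • The Korean Journal of Applied Statistics / v.13 no.1 / pp.35-43 / 2000
  • Using actual Korean horse-racing data, this study examines the factors that affect the probability of placing under the place-bet (연승식) system and derives a placing-probability model from them. Abroad, horse racing has been approached statistically from several angles, but the existing methods rest on placing probabilities implied by the payouts. This study instead attempts a logistic regression approach built on information that is easy to obtain at the racetrack.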

A Comparative Experiment of Software Defect Prediction Models using Object Oriented Metrics (객체지향 메트릭을 이용한 결함 예측 모형의 실험적 비교)

  • Kim, Yun-Kyu;Kim, Tae-Yeon;Chae, Heung-Seok
    • Journal of KIISE:Computing Practices and Letters / v.15 no.8 / pp.596-600 / 2009
  • To support efficient management of software verification and validation activities, many defect prediction models based on object-oriented metrics have been proposed. They usually adopt logistic regression analysis and report a prediction correctness of about 60 to 70%. We performed a similar experiment with Eclipse 3.3 to check their prediction effectiveness. However, the result shows a correctness of about 40%, which is much lower than the original results. We also found that univariate logistic regression analysis produces better results than multivariate logistic regression analysis.

The Comparative Software Reliability Cost Model of Considering Shape Parameter (형상모수를 고려한 소프트웨어 신뢰성 비용 모형에 관한 비교 연구)

  • Kim, Kyung-Soo;Kim, Hee-Cheul
    • Journal of Digital Convergence / v.12 no.3 / pp.219-226 / 2014
  • This study examines a software reliability cost model that takes the shape parameter into account, based on life distributions arising from the software product testing process. The shape parameter is studied for the Erlang and log-logistic models, both widely used in the reliability field. The software failure model is a finite-failure non-homogeneous Poisson process model, and its parameters are estimated by maximum likelihood. Comparing the cost model based on the Erlang distribution with the one based on the log-logistic distribution, the Erlang model could predict the optimal software release time whereas the log-logistic model could not, so the Erlang distribution appears more effective than the log-logistic distribution. This research is expected to help software developers gauge software development cost to some extent.

Study on Detection Technique for Cochlodinium polykrikoides Red tide using Logistic Regression Model and Decision Tree Model (로지스틱 회귀모형과 의사결정나무 모형을 이용한 Cochlodinium polykrikoides 적조 탐지 기법 연구)

  • Bak, Su-Ho;Kim, Heung-Min;Kim, Bum-Kyu;Hwang, Do-Hyun;Unuzaya, Enkhjargal;Yoon, Hong-Joo
    • The Journal of the Korea institute of electronic communication sciences / v.13 no.4 / pp.777-786 / 2018
  • This study proposes a new method to detect Cochlodinium polykrikoides in satellite images using logistic regression and a decision tree. We used 918 spectral profiles extracted from red tide, clear water, and turbid water as the data set. 70% of the data set was extracted and used for model training, and the classification accuracy of each model was evaluated on the remaining 30%. In this evaluation, the logistic regression model showed about 97% classification accuracy and the decision tree model about 86%.
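
The 70/30 evaluation protocol described above can be sketched in a few lines. This is only an illustration under stated assumptions: the data are synthetic (one made-up feature, two classes instead of the three water types), and the function names `split_70_30`, `fit_logistic`, and `accuracy` are hypothetical, not taken from the paper.

```python
import math
import random

def split_70_30(samples, train_frac=0.7, seed=0):
    """Shuffle a copy of the samples and cut it into train/test parts."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    n_train = round(train_frac * len(shuffled))
    return shuffled[:n_train], shuffled[n_train:]

def fit_logistic(train, lr=0.1, epochs=500):
    """Fit p(y=1 | x) = sigmoid(w*x + b) by batch gradient descent."""
    w, b = 0.0, 0.0
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, y in train:
            p = 1.0 / (1.0 + math.exp(-(w * x + b)))
            grad_w += (p - y) * x
            grad_b += p - y
        w -= lr * grad_w / len(train)
        b -= lr * grad_b / len(train)
    return w, b

def accuracy(model, test):
    """Share of held-out samples whose predicted class matches the label."""
    w, b = model
    hits = sum((w * x + b > 0) == (y == 1) for x, y in test)
    return hits / len(test)

# Synthetic stand-in for the 918 spectral profiles (assumption, not real data).
rng = random.Random(42)
data = [(rng.gauss(-2.0, 1.0), 0) for _ in range(459)]
data += [(rng.gauss(2.0, 1.0), 1) for _ in range(459)]

train, test = split_70_30(data)
model = fit_logistic(train)
test_accuracy = accuracy(model, test)
```

Only the held-out 30% enters `test_accuracy`, which is what makes the reported figures an estimate of out-of-sample performance rather than of fit to the training profiles.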

Principal Components Regression in Logistic Model (로지스틱모형에서의 주성분회귀)

  • Kim, Bu-Yong;Kahng, Myung-Wook
    • The Korean Journal of Applied Statistics / v.21 no.4 / pp.571-580 / 2008
  • Logistic regression analysis is widely used in customer relationship management and credit risk management. It is well known that maximum likelihood estimation is not appropriate when multicollinearity exists among the regressors. We therefore propose logistic principal components regression to deal with the multicollinearity problem. In particular, a new method is suggested for selecting the proper principal components. The selection is based on the condition index instead of the eigenvalue. When a condition index is larger than the upper cutoff limit, the principal component corresponding to that index is removed from the estimation. When a condition index lies between the upper and lower limits, hypothesis tests are applied sequentially to decide whether to eliminate the principal component. The limits are obtained from a linear model constructed on the basis of conjoint analysis. The proposed method is evaluated by the variance of the estimates and the correct classification rate. The results indicate that the proposed method is superior to the existing method in terms of efficiency and goodness of fit.
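
As a rough illustration of the quantity the selection rule is built on, the sketch below computes condition indices for the simplest case of two standardized regressors, using the common definition sqrt(largest eigenvalue / eigenvalue) on the correlation matrix. The function name `condition_indices` is hypothetical, and the paper's upper and lower cutoff limits are not reproduced here because the abstract does not state them.

```python
import math

def condition_indices(r):
    """Condition indices of the 2x2 correlation matrix [[1, r], [r, 1]].

    Its eigenvalues are 1 + r and 1 - r, so no numerical solver is needed.
    Requires |r| < 1; at |r| = 1 the smaller eigenvalue is zero.
    """
    if not -1.0 < r < 1.0:
        raise ValueError("r must lie strictly between -1 and 1")
    eigenvalues = sorted([1.0 + r, 1.0 - r], reverse=True)
    largest = eigenvalues[0]
    return [math.sqrt(largest / lam) for lam in eigenvalues]

weak = condition_indices(0.0)     # uncorrelated regressors
strong = condition_indices(0.98)  # nearly collinear regressors
```

The index of the last component grows without bound as the correlation approaches 1, which is why a large condition index flags the principal component that carries the near-collinearity.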

Log-density Ratio with Two Predictors in a Logistic Regression Model (로지스틱 회귀모형에서 이변량 정규분포에 근거한 로그-밀도비)

  • Kahng, Myung Wook;Yoon, Jae Eun
    • The Korean Journal of Applied Statistics / v.26 no.1 / pp.141-149 / 2013
  • We present methods for studying the log-density ratio, which enables selection of the predictors and of the form in which they enter the logistic regression model. Under bivariate normal distributional assumptions, we investigate the form of the log-density ratio as a function of two predictors. If the two covariance matrices are equal, the cross-product and quadratic terms are not needed. If the variables are uncorrelated, the cross-product term is not needed, but the linear and quadratic terms still are. We also explore other conditions under which the cross-product and quadratic terms are not needed in the logistic regression model.
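
The two cases stated above can be checked directly from the normal densities. The notation here is an assumption chosen for illustration, not the authors' own: $f_k$ is the bivariate normal density of $x = (x_1, x_2)^{\top}$ in group $k \in \{0, 1\}$, with mean $\mu_k$ and covariance $\Sigma_k$.

```latex
% General form: quadratic in x through \Sigma_1^{-1} - \Sigma_0^{-1}
\log\frac{f_1(x)}{f_0(x)}
  = -\tfrac12 (x-\mu_1)^{\top}\Sigma_1^{-1}(x-\mu_1)
    +\tfrac12 (x-\mu_0)^{\top}\Sigma_0^{-1}(x-\mu_0)
    -\tfrac12 \log\frac{|\Sigma_1|}{|\Sigma_0|}

% Equal covariances, \Sigma_1 = \Sigma_0 = \Sigma: the quadratic part cancels
\log\frac{f_1(x)}{f_0(x)}
  = (\mu_1-\mu_0)^{\top}\Sigma^{-1}x
    -\tfrac12\left(\mu_1^{\top}\Sigma^{-1}\mu_1-\mu_0^{\top}\Sigma^{-1}\mu_0\right)

% Uncorrelated predictors, \Sigma_k = \mathrm{diag}(\sigma_{k1}^2, \sigma_{k2}^2):
% \Sigma_1^{-1} - \Sigma_0^{-1} is diagonal, so there is no x_1 x_2 term
-\tfrac12\, x^{\top}\left(\Sigma_1^{-1}-\Sigma_0^{-1}\right)x
  = -\tfrac12\sum_{j=1}^{2}
      \left(\frac{1}{\sigma_{1j}^{2}}-\frac{1}{\sigma_{0j}^{2}}\right)x_j^{2}
```

Because the logit of the posterior probability equals the log prior odds plus this log-density ratio, the terms that survive in each case are the terms the logistic regression needs.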

Maximum likelihood estimation of Logistic random effects model (로지스틱 임의선형 혼합모형의 최대우도 추정법)

  • Kim, Minah;Kyung, Minjung
    • The Korean Journal of Applied Statistics / v.30 no.6 / pp.957-981 / 2017
  • A generalized linear mixed model is an extension of the generalized linear model that allows random effects and provides flexibility in developing a suitable model when observations are correlated or when other underlying phenomena contribute to the resulting variability. We describe maximum likelihood estimation methods for logistic regression models that include random effects: the Laplace approximation, Gauss-Hermite quadrature, adaptive Gauss-Hermite quadrature, and pseudo-likelihood. The methods are applied to a social science problem by analyzing the effect of mental health and life satisfaction on volunteer activities using Korean welfare panel data. We also observe that including random effects in the model leads to improved analyses with more reasonable inferences.
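
Of the four methods listed, ordinary (non-adaptive) Gauss-Hermite quadrature is the easiest to show in miniature. The sketch below assumes a random-intercept logistic model with one covariate and uses a 3-point rule only for brevity; the model form, the parameter names `beta0`, `beta1`, `sigma`, and the function `cluster_likelihood` are illustrative assumptions, not the authors' implementation.

```python
import math

# 3-point Gauss-Hermite rule for integrals of exp(-t^2) * g(t).
GH_NODES = (-math.sqrt(1.5), 0.0, math.sqrt(1.5))
GH_WEIGHTS = (
    math.sqrt(math.pi) / 6.0,
    2.0 * math.sqrt(math.pi) / 3.0,
    math.sqrt(math.pi) / 6.0,
)

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def cluster_likelihood(ys, xs, beta0, beta1, sigma):
    """Marginal likelihood of one cluster, integrating out the random intercept.

    Assumed model: y_ij | b_i ~ Bernoulli(sigmoid(beta0 + beta1*x_ij + b_i)),
    with b_i ~ N(0, sigma^2).
    """
    total = 0.0
    for t, w in zip(GH_NODES, GH_WEIGHTS):
        b = math.sqrt(2.0) * sigma * t  # change of variables b = sqrt(2)*sigma*t
        conditional = 1.0
        for y, x in zip(ys, xs):
            p = sigmoid(beta0 + beta1 * x + b)
            conditional *= p if y == 1 else 1.0 - p
        total += w * conditional
    return total / math.sqrt(math.pi)

# One hypothetical cluster with three binary responses.
lik = cluster_likelihood([1, 0, 1], [0.2, -0.5, 1.0], beta0=0.1, beta1=0.8, sigma=1.2)
```

A full fit would sum the log of this quantity over clusters and maximize it over the parameters. More quadrature points, or the adaptive variant that recentres the nodes per cluster, are typically used when the random-effect variance is large.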

Developing the high-risk drinking predictive model in Korea using the data mining technique (데이터마이닝 기법을 활용한 한국인의 고위험 음주 예측모형 개발 연구)

  • Park, Il-Su;Han, Jun-Tae
    • Journal of the Korean Data and Information Science Society / v.28 no.6 / pp.1337-1348 / 2017
  • In this paper, we develop a predictive model of high-risk drinking in Korea using cross-sectional data from the Korea Community Health Survey (2014). We apply three data mining techniques: logistic regression analysis, decision tree analysis, and neural network analysis. The logistic regression results showed that men in their forties had a high risk and that the risk for office workers and sales workers was high. In particular, current smokers had a higher risk of high-risk drinking. Among the three models, neural network analysis and logistic regression performed best in terms of AUROC (area under the receiver operating characteristic curve). The predictive model developed in this study, together with the method for selecting the intensive high-risk drinking group, can serve as a basis for more effective health care services such as hazardous-drinking prevention education and improved drinking programs.
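
For readers unfamiliar with the comparison criterion, AUROC can be computed as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative one, with ties counted as one half. The sketch below uses that pairwise definition; the function name `auroc` and the example scores are made up for illustration.

```python
def auroc(labels, scores):
    """AUROC via pairwise comparison of positive and negative scores."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # ties count as half a win
    return wins / (len(positives) * len(negatives))

# Hypothetical labels (1 = high-risk drinker) and model scores.
example = auroc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
```

Because it depends only on the ranking of the scores, AUROC lets models with differently calibrated outputs, such as a decision tree and a neural network, be compared on the same footing.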

Development of Statistical Models for Ordinal Heavy-Snowfall Forecasting (순서형 대설 예보를 위한 통계 모형 개발)

  • Son, Geon-Tae;Lee, Jeong-Hyeong;Ryu, Chan-Su
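    • Proceedings of the Korean Statistical Society Conference / 2005.11a / pp.101-105 / 2005
  • We developed statistical models for forecasting heavy-snowfall special reports in the Honam region. The data are ordinal, divided into three categories according to the daily fresh snowfall amount (0: no event, 1: heavy snow advisory, 2: heavy snow warning). Two statistical models were considered: a multi-category logistic regression model and a neural network model. The prediction models were developed by applying MOS (model output statistics), a dynamical-statistical technique that uses numerical model output, to estimate the relationship between accumulated numerical model forecasts and observations as a statistical model equation. Cluster analysis was used to separate the training data from the validation data, and thresholds were considered when generating the forecast values.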
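
One common way to model three ordered categories is a cumulative-logit (proportional-odds) model, followed by thresholds on the exceedance probabilities to issue a forecast. The abstract does not say which multi-category form or threshold rule the authors used, so everything below (the model form, the cutpoints, the threshold values, and the function names) is an assumption for illustration only.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def ordinal_probs(eta, cutpoints):
    """Category probabilities under P(Y <= j) = sigmoid(c_j - eta).

    `eta` is the linear predictor; `cutpoints` must be increasing.
    """
    cumulative = [sigmoid(c - eta) for c in cutpoints] + [1.0]
    probs = [cumulative[0]]
    for j in range(1, len(cumulative)):
        probs.append(cumulative[j] - cumulative[j - 1])
    return probs

def forecast_category(probs, thresholds):
    """Highest category j >= 1 with P(Y >= j) at or above its threshold, else 0."""
    forecast = 0
    for j, threshold in enumerate(thresholds, start=1):
        if sum(probs[j:]) >= threshold:
            forecast = j
    return forecast

# Hypothetical values: 0 = no event, 1 = advisory, 2 = warning.
probs = ordinal_probs(eta=0.0, cutpoints=[0.0, 2.0])
issued = forecast_category(probs, thresholds=[0.4, 0.3])
```

Lowering a threshold makes the corresponding advisory or warning easier to issue, trading more false alarms for fewer missed events, which is the usual reason for tuning thresholds on rare categories.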

Comparison of Multinomial Logit and Logistic Regression on Disability Pensioners' Characteristic (다범주 자료의 다항로짓 모형과 로지스틱 회귀모형 비교;장애연금 특성분석 중심으로)

  • Kim, Mi-Jung
    • The Korean Journal of Applied Statistics / v.21 no.4 / pp.589-602 / 2008
  • This article studies the characteristics of disability pensioners using multinomial logit and logistic regression models. Seven factors are examined to determine whether each is reflected in the degree of disability in the disability pension. By combining the multinomial logit and logistic regression models, the effectiveness and characteristics of the seven factors with respect to the degree of disability are investigated. The results show that all seven factors are significant for the degree of disability. Five of them (age, sex, type of coverage, type of category, and insured duration) show a trend in the degree of disability, while the other two (cause of disability and class of standard monthly income) do not. The results of these analyses may be useful for disability pension management.