• Title/Summary/Keyword: 조건 로지스틱 회귀모형

Search Result 19, Processing Time 0.026 seconds

로지스틱 회귀모형을 분석하기 위한 SPSS, SAS, STATA의 비교분석

  • Kim, Sun-Gwi;Jeong, Dong-Bin
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2002.11a
    • /
    • pp.287-292
    • /
    • 2002
  • 최근 여러 분야에서 로지스틱 회귀에 대한 필요성과 그 응용이 급증하면서 이를 분석하기 위한 통계패키지가 많이 개발되어 사용되고 있다. 이 논문에서는 자료의 유형에 따라 활용할 수 있는 여러 형태의 로지스틱 회귀모형을 간단히 살펴보고, SPSS, SAS, STATA, MINITAB과 같은 통계패키지를 사용하여 로지스틱 회귀모형에 적용할 때 각각 다룰 수 있는 범위와 그 특징에 대해 다룬다.

  • PDF

Log-density Ratio with Two Predictors in a Logistic Regression Model (로지스틱 회귀모형에서 이변량 정규분포에 근거한 로그-밀도비)

  • Kahng, Myung Wook;Yoon, Jae Eun
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.1
    • /
    • pp.141-149
    • /
    • 2013
  • We present methods for studying the log-density ratio that enables the selection of the predictors and the form to be included in the logistic regression model. Under bivariate normal distributional assumptions, we investigate the form of the log-density ratio as a function of two predictors. If two covariance matrices are equal, then the crossproduct and quadratic terms are not needed. If the variables are uncorrelated, we do not need the crossproduct terms, but we still need the linear and quadratic terms. We also explore other conditions in which the crossproduct and quadratic terms are not needed in the logistic regression model.

데이터마이닝을 위한 혼합 데이터베이스에서의 속성선택

  • Cha, Un-Ok;Heo, Mun-Yeol
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2003.05a
    • /
    • pp.103-108
    • /
    • 2003
  • 데이터마이닝을 위한 대용량 데이터베이스를 축소시키는 방법 중에 속성선택 방법이 많이 사용되고 있다. 본 논문에서는 세 가지 속성선택 방법을 사용하여 조건속성 수를 60%이상 축소시켜 결정나무와 로지스틱 회귀모형에 적용시켜보고 이들의 효율을 비교해 본다. 세 가지 속성선택 방법은 MDI, 정보획득, ReliefF 방법이다. 결정나무 방법은 QUEST, CART, C4.5를 사용하였다. 속성선택 방법들의 분류 정확성은 UCI 데이터베이스에 주어진 Credit 승인 데이터베이스와 German Credit 데이터베이스를 사용하여 10층-교차확인 방법으로 평가하였다.

  • PDF

Development of a Logistic Regression Model for Analyzing Site Characteristics of Tombs Surrounding Expressway in Aerial Photographs (항공사진에 나타난 고속국도 주변 묘지의 입지 분석을 위한 로지스틱 회귀모형의 개발)

  • Han, Hee;Seol, A-Ra;Chung, JooSang
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.11 no.4
    • /
    • pp.193-202
    • /
    • 2008
  • The objectives of this study are to analyze the spatial site characteristics of existing tombs and the change in the pattern of spatial distributions of tombs over time. The spatial distributions of tombs located in Honam province along the Honam expressway were investigated by interpreting digital aerial photographs taken in two different points of time; 1990 and 2000. According to the results of the study, the tombs newly observed in 2000 photos were located closer to roads and villages than those found in the photos of 1990. This is a finding indicating that the accessibility of tombs has been more important consideration in determining the location of tomb sites. Also found were the gentle slopes of southern aspects to be favored as tomb sites. Based on the data sets of tombs locations and their topographic site characteristics, the probability function of tombs appearance in the study area was derived using the logistic regression analysis technique. As a result, tomb sites were classified as 74.7% by logistic regression. All of six input factors (elevation, slope, aspect, distance from the roads, the town and the stream, respectively) affected the probability of tombs appearance significantly.

  • PDF

Life Estimation of Elevator Wire Ropes Using Accelerated Degradation Test Data (가속열화시험 데이터를 활용한 엘리베이터 와이어로프 수명 예측)

  • Kim, Seung Ho;Kim, Sang Boo;Kim, Sung Ho;Ham, Sung Hoon
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.41 no.10
    • /
    • pp.997-1004
    • /
    • 2017
  • The life of elevator wire ropes is one of the most important characteristics of an elevator, which is closely related to the safety of users and its maintenance policy. It is not cost effective to measure the lifetime of elevator wire ropes during their use. In this study, the life estimation of elevator wire ropes (8x19W-IWRC) is considered using accelerated degradation test data. A bending fatigue tester is used to perform the accelerated degradation tests, incorporating the acceleration factor of tensile force. Assuming that the life of wire ropes is log-normally distributed, two life estimation methods are suggested and their results are compared. The first method estimates the life of wire ropes utilizing the accelerated life model with pseudo lives obtained from a linear regression model. The second method estimates the life using a logistic model based on failure probability.

Major Factors Influencing Landslide Occurrence along a Forest Road Determined Using Structural Equation Model Analysis and Logistic Regression Analysis (구조방정식과 로지스틱 회귀분석을 이용한 임도비탈면 산사태의 주요 영향인자 선정)

  • Kim, Hyeong-Sin;Moon, Seong-Woo;Seo, Yong-Seok
    • The Journal of Engineering Geology
    • /
    • v.32 no.4
    • /
    • pp.585-596
    • /
    • 2022
  • This study determined major factors influencing landslide occurrence along a forest road near Sangsan village, Sancheok-myeon, Chungju-si, Chungcheongbuk-do, South Korea. Within a 2 km radius of the study area, landslides occur intensively during periods of heavy rainfall (August 2020). This makes study of the area advantageous, as it allows examination of the influence of only geological and tomographic factors while excluding the effects of rainfall and vegetation. Data for 82 locations (37 experiencing landslides and 45 not) were obtained from geological surveys, laboratory tests, and geo-spatial analysis. After some data preprocessing (e.g., error filtering, minimum-maximum normalization, and multicollinearity), structural equation model (SEM) and logistic regression (LR) analyses were conducted. These showed the regolith thickness, porosity, and saturated unit weight to be the factors most influential of landslide risk in the study area. The sums of the influence magnitudes of these factors are 71% in SEM and 83% in LR.

Development of Ingrowth Estimation Equations for Pinus densiflora in Korea Derived from National Forest Inventory Data (국가산림자원조사 자료를 이용한 소나무의 진계생장 추정식 개발)

  • Moon, Ga Hyun;Yim, Jong Su;Shin, Man Yong
    • Journal of Korean Society of Forest Science
    • /
    • v.107 no.4
    • /
    • pp.402-411
    • /
    • 2018
  • This study was conducted to develop ingrowth estimation equations on Pinus densiflora found in Gangwon Province and in the center of Korean Peninsula, based on the National Forest Inventory (NFI)'s permanent sampling plot data. For this study, identical sampling plots in $5^{th}$ and $6^{th}$ NFI data were collected in order to identify ingrowth amounts for the last 5 years. Following two-stage approaches in developing the ingrowth estimation equations, the logistic regression model was used in the first stage to estimate the ingrowth probability. In the second stage, regression analysis on sampling plots with ingrowth occurrence was used to estimate the ingrowth amount. A candidate model was finally selected as an optimal model after a verification based on three evaluation statistics which include mean difference (MD), standard deviation of difference (SDD) and standard error of difference (SED). In results, a logistic regression model based on the number of sampling plot which did not result in ingrowth (model VI), was selected for an ingrowth probability estimation equation and exponential function including the species composition (SC) variable was optimal for an ingrowth estimation equation (model VII). The ingrowth estimation equations developed in this study also evaluated the estimation ability in various forest stand conditions, and no particular issue in fitness or applicability was observed.

Comparisons of the corporate credit rating model power under various conditions (기준값 변화에 따른 기업신용평가모형 성능 비교)

  • Ha, Jeongcheol;Kim, Soojin
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.6
    • /
    • pp.1207-1216
    • /
    • 2015
  • This study aims to compare the model power in developing corporate credit rating models and to suggest a good way to build models based on the characteristic of data. Among many measurement methods, AR is used to measure the model power under various conditions. SAS/MACRO is in use for similar repetitions to reduce time to build models under several combination of conditions. A corporate credit rating model is composed of two sub-models; a credit scoring model and a default prediction model. We verify that the latter performs better than the former under various conditions. From the result of size comparisons, models of large size corporate are more powerful and more meaningful in financial viewpoint than those of small size corporate. As a corporate size gets smaller, the gap between sub-models becomes huge and the effect of outliers becomes serious.

Mandatory Lane-changing Behavior under the Congested Work Zone Traffic Operation (정체상황에서의 강제 차로변경행태 분석 (도로공사로 인한 차로폐쇄 시뮬레이션 기반))

  • Kang, Kyeong-Pyo;Lee, Kwang-Hoon
    • Journal of Korean Society of Transportation
    • /
    • v.26 no.1
    • /
    • pp.215-223
    • /
    • 2008
  • Due partly to lack of actual lane-changing data and partly to few studies on simulation functions to consider the lane-changing behavior, it may result in significant difference between simulation-based and real conditions. The objectives of this study are to estimate the set of mandatory lane-changing models and to analyze their features, depending on the merge control strategies under the lane-closed work zone operations. To achieve them, first, the elaborated calibration is required to simulate the mandatory lane-changing behaviors with the actual field data. Second, one can estimate their models with the logistic regression models, to obtain traffic variables as well as the lane-changing frequencies under the various levels of work zone traffic conditions. As a result, one can state that the well-calibrated simulation has the potential to properly reflect the target mandatory lane-changing behaviors. In addition, it should be mentioned that the set of proposed models is not practicable but preliminary result needed to identify the relations between the actual traffic conditions and lane-changing maneuvers and to develop their practical models for the actual applications.

신경망 모형의 초기가중치 최적화 방법에 관한 연구

  • Jo, Yong-Jun;Lee, Yong-Gu
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2003.05a
    • /
    • pp.19-24
    • /
    • 2003
  • 신경망은 적용 다양성과 제약조건의 최소성, 강력한 예측성, 범용성, 근사성 등 많은 장점을 지니고 있으나 초기 가중치의 할당에 따라 모델 생성의 Performance와 예측의 결과가 달라지게 되는 단점을 지니고 있다. 이런 신경망의 초기 가중치에 따른 단점을 보안하기 위해 통계적 알고리즘의 접목을 통해 Hybrid된 신경망 보완 알고리즘을 제시하고자 하였다. 논문을 위한 기본 가정으로 신경망의 가장 기본인 SLP 알고리즘을 바탕으로 활성함수에 가장 일반적으로 사용되는 Sigmoid 활성함수를 이용하였을 때, 초기 가중치로 기존의 임의 난수 생성 방식이 아닌 통계적 로지스틱 회귀분석의 계수값(mle)을 제시하여 이를 초기치로 사용한 경우와 그렇지 않은 경우의 예측 정확성과 수렴의 Performance정도를 비교하여 가장 효과적인 초기치 방법을 제시하고자 하였다.

  • PDF