• Title/Summary/Keyword: 로지스틱회귀

Search Result 1,750, Processing Time 0.026 seconds

Analysis-based Pedestrian Traffic Incident Analysis Based on Logistic Regression (로지스틱 회귀분석 기반 노인 보행자 교통사고 요인 분석)

  • Siwon Kim;Jeongwon Gil;Jaekyung Kwon;Jae seong Hwang;Choul ki Lee
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.23 no.2
    • /
    • pp.15-31
    • /
    • 2024
  • The characteristics of elderly traffic accidents were identified by reflecting the situation of the elderly population in Korea, which is entering an ultra-aging society, and the relationship between independent and dependent variables was analyzed by classifying traffic accidents of serious or higher and traffic accidents of minor or lower in elderly pedestrian traffic accidents using binomial variables. Data collection, processing, and variable selection were performed by acquiring data from the elderly pedestrian traffic accident analysis system (TAAS) for the past 10 years (from 13 to 22 years), and basic statistics and analysis by accident factors were performed. A total of 15 influencing variables were derived by applying the logistic regression model, and the influencing variables that have the greatest influence on the probability of a traffic accident involving severe or higher elderly pedestrians were derived. After that, statistical tests were performed to analyze the suitability of the logistic model, and a method for predicting the probability of a traffic accident according to the construction of a prediction model was presented.

Detecting Improper Sentences in a News Article Using Text Mining (텍스트 마이닝을 이용한 기사 내 부적합 문단 검출 시스템)

  • Kim, Kyu-Wan;Sin, Hyun-Ju;Kim, Seon-Jin;Lee, Hyun Ah
    • Annual Conference on Human and Language Technology
    • /
    • 2017.10a
    • /
    • pp.294-297
    • /
    • 2017
  • SNS와 스마트기기의 발전으로 온라인을 통한 뉴스 배포가 용이해지면서 악의적으로 조작된 뉴스가 급속도로 생성되어 확산되고 있다. 뉴스 조작은 다양한 형태로 이루어지는데, 이 중에서 정상적인 기사 내에 광고나 낚시성 내용을 포함시켜 독자가 의도하지 않은 정보에 노출되게 하는 형태는 독자가 해당 내용을 진짜 뉴스로 받아들이기 쉽다. 본 논문에서는 뉴스 기사 내에 포함된 문단 중에서 부적합한 문단이 포함되었는지를 판정하기 위한 방법을 제안한다. 제안하는 방식에서는 자연어 처리에 유용한 Convolutional Neural Network(CNN)모델 중 Word2Vec과 tf-idf 알고리즘, 로지스틱 회귀를 함께 이용하여 뉴스 부적합 문단을 검출한다. 본 시스템에서는 로지스틱 회귀를 이용하여 문단의 카테고리를 분류하여 본문의 카테고리 분포도를 계산하고 Word2Vec을 이용하여 문단간의 유사도를 계산한 결과에 가중치를 부여하여 부적합 문단을 검출한다.

  • PDF

The Rating of Korean Basketball League Teams in 2006-2007 Season: Taking Account of Home-Court Advantage (홈팀의 이점을 고려한 KBL 2006-2007 시즌 경기력 평가)

  • Lee, Seung-Chun;Byun, Jong-Seok
    • Communications for Statistical Applications and Methods
    • /
    • v.15 no.5
    • /
    • pp.687-695
    • /
    • 2008
  • It is widely known that the home advantage plays an important factor for determining victory or defeat in sport leagues. Thus a ranking system of sport league should take account of the home advantage as a key factor. Various statistical models are studied to rate the Korean Basketball league teams in 2006-2007 season. Among them, the model equation provided by Harville and Smith (1994) is useful for constructing two ranking systems. Both systems give quite reasonable quantifications of the team's ability and the home advantage.

Inferential Problems in Bayesian Logistic Regression Models (베이지안 로지스틱 회귀모형에서의 추론에 대한 연구)

  • Hwang, Jin-Soo;Kang, Sung-Chan
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.6
    • /
    • pp.1149-1160
    • /
    • 2011
  • Model selection and hypothesis testing problems in Bayesian inference are still debated between scholars. Bayesian factors traditionally used as a criterion in Bayesian hypothesis testing and model selection, are easy to understand but sometimes hard to compute. In addition, there are other model selection criterions such as DIC(Deviance Information Criterion) by Spiegelhalter et al. (2002) and Bayesian P-values for testing. In this paper, we briefly introduce the Bayesian hypothesis testing and model selection procedure. In addition we have applied a Bayesian inference to Swiss banknote data by a fitting logistic regression model and computing several test statistics to see if they provide consistent results.

Network Identification of Major Risk Factor Associated with Delirium by Bayesian Network (베이지안 네트워크를 활용한 정신장애 질병 섬망(delirium)의 주요 요인 네트워크 규명)

  • Lee, Jea-Young;Choi, Young-Jin
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.2
    • /
    • pp.323-333
    • /
    • 2011
  • We analyzed using logistic to find factors with a mental disorder because logistic is the most efficient way assess risk factors. In this paper, we applied data mining techniques that are logistic, neural network, c5.0, cart and Bayesian network to delirium data. The Bayesian network method was chosen as the best model. When delirium data were applied to the Bayesian network, we determined the risk factors associated with delirium as well as identified the network between the risk factors.

Various Graphical Methods for Assessing a Logistic Regression Model (로지스틱회귀모형의 평가를 위한 그래픽적 방법)

  • Kim, Kyung Jin;Kahng, Myung Wook
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.6
    • /
    • pp.1191-1208
    • /
    • 2015
  • Most statistical methods are dependent on the summary statistic. However, with graphical approaches, it is easier to identify the characteristics of the data and detect information that cannot be obtained by the summary statistic. We present various graphical methods to assess the adequacy of models in logistic regression that include checking log-density ratio, structural dimension, marginal model plot, chi-residual plot, and CERES plot. Through simulation data, we investigate and compare the results of graphical approaches under diverse conditions.

Evaluation of the Probability of Detection Surface for ODSCC in Steam Generator Tubes Using Multivariate Logistic Regression (다변량 로지스틱 회귀분석을 이용한 증기발생기 전열관 ODSCC의 POD곡면 분석)

  • Lee, Jae-Bong;Park, Jai-Hak;Kim, Hong-Deok;Chung, Han-Sub
    • Proceedings of the KSME Conference
    • /
    • 2007.05a
    • /
    • pp.250-255
    • /
    • 2007
  • Steam generator tubes play an important role in safety because they constitute one of the primary barriers between the radioactive and non-radioactive sides of the nuclear power plant. For this reason, the integrity of the tubes is essential in minimizing the leakage possibility of radioactive water. The integrity of the tubes is evaluated based on NDE (non-destructive evaluation) inspection results. Especially ECT (eddy current test) method is usually used for detecting the flaws in steam generator tubes. However, detection capacity of the NDE is not perfect and all of the "real flaws" which actually existing in steam generator tunes is not known by NDE results. Therefore reliability of NDE system is one of the essential parts in assessing the integrity of steam generators. In this study POD (probability of detection) of ECT system for ODSCC in steam generator tubes is evaluated using multivariate logistic regression. The cracked tube specimens are made using the withdrawn steam generator tubes. Therefore the cracks are not artificial but real. Using the multivariate logistic regression method, continuous POD surfaces are evaluated from hit (detection) and miss (no detection) binary data obtained from destructive and non-destructive evaluation of the cracked tubes. Length and depth of cracks are considered in multivariate logistic regression and their effects on detection capacity are evaluated.

  • PDF

Application Method of Logistic Regression Analysis for Annoyance Prediction Model Based on Predicted Noise Level (예측소음도를 이용한 어노이언스 예측모델을 위한 로지스틱 회귀분석의 적용방법)

  • Son, Jin-Hee;Lee, Kun;Choung, Tae-Ryang;Chang, Seo-Il
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.20 no.6
    • /
    • pp.555-561
    • /
    • 2010
  • Predicted noise level has been used to assess the annoyance response since noise map was generalized and being the normal method to assess the environmental noise. Unfortunately using predicted noise level to derive the annoyance prediction curve caused some problems. The data have to be grouped manually to use the annoyance prediction curve. The aim of this paper is to propose the method to handle the predicted noise level and the survey data for annoyance prediction curve. This paper used the percentage of persons annoyed(%A) and the percentage of persons highly annoyed as the descriptor of noise annoyance in a population. The logistic regression method was used for deriving annoyance prediction curve. It is concluded that the method of dichotomizing data and logistic regression was suitable to handle the predicted noise level and survey data.

Detecting Improper Sentences in a News Article Using Text Mining (텍스트 마이닝을 이용한 기사 내 부적합 문단 검출 시스템)

  • Kim, Kyu-Wan;Sin, Hyun-Ju;Kim, Seon-Jin;Lee, Hyun Ah
    • 한국어정보학회:학술대회논문집
    • /
    • 2017.10a
    • /
    • pp.294-297
    • /
    • 2017
  • SNS와 스마트기기의 발전으로 온라인을 통한 뉴스 배포가 용이해지면서 악의적으로 조작된 뉴스가 급속도로 생성되어 확산되고 있다. 뉴스 조작은 다양한 형태로 이루어지는데, 이 중에서 정상적인 기사 내에 광고나 낚시성 내용을 포함시켜 독자가 의도하지 않은 정보에 노출되게 하는 형태는 독자가 해당 내용을 진짜 뉴스로 받아들이기 쉽다. 본 논문에서는 뉴스 기사 내에 포함된 문단 중에서 부적합한 문단이 포함 되었는지를 판정하기 위한 방법을 제안한다. 제안하는 방식에서는 자연어 처리에 유용한 Convolutional Neural Network(CNN)모델 중 Word2Vec과 tf-idf 알고리즘, 로지스틱 회귀를 함께 이용하여 뉴스 부적합 문단을 검출한다. 본 시스템에서는 로지스틱 회귀를 이용하여 문단의 카테고리를 분류하여 본문의 카테고리 분포도를 계산하고 Word2Vec을 이용하여 문단간의 유사도를 계산한 결과에 가중치를 부여하여 부적합 문단을 검출한다.

  • PDF

Prediction of fine dust PM10 using a deep neural network model (심층 신경망모형을 사용한 미세먼지 PM10의 예측)

  • Jeon, Seonghyeon;Son, Young Sook
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.2
    • /
    • pp.265-285
    • /
    • 2018
  • In this study, we applied a deep neural network model to predict four grades of fine dust $PM_{10}$, 'Good, Moderate, Bad, Very Bad' and two grades, 'Good or Moderate and Bad or Very Bad'. The deep neural network model and existing classification techniques (such as neural network model, multinomial logistic regression model, support vector machine, and random forest) were applied to fine dust daily data observed from 2010 to 2015 in six major metropolitan areas of Korea. Data analysis shows that the deep neural network model outperforms others in the sense of accuracy.