• Title/Summary/Keyword: Prediction of variables


Analyzing Customer Management Data by Data Mining: Case Study on Churn Prediction Models for Insurance Company in Korea

  • Cho, Mee-Hye;Park, Eun-Sik
    • Journal of the Korean Data and Information Science Society / v.19 no.4 / pp.1007-1018 / 2008
  • The purpose of this case study is to demonstrate database-marketing management. First, we explore the original variables in the insurance customer data, modify them where necessary, and carry out variable selection before analysis. We then develop churn prediction models using logistic regression, a neural network, and a support vector machine (SVM), and compare the three data mining models in terms of misclassification rate.

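
The comparison criterion named in the abstract above, misclassification rate, is the share of cases a model labels incorrectly. A minimal Python sketch, assuming binary churn labels (1 = churned, 0 = retained); the labels and the names model_a, model_b, and model_c are made-up placeholders and do not reflect the paper's data or results:

```python
def misclassification_rate(y_true, y_pred):
    """Fraction of cases where the predicted label differs from the true label."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    wrong = sum(1 for t, p in zip(y_true, y_pred) if t != p)
    return wrong / len(y_true)


# Hypothetical toy labels, for illustration only.
y_true = [1, 0, 0, 1, 0, 1, 0, 0]
predictions = {
    "model_a": [1, 0, 0, 0, 0, 1, 0, 1],
    "model_b": [1, 0, 0, 1, 0, 1, 1, 0],
    "model_c": [1, 0, 1, 1, 0, 0, 0, 1],
}

# Score every candidate on the same held-out labels and pick the lowest rate.
rates = {name: misclassification_rate(y_true, pred) for name, pred in predictions.items()}
best = min(rates, key=rates.get)
print(rates, best)
```

In practice the rate would be computed on a validation or test set that was not used for fitting, so that the comparison reflects performance on unseen customers.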

A Research on Yield Prediction of Mixed Pastures in Korea via Model Construction in Stages (혼파초지에서 모형의 단계적 적용을 통한 수량예측 연구)

  • Oh, Seung Min;Kim, Moon Ju;Peng, Jinglun;Lee, Bae Hun;Kim, Ji Yung;Kim, Byong Wan;Jo, Mu Hwan;Sung, Kyung Il
    • Journal of The Korean Society of Grassland and Forage Science / v.37 no.1 / pp.80-91 / 2017
  • The objective of this study was to select a model with high explanatory power, that is, a high R-squared value, for predicting yield in mixed pastures, adding fertilization, seeding rate, and years after pasture establishment in steps on top of climate as the basic factor. The yield prediction model was constructed in the sequence of data collection (forage and climatic data), preparation, analysis, and model construction. Through this process, six models were constructed by considering climatic variables, fertilization management, seeding rates, and years after pasture establishment in steps; the optimum model was then selected according to how well each model agreed with forage production theory. As a result, Model VI (R-squared = 53.8%), which includes climatic variables, fertilization amount, seeding rates, and periods after pasture establishment, was considered the optimum yield prediction model for mixed pastures in South Korea. The explanatory power of the independent variables in the model decreased in the order of climatic variables (24.5%), fertilization amount (17.8%), seeding rates (10.7%), and periods after pasture establishment (0.8%). However, the reason for the positive correlation between dry matter yield and days of summer depression (DSD) needs to be investigated by considering cultivation locations and using other cumulative-temperature-related variables instead of DSD. In addition, further research on the optimum levels of fertilization amount and seeding rate is required, using quadratic terms, because the values of these two variables are concentrated around certain values.
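
As a quick consistency check on the figures quoted above, the four stepwise contributions should add up to the R-squared reported for Model VI. A minimal Python sketch, assuming the percentages are additive increments in R-squared (the dictionary keys are illustrative labels, not variable names from the paper):

```python
# Stepwise contributions (%) as quoted in the abstract.
contributions = {
    "climatic variables": 24.5,
    "fertilization amount": 17.8,
    "seeding rates": 10.7,
    "periods after pasture establishment": 0.8,
}

# Assumption: each figure is the increment in R-squared gained by adding
# that factor group, so the increments sum to the R-squared of Model VI.
total_r_squared = round(sum(contributions.values()), 1)
print(f"Total R-squared: {total_r_squared}%")
```

The sum matches the 53.8% stated for Model VI, which supports reading the per-factor figures as increments rather than as independent R-squared values.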

A Meta-Analysis of Variables Related to Suicidal Ideation in Adolescents (청소년 자살생각 관련변인에 관한 메타분석)

  • Kim, Bo-Young;Lee, Chung-Sook
    • Journal of Korean Academy of Nursing / v.39 no.5 / pp.651-661 / 2009
  • Purpose: This meta-analysis examined 58 studies published over eight years (2000 to 2007) that included variables related to adolescents' suicidal ideation. Methods: The materials for this study were 32 variables selected from master's theses, doctoral dissertations, and articles from journals of the Korean Academy of Nursing. Results: The variables were classified into 5 variable groups comprising 32 variables. In terms of effect size on risk, the significant variable groups were psychological variables (0.668), socio-cultural variables (0.511), family environmental variables (0.405), school environmental variables (0.221), and personal characteristics variables (0.147). In terms of effect size on protection, the significant variable groups were personal characteristics variables (-1.107), psychological variables (-0.526), family environmental variables (-0.264), and school environmental variables (-0.155). Among risk variables, psychological variables (0.668) had the largest effect size; among protective variables, personal characteristics variables (-1.107) had the largest. Conclusion: Although the results indicate possible risk and protective variables for suicidal ideation, prediction remains difficult. Further study comparing adolescents who have similar variables but no suicidal ideation with those who have suicidal ideation is necessary.

A Prediction Model of the Sum of Container Based on Combined BP Neural Network and SVM

  • Ding, Min-jie;Zhang, Shao-zhong;Zhong, Hai-dong;Wu, Yao-hui;Zhang, Liang-bin
    • Journal of Information Processing Systems / v.15 no.2 / pp.305-319 / 2019
  • Predicting the sum of containers is very important in the field of container transport. Many influencing factors can affect the prediction results, and these factors usually consist of many variables whose composition is often very complex. In this paper, we use gray relational analysis to set up a proper forecast index system for predicting the sum of containers in foreign trade. To address the low accuracy of traditional prediction models and the difficulty of fully considering all the factors, this paper puts forward a prediction model that combines a back-propagation (BP) neural network with a support vector machine (SVM). First, the BP neural network makes a prediction from the normalized data and generates preliminary forecast data. Second, the SVM performs a residual correction on the results based on the preliminary data. Practical examples show that the overall relative error of the combined prediction model is no more than 1.5%, which is less than the relative error of the single prediction models. It is hoped that this research can provide a useful reference for predicting the sum of containers and for related studies.
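
The two-stage idea described above (a first model produces a preliminary forecast, a second model learns the leftover residuals, and the two are added) can be sketched in a few lines. The following Python sketch is illustrative only: a least-squares line stands in for the BP neural network, a 1-nearest-neighbour lookup stands in for the SVM, and the data are synthetic. None of the function names or numbers come from the paper.

```python
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least-squares line; a simple stand-in for the BP neural network."""
    x_bar, y_bar = mean(xs), mean(ys)
    slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / sum(
        (x - x_bar) ** 2 for x in xs
    )
    intercept = y_bar - slope * x_bar
    return lambda x: slope * x + intercept


def fit_residual_model(xs, residuals):
    """1-nearest-neighbour residual lookup; a simple stand-in for the SVM corrector."""
    pairs = list(zip(xs, residuals))
    return lambda x: min(pairs, key=lambda p: abs(p[0] - x))[1]


# Synthetic, deliberately curved data that a straight line cannot fit well.
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [x ** 2 for x in xs]

# Stage 1: preliminary forecast.
base = fit_line(xs, ys)

# Stage 2: model what the first stage got wrong.
residuals = [y - base(x) for x, y in zip(xs, ys)]
corrector = fit_residual_model(xs, residuals)


def combined(x):
    """Final forecast = preliminary forecast + predicted residual."""
    return base(x) + corrector(x)


def mean_relative_error(predict):
    return mean(abs(predict(x) - y) / y for x, y in zip(xs, ys))


base_error = mean_relative_error(base)
combined_error = mean_relative_error(combined)
print(base_error, combined_error)
```

Both errors here are measured on the training points, where a nearest-neighbour corrector reproduces the residuals exactly, so the gap overstates the benefit. A fair comparison would evaluate both models on held-out data.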

Characteristics of Chatter Stability Lobe in 2-DOF Machining System (2-DOF 가공시스템의 채터로브 거동연구)

  • Lee, Hyuk;Chin, Dohun;Yoon, Moonchul
    • Journal of the Korean Society of Manufacturing Process Engineers / v.18 no.7 / pp.1-7 / 2019
  • A chatter lobe analysis is frequently used to examine the chatter state. Although there is a lot of research on chatter, chatter lobe characteristics are not well defined. In this study, the chatter lobe behavior according to several variables of the vibration mode is verified for further clarity. The dynamic variables of the chatter model are defined and their behavior on the chatter lobe boundary is analyzed in detail. To this end, a chatter model with two degrees of freedom (2-DOF) was used to analyze chatter stability characteristics. The results are satisfactory and can be used to predict the existence of chatter in machining processes of 2-DOF systems over several revolution ranges. These analyses indicate better agreement in predicting an appropriate stability lobe over a wide, detailed range of critical depths of cut in machining operations. The results allow an excellent prediction of chatter according to various static and dynamic variables in machining states. The behavior of the chatter dynamic variables in machining was also discussed in detail. All these results can also be applied to other machining processes by establishing a chatter model in a 2-DOF system.

Learning Method for Real-time Crime Prediction Model Utilizing CCTV

  • Bang, Seung-Hwan;Cho, Hyun-Bo
    • Journal of the Korea Society of Computer and Information / v.21 no.5 / pp.91-98 / 2016
  • We propose a method to train a model that can predict the probability of a crime being committed. Training the crime prediction model requires CCTV data matched to criminal events. However, collecting CCTV data appropriate for training is difficult. Thus, we collected actual criminal records and converted them to an appropriate format, using variables chosen by considering the crime prediction environment and the availability of real-time data collection from CCTV. In addition, we identified new specific crime types according to the characteristics of criminal events, and trained and tested the prediction model by applying neural network partial least squares to each crime type. The results show a level of predictive accuracy sufficiently significant to demonstrate the applicability of CCTV to real-time crime prediction.

Development of the Roundwood Demand Prediction Model

  • Kim, Dong-Jun
    • Journal of Korean Society of Forest Science / v.95 no.2 / pp.203-208 / 2006
  • This study compared the roundwood demand prediction accuracy of econometric and time-series models using Korean data. Roundwood was divided by species into softwood and hardwood. The econometric model of roundwood demand was specified with four explanatory variables: own price, substitute price, gross domestic product, and a dummy variable. The time-series model was specified with a lagged endogenous variable. The dummy variable reflected the abrupt decrease in roundwood demand in the late 1990s in the case of softwood roundwood, and the boom in plywood exports in the late 1970s in the case of hardwood roundwood. Prediction accuracy was evaluated on the basis of the root mean square error (RMSE). The results showed that softwood roundwood demand can be predicted more accurately by the econometric model than by the time-series model. However, the prediction accuracy for hardwood roundwood demand was similar for the econometric and time-series models.
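
The accuracy measure used above, RMSE, is taken here in its usual sense as the square root of the mean squared difference between observed and predicted values. A minimal Python sketch of comparing two forecasts on that basis follows; the series and the names forecast_a and forecast_b are synthetic placeholders, not data or models from the study:

```python
from math import sqrt


def rmse(actual, predicted):
    """Root mean square error between observed and predicted values."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    return sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))


# Synthetic demand series and two hypothetical forecasts.
actual = [100.0, 110.0, 120.0, 130.0]
forecast_a = [102.0, 108.0, 121.0, 129.0]
forecast_b = [95.0, 115.0, 125.0, 125.0]

rmse_a = rmse(actual, forecast_a)
rmse_b = rmse(actual, forecast_b)

# The forecast with the lower RMSE is the more accurate one on this sample.
print(rmse_a, rmse_b)
```

Because errors are squared before averaging, RMSE penalizes a few large misses more heavily than many small ones, which matters when demand shifts abruptly.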

Development of the Prediction Method for Hospital Bankruptcy using a Hierarchical Generalized Linear Model(HGLM) (HGLM을 적용한 병원 도산 예측방법의 개발)

  • Noh, Maeng-Seok;Chang, Hye-Jung;Lee, Young-Jo
    • Korea Journal of Hospital Management / v.6 no.2 / pp.22-36 / 2001
  • The hospital bankruptcy rate is increasing, so it is very important to predict bankruptcy using existing hospital management information. Hospital bankruptcy is often measured in yearly intervals, known as grouped duration data, rather than by the continuous time elapsed until bankruptcy. This study introduces a hierarchical generalized linear model (HGLM) for the analysis of hospital bankruptcy data. The hazard function for each hospital may be influenced by unobservable latent variables; these unknown variables are usually termed random effects or frailties, which explain correlations among repeated measures of the same hospital and describe the individual heterogeneity of hospitals. In practice, data on twenty bankrupt and sixty profitable hospitals were collected over five years and fitted to the HGLM. The results were compared with those of the logit model. While the logit model yielded only the effects of explanatory variables on bankruptcy status in a specific period, the HGLM identified variables with significant effects over all observed years. It is concluded that the HGLM with a fixed ratio and a period of total asset turnrounds was justified, and could find significant within- and between-hospital variations.


Construction Safety and Health Management Cost Prediction Model using Support Vector Machine (서포트 벡터 머신을 이용한 건설업 안전보건관리비 예측 모델)

  • Shin, Sung Woo
    • Journal of the Korean Society of Safety / v.32 no.1 / pp.115-120 / 2017
  • The aim of this study is to develop a construction safety and health management cost prediction model using a support vector machine (SVM). To this end, the theoretical concept of the SVM is investigated to formulate the cost prediction model. Input and output variables were selected by analyzing the balancing accounts of completed construction projects. To train and validate the proposed prediction model, 150 data sets were gathered from the field. The effects of the SVM parameters on prediction accuracy are analyzed, and the optimal parameter values are determined from this analysis. Prediction performance tests are conducted to confirm the applicability of the proposed model. Based on the results, it is concluded that the proposed SVM model can effectively be used to predict construction safety and health management costs.

Determining Direction of Conditional Probabilistic Dependencies between Clusters (클러스터간 조건부 확률적 의존의 방향성 결정에 대한 연구)

  • Jung, Sung-Won;Lee, Do-Heon;Lee, Kwang-H.
    • Journal of the Korean Institute of Intelligent Systems / v.17 no.5 / pp.684-690 / 2007
  • We describe a method to predict the direction of conditional probabilistic dependencies between clusters of random variables. Selected variables, called 'gateway variables', are used to predict the conditional probabilistic dependency relations between clusters. The directions of conditional probabilistic dependencies between clusters are predicted by finding a directed acyclic graph (DAG)-shaped dependency structure between the gateway variables. We show that our method yields meaningful prediction results in determining the directions of conditional probabilistic dependencies between clusters.