• Title/Summary/Keyword: Models, statistical

Search Result 3,026, Processing Time 0.029 seconds

A study on forecasting of consumers' choice using artificial neural network (인공신경망을 이용한 소비자 선택 예측에 관한 연구)

  • 송수섭;이의훈
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.26 no.4
    • /
    • pp.55-70
    • /
    • 2001
  • Artificial neural network(ANN) models have been widely used for the classification problems in business such as bankruptcy prediction, credit evaluation, etc. Although the application of ANN to classification of consumers' choice behavior is a promising research area, there have been only a few researches. In general, most of the researches have reported that the classification performance of the ANN models were better than conventional statistical model Because the survey data on consumer behavior may include much noise and missing data, ANN model will be more robust than conventional statistical models welch need various assumptions. The purpose of this paper is to study the potential of the ANN model for forecasting consumers' choice behavior based on survey data. The data was collected by questionnaires to the shoppers of department stores and discount stores. Then the correct classification rates of the ANN models for the training and test sample with that of multiple discriminant analysis(MDA) and logistic regression(Logit) model. The performance of the ANN models were betted than the performance of the MDA and Logit model with respect to correct classification rate. By using input variables identified as significant in the stepwise MDA, the performance of the ANN models were improved.

  • PDF

Validation Comparison of Credit Rating Models for Categorized Financial Data (범주형 재무자료에 대한 신용평가모형 검증 비교)

  • Hong, Chong-Sun;Lee, Chang-Hyuk;Kim, Ji-Hun
    • Communications for Statistical Applications and Methods
    • /
    • v.15 no.4
    • /
    • pp.615-631
    • /
    • 2008
  • Current credit evaluation models based on only financial data except non-financial data are used continuous data and produce credit scores for the ranking. In this work, some problems of the credit evaluation models based on transformed continuous financial data are discussed and we propose improved credit evaluation models based on categorized financial data. After analyzing and comparing goodness-of-fit tests of two models, the availability of the credit evaluation models for categorized financial data is explained.

Development of Empirical and Statistical Models for Prediction of Water Quality of Pretreated Wastewater in Pulp and Paper Industry (제지공정 폐수 전처리 수질예측을 위한 실험적 모델과 통계적 모델 개발)

  • Sohn, Jinsik;Han, Jihee;Lee, Sangho
    • Journal of Korean Society of Water and Wastewater
    • /
    • v.31 no.4
    • /
    • pp.289-296
    • /
    • 2017
  • Pulp and paper industry produces large volumes of wastewater and residual sludge waste, resulting in many issues in relation to wastewater treatment and sludge disposal. Contaminants in pulp and paper wastewater include effluent solids, sediments, chemical oxygen demand (COD), and biological oxygen demand (BOD), which should be treated by wastewater treatment processes such as coagulation and biological treatment. However, few works have been attempted to predict the treatment efficiency of pulp and paper wastewater. Accordingly, this study presented empirical models based on experimental data in laboratory-scale coagulation tests and compared them with statistical models such as artificial neural network (ANN). Results showed that the water quality parameters such as turbidity, suspended solids, COD, and UVA can be predicted using either linear or expoential regression models. Nevertheless, the accuracies for turbidity and UVA predictions were relatively lower than those for SS and COD. On the other hand, ANN showed higher accuracies than the emprical models for all water parameters. However, it seems that two kinds of models should be used together to provide more accurate information on the treatment efficiency of pulp and paper wastewater.

Intensity estimation with log-linear Poisson model on linear networks

  • Idris Demirsoy;Fred W. Hufferb
    • Communications for Statistical Applications and Methods
    • /
    • v.30 no.1
    • /
    • pp.95-107
    • /
    • 2023
  • Purpose: The statistical analysis of point processes on linear networks is a recent area of research that studies processes of events happening randomly in space (or space-time) but with locations limited to reside on a linear network. For example, traffic accidents happen at random places that are limited to lying on a network of streets. This paper applies techniques developed for point processes on linear networks and the tools available in the R-package spatstat to estimate the intensity of traffic accidents in Leon County, Florida. Methods: The intensity of accidents on the linear network of streets is estimated using log-linear Poisson models which incorporate cubic basis spline (B-spline) terms which are functions of the x and y coordinates. The splines used equally-spaced knots. Ten different models are fit to the data using a variety of covariates. The models are compared with each other using an analysis of deviance for nested models. Results: We found all covariates contributed significantly to the model. AIC and BIC were used to select 9 as the number of knots. Additionally, covariates have different effects such as increasing the speed limit would decrease traffic accident intensity by 0.9794 but increasing the number of lanes would result in an increase in the intensity of traffic accidents by 1.086. Conclusion: Our analysis shows that if other conditions are held fixed, the number of accidents actually decreases on roads with higher speed limits. The software we currently use allows our models to contain only spatial covariates and does not permit the use of temporal or space-time covariates. We would like to extend our models to include such covariates which would allow us to include weather conditions or the presence of special events (football games or concerts) as covariates.

Travel-Time Models for Class-Based AS/RS Systems

  • Lee, Young-Hae;Cho, Yong-Seong
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.14 no.1
    • /
    • pp.119-130
    • /
    • 1989
  • This paper presents average travel time models automated warehousing system where the stacker crane transports only one pallet at a time with the tchebychev travel, I/O point is located at the cornor of the rack, and items are stored by the class-based storage assignment rule. In this study, the racks are treated as the continuous rectangle in time and a statistical approach was used to develop the models. In order to test the proposed models, average travel times determined by the models are compared with the true values for various rack shapes.

  • PDF

Performance Evaluation of Time Series Models using Short-Term Air Passenger Data

  • Park, W.G.;Kim, S.
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.6
    • /
    • pp.917-923
    • /
    • 2012
  • We perform a comparison of time series models that include seasonal ARIMA, Fractional ARIMA, and Holt-Winters models; in addition, we also consider hourly and daily air passenger data. The results of the performance evaluation of the models show that the Holt-Winters methods outperforms other models in terms of MAPE.

Near-real time Kp forecasting methods based on neural network and support vector machine

  • Ji, Eun-Young;Moon, Yong-Jae;Park, Jongyeob;Lee, Dong-Hun
    • The Bulletin of The Korean Astronomical Society
    • /
    • v.37 no.2
    • /
    • pp.123.1-123.1
    • /
    • 2012
  • We have compared near-real time Kp forecast models based on neural network (NN) and support vector machine (SVM) algorithms. We consider four models as follows: (1) a NN model using ACE solar wind data; (2) a SVM model using ACE solar wind data; (3) a NN model using ACE solar wind data and preliminary kp values from US ground-based magnetometers; (4) a SVM model using the same input data as model 3. For the comparison of these models, we estimate correlation coefficients and RMS errors between the observed Kp and the predicted Kp. As a result, we found that the model 3 is better than the other models. The values of correlation coefficients and RMS error of the model 3 are 0.93 and 0.48, respectively. For the forecast evaluation of models for geomagnetic storms ($Kp{\geq}6$), we present contingency tables and estimate statistical parameters such as probability of detection yes (PODy), false alarm ratio (FAR), bias, and critical success index (CSI). From a comparison of these statistical parameters, we found that the SVM models (model 2 and model 4) are better than the NN models (model 1 and model 3). The values of PODy and CSI of the model 4 are the highest among these models (PODy: 0.57 and CSI: 0.48). From these results, we suggest that the NN models are better than the SVM models for predicting Kp and the SVM models are better than the NN models for forecasting geomagnetic storms.

  • PDF

A Comparison Study on Statistical Modeling Methods (통계모델링 방법의 비교 연구)

  • Noh, Yoojeong
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.17 no.5
    • /
    • pp.645-652
    • /
    • 2016
  • The statistical modeling of input random variables is necessary in reliability analysis, reliability-based design optimization, and statistical validation and calibration of analysis models of mechanical systems. In statistical modeling methods, there are the Akaike Information Criterion (AIC), AIC correction (AICc), Bayesian Information Criterion, Maximum Likelihood Estimation (MLE), and Bayesian method. Those methods basically select the best fitted distribution among candidate models by calculating their likelihood function values from a given data set. The number of data or parameters in some methods are considered to identify the distribution types. On the other hand, the engineers in a real field have difficulties in selecting the statistical modeling method to obtain a statistical model of the experimental data because of a lack of knowledge of those methods. In this study, commonly used statistical modeling methods were compared using statistical simulation tests. Their advantages and disadvantages were then analyzed. In the simulation tests, various types of distribution were assumed as populations and the samples were generated randomly from them with different sample sizes. Real engineering data were used to verify each statistical modeling method.

Generating high resolution of daily mean temperature using statistical models (통계적모형을 통한 고해상도 일별 평균기온 산정)

  • Yoon, Sanghoo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.5
    • /
    • pp.1215-1224
    • /
    • 2016
  • Climate information of the high resolution grid units is an important factor to explain the phenomenon in a variety of research field. Statistical linear interpolation models are computationally inexpensive and applicable to any climate data compared to the dynamic simulation method at regional scales. In this paper, we considered four different linear-based statistical interpolation models: general linear model, generalized additive model, spatial linear regression model, and Bayesian spatial linear regression model. The climate variable of interest was the daily mean temperature, where the spatial variability was explained using geographic terrain information: latitude, longitude, elevation. The data were collected by weather stations in January from 2003 and 2012. In the sense of RMSE and correlation coefficient, Bayesian spatial linear regression model showed better performance in reflecting the spatial pattern compared to the other models.

Prediction & Assessment of Change Prone Classes Using Statistical & Machine Learning Techniques

  • Malhotra, Ruchika;Jangra, Ravi
    • Journal of Information Processing Systems
    • /
    • v.13 no.4
    • /
    • pp.778-804
    • /
    • 2017
  • Software today has become an inseparable part of our life. In order to achieve the ever demanding needs of customers, it has to rapidly evolve and include a number of changes. In this paper, our aim is to study the relationship of object oriented metrics with change proneness attribute of a class. Prediction models based on this study can help us in identifying change prone classes of a software. We can then focus our efforts on these change prone classes during testing to yield a better quality software. Previously, researchers have used statistical methods for predicting change prone classes. But machine learning methods are rarely used for identification of change prone classes. In our study, we evaluate and compare the performances of ten machine learning methods with the statistical method. This evaluation is based on two open source software systems developed in Java language. We also validated the developed prediction models using other software data set in the same domain (3D modelling). The performance of the predicted models was evaluated using receiver operating characteristic analysis. The results indicate that the machine learning methods are at par with the statistical method for prediction of change prone classes. Another analysis showed that the models constructed for a software can also be used to predict change prone nature of classes of another software in the same domain. This study would help developers in performing effective regression testing at low cost and effort. It will also help the developers to design an effective model that results in less change prone classes, hence better maintenance.