• 제목/요약/키워드: Statistical classification

검색결과 1,419건 처리시간 0.027초

인공신경망을 이용한 소비자 선택 예측에 관한 연구 (A study on forecasting of consumers' choice using artificial neural network)

  • 송수섭;이의훈
    • 한국경영과학회지
    • /
    • 제26권4호
    • /
    • pp.55-70
    • /
    • 2001
  • Artificial neural network(ANN) models have been widely used for the classification problems in business such as bankruptcy prediction, credit evaluation, etc. Although the application of ANN to classification of consumers' choice behavior is a promising research area, there have been only a few researches. In general, most of the researches have reported that the classification performance of the ANN models were better than conventional statistical model Because the survey data on consumer behavior may include much noise and missing data, ANN model will be more robust than conventional statistical models welch need various assumptions. The purpose of this paper is to study the potential of the ANN model for forecasting consumers' choice behavior based on survey data. The data was collected by questionnaires to the shoppers of department stores and discount stores. Then the correct classification rates of the ANN models for the training and test sample with that of multiple discriminant analysis(MDA) and logistic regression(Logit) model. The performance of the ANN models were betted than the performance of the MDA and Logit model with respect to correct classification rate. By using input variables identified as significant in the stepwise MDA, the performance of the ANN models were improved.

  • PDF

사상체질의학의 심성과 MBTI 성격유형의 설문 비교 연구 (The comparative questionnaire study of the spirit of Sasang Constitution with the MBTI classification of character)

  • 성진혁;한국MBTI연구소
    • 사상체질의학회지
    • /
    • 제13권2호
    • /
    • pp.156-164
    • /
    • 2001
  • This study started from the curiasty that Sasang Constitution spirit of Lee ]e-ma has something to do with MBTI based on classification of character of C.G. Jung. This reports made from the information of 368 people who got the Sasang Constitution therapy and showed the good result in health in my hospital and they take part in self-report from of MBTI which are made as statistics and research in relationship between Sasang Constitution spitrit of Lee ]e-ma and classification of character of C.G. Jung This is the statistical result of the research. There is not exactly statistical result which support Sasang Constitution spirit of Lee Je-ma relate to the classification of character of C.G. Jung. The identification of statement of Sasang Constitution spirit with partispart is average 41 which is low while the identification of classification of character with partispart is average 76 which is high. In this reseult, It is hard to get general agreement that the statement of Sasang Constitution of spirit relate to the classification of character because of the difference of identification. so more studies are needed in this part.

  • PDF

GIS기반의 통계정보를 이용한 토지이용 분류 (Land Use Classification Using GIS based Statistical Unit data)

  • 민숙주;김계현;박태옥;전방진
    • 한국측량학회:학술대회논문집
    • /
    • 한국측량학회 2004년도 추계학술발표회 논문집
    • /
    • pp.343-347
    • /
    • 2004
  • Landuse information is used to plan land use, urban and environmental management as base data. And, demand for landuse information is rising due to ecological consideration in urban area. But existing method to extract landuse information from aerial photographs or satellite images is difficulte to describe sufficient urban landuses. Also landuse information need to be linked with statistical data because statistical data is used to make decision for urban planning and management with landuse. Therefore this study aims to examine the landuse classification method using statistical unit data and 1:1,000 digital topographic data. for the purpose, the method was applied to a part of metropolitan Seoul. The results of study shows that total accuracy is 95%. For the future, the method will be effectively applicable for the city maintenance.

  • PDF

A Decision Tree Algorithm using Genetic Programming

  • Park, Chongsun;Ko, Young Kyong
    • Communications for Statistical Applications and Methods
    • /
    • 제10권3호
    • /
    • pp.845-857
    • /
    • 2003
  • We explore the use of genetic programming to evolve decision trees directly for classification problems with both discrete and continuous predictors. We demonstrate that the derived hypotheses of standard algorithms can substantially deviated from the optimum. This deviation is partly due to their top-down style procedures. The performance of the system is measured on a set of real and simulated data sets and compared with the performance of well-known algorithms like CHAID, CART, C5.0, and QUEST. Proposed algorithm seems to be effective in handling problems caused by top-down style procedures of existing algorithms.

Bias Reduction in Split Variable Selection in C4.5

  • Shin, Sung-Chul;Jeong, Yeon-Joo;Song, Moon Sup
    • Communications for Statistical Applications and Methods
    • /
    • 제10권3호
    • /
    • pp.627-635
    • /
    • 2003
  • In this short communication we discuss the bias problem of C4.5 in split variable selection and suggest a method to reduce the variable selection bias among categorical predictor variables. A penalty proportional to the number of categories is applied to the splitting criterion gain of C4.5. The results of empirical comparisons show that the proposed modification of C4.5 reduces the size of classification trees.

A Method of Obtaning Least Squares Estimators of Estimable Functions in Classification Linear Models

  • Kim, Byung-Hwee;Chang, In-Hong;Dong, Kyung-Hwa
    • Journal of the Korean Statistical Society
    • /
    • 제28권2호
    • /
    • pp.183-193
    • /
    • 1999
  • In the problem of estimating estimable functions in classification linear models, we propose a method of obtaining least squares estimators of estimable functions. This method is based on the hierarchical Bayesian approach for estimating a vector of unknown parameters. Also, we verify that estimators obtained by our method are identical to least squares estimators of estimable functions obtained by using either generalized inverses or full rank reparametrization of the models. Some examples are given which illustrate our results.

  • PDF

Robust Variable Selection in Classification Tree

  • 장정이;정광모
    • 한국통계학회:학술대회논문집
    • /
    • 한국통계학회 2001년도 추계학술발표회 논문집
    • /
    • pp.89-94
    • /
    • 2001
  • In this study we focus on variable selection in decision tree growing structure. Some of the splitting rules and variable selection algorithms are discussed. We propose a competitive variable selection method based on Kruskal-Wallis test, which is a nonparametric version of ANOVA F-test. Through a Monte Carlo study we note that CART has serious bias in variable selection towards categorical variables having many values, and also QUEST using F-test is not so powerful to select informative variables under heavy tailed distributions.

  • PDF

Classification Analysis in Information Retrieval by Using Gauss Patterns

  • Lee, Jung-Jin;Kim, Soo-Kwan
    • Communications for Statistical Applications and Methods
    • /
    • 제9권1호
    • /
    • pp.1-11
    • /
    • 2002
  • This paper discusses problems of the Poisson Mixture model which Is widely used to decide the effective words in judging relevant document. Gamma Distribution model and Gauss Patterns model as an alternative of the Poisson Mixture model are studied. Classification experiments by using TREC sub-collection, WSJ[1,2] with MGQUERY and AidSearch3.0 system are discussed.

Improving Bagging Predictors

  • Kim, Hyun-Joong;Chung, Dong-Jun
    • 한국통계학회:학술대회논문집
    • /
    • 한국통계학회 2005년도 추계 학술발표회 논문집
    • /
    • pp.141-146
    • /
    • 2005
  • Ensemble method has been known as one of the most powerful classification tools that can improve prediction accuracy. Ensemble method also has been understood as ‘perturb and combine’ strategy. Many studies have tried to develop ensemble methods by improving perturbation. In this paper, we propose two new ensemble methods that improve combining, based on the idea of pattern matching. In the experiment with simulation data and with real dataset, the proposed ensemble methods peformed better than bagging. The proposed ensemble methods give the most accurate prediction when the pruned tree was used as the base learner.

  • PDF

Empirical Choice of the Shape Parameter for Robust Support Vector Machines

  • Pak, Ro-Jin
    • Communications for Statistical Applications and Methods
    • /
    • 제15권4호
    • /
    • pp.543-549
    • /
    • 2008
  • Inspired by using a robust loss function in the support vector machine regression to control training error and the idea of robust template matching with M-estimator, Chen (2004) applies M-estimator techniques to gaussian radial basis functions and form a new class of robust kernels for the support vector machines. We are specially interested in the shape of the Huber's M-estimator in this context and propose a way to find the shape parameter of the Huber's M-estimating function. For simplicity, only the two-class classification problem is considered.