• Title/Summary/Keyword: Clamor-von Mises 검정

Search Result 2, Processing Time 0.07 seconds

Ordinal Variable Selection in Decision Trees (의사결정나무에서 순서형 분리변수 선택에 관한 연구)

  • Kim Hyun-Joong
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.1
    • /
    • pp.149-161
    • /
    • 2006
  • The most important component in decision tree algorithm is the rule for split variable selection. Many earlier algorithms such as CART and C4.5 use greedy search algorithm for variable selection. Recently, many methods were developed to cope with the weakness of greedy search algorithm. Most algorithms have different selection criteria depending on the type of variables: continuous or nominal. However, ordinal type variables are usually treated as continuous ones. This approach did not cause any trouble for the methods using greedy search algorithm. However, it may cause problems for the newer algorithms because they use statistical methods valid for continuous or nominal types only. In this paper, we propose a ordinal variable selection method that uses Cramer-von Mises testing procedure. We performed comparisons among CART, C4.5, QUEST, CRUISE, and the new method. It was shown that the new method has a good variable selection power for ordinal type variables.

Modified Test Statistic for Identity of Two Distribution on Credit Evaluation (신용평가에서 두 분포의 동일성 검정에 대한 수정통계량)

  • Hong, C.S.;Park, H.S.
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.2
    • /
    • pp.237-248
    • /
    • 2009
  • The probability of default on the credit evaluation study is represented as a linear combination of two distributions of default and non-default, and the distribution of the probability of default are generally known in most cases. Except the well-known Kolmogorov-Smirnov statistic for testing the identity of two distribution, Kuiper, Cramer-Von Mises, Anderson-Darling, and Watson test statistics are introduced in this work. Under the assumption that the population distribution is known, modified Cramer-Von Mises, Anderson-Darling, and Watson statistics are proposed. Based on score data generated from various probability density functions of the probability of default, the modified test statistics are discussed and compared.