• 제목/요약/키워드: Performance-based Statistics

검색결과 1,048건 처리시간 0.021초

A Method to Predict the Number of Clusters

  • Chae, Seong-San;Willian D. Warde
    • Journal of the Korean Statistical Society
    • /
    • 제20권2호
    • /
    • pp.162-176
    • /
    • 1991
  • The problem of determining the number of clusters, K. is the main objective of this study. Attention is focused on the use of Rand(1971)'s $C_{k}$ statistic with some agglomerative clustering algorithms(ACA) defined in the ($\beta$, $\pi$) plane in predicting the number of clusters within the given set of data. The (k, $C_{k}$) plots for k=1, 2, …, N are explored by a Monte Carlo study. Based on its performance, the use of $C_{k}$ with the pair of ACA, (-.5, .75) and (-.25, .0), is recommended for predicting the number of clusters present within a set of data. data.

  • PDF

Combined Response Modeling for Individual Marketing by RFM and Confidence

  • Lee, Jea-Young;Lee, Ho-Kuen
    • Journal of the Korean Data and Information Science Society
    • /
    • 제19권2호
    • /
    • pp.597-608
    • /
    • 2008
  • Marketing has been used the power of data and information technology in the pursuit of personal marketing of products and service to customers, based on their preferences and needs. We analyzed the performance of twenty six combined(RFM and Confidence) response modeling methods that were proposed by Zahavi and Levin(l997) and Sho, et al.(1999). As a result, we were able to increase about 3.5%p. forecasting accuracy of customers response through combination with confidence(C) that is able to consider characteristics of product than using the single RFM model that is practically the most widely used.

  • PDF

A Note on the Use of Peer Assessment to Improve Pupil's Performance

  • Lee, Kyung-Koo;Mun, Gil-Seong;Ahn, Jeong-Yong
    • Journal of the Korean Data and Information Science Society
    • /
    • 제19권2호
    • /
    • pp.443-450
    • /
    • 2008
  • Peer assessment is the process of assessment of students by other students and one form of innovative assessment. It actively involves students in the assessment process and is generally agreed that such involvement enhances the quality and effectiveness of the learning process, since assessing something and benchmarking process is a powerful aid to mastering it themselves. It is more effective on the hard courses for them to understand. In this article we present a peer assessment technique which was applied to students enrolled in a mathematical statistics course and a historical course. In order to measure the effectiveness of the technique, students had to evaluate their colleagues based on predefined criteria and a comparison is presented between the instructor assessments and the peer assessment.

  • PDF

On Practical Efficiency of Locally Parametric Nonparametric Density Estimation Based on Local Likelihood Function

  • Kang, Kee-Hoon;Han, Jung-Hoon
    • Communications for Statistical Applications and Methods
    • /
    • 제10권2호
    • /
    • pp.607-617
    • /
    • 2003
  • This paper offers a practical comparison of efficiency between local likelihood approach and conventional kernel approach in density estimation. The local likelihood estimation procedure maximizes a kernel smoothed log-likelihood function with respect to a polynomial approximation of the log likelihood function. We use two types of data driven bandwidths for each method and compare the mean integrated squares for several densities. Numerical results reveal that local log-linear approach with simple plug-in bandwidth shows better performance comparing to the standard kernel approach in heavy tailed distribution. For normal mixture density cases, standard kernel estimator with the bandwidth in Sheather and Jones(1991) dominates the others in moderately large sample size.

Remarks on correlated error tests

  • Kim, Tae Yoon;Ha, Jeongcheol
    • Journal of the Korean Data and Information Science Society
    • /
    • 제27권2호
    • /
    • pp.559-564
    • /
    • 2016
  • The Durbin-Watson (DW) test in regression model and the Ljung-Box (LB) test in ARMA (autoregressive moving average) model are typical examples of correlated error tests. The DW test is used for detecting autocorrelation of errors using the residuals from a regression analysis. The LB test is used for specifying the correct ARMA model using the first some sample autocorrelations based on the residuals of a tted ARMA model. In this article, simulations with four data generating processes have been carried out to evaluate their performances as correlated error tests. Our simulations show that the DW test is severely dependent on the assumed AR(1) model but isn't sensitive enough to reject the misspecified model and that the LB test reports lackluster performance in general.

EDF 통계량을 이용한 다변량 정규성검정 (Testing Multivariate Normality Based on EDF Statistics)

  • 김남현
    • 응용통계연구
    • /
    • 제19권2호
    • /
    • pp.241-256
    • /
    • 2006
  • EDF에 근거한 $Cram{\acute{e}}r$-von Mises 통계량을 합교원리를 이용하여 다변량으로 일반화한다. 그리고 제안된 통계량의 귀무가설에서의 극한분포를 적절한 공분산 함수를 가진 가우스 과정의 적분의 형태로 표현하고 통계량의 근사적인 계산방법을 고려한다. 또한 실제 자료에 제안된 통계량을 적용해보고 여러가지 대립가설에서의 검정력을 유사한 통계량과 비교해 본다.

Tests for homogeneity of proportions in clustered binomial data

  • Jeong, Kwang Mo
    • Communications for Statistical Applications and Methods
    • /
    • 제23권5호
    • /
    • pp.433-444
    • /
    • 2016
  • When we observe binary responses in a cluster (such as rat lab-subjects), they are usually correlated to each other. In clustered binomial counts, the independence assumption is violated and we encounter an extra-variation. In the presence of extra-variation, the ordinary statistical analyses of binomial data are inappropriate to apply. In testing the homogeneity of proportions between several treatment groups, the classical Pearson chi-squared test has a severe flaw in the control of Type I error rates. We focus on modifying the chi-squared statistic by incorporating variance inflation factors. We suggest a method to adjust data in terms of dispersion estimate based on a quasi-likelihood model. We explain the testing procedure via an illustrative example as well as compare the performance of a modified chi-squared test with competitive statistics through a Monte Carlo study.

Bivariate odd-log-logistic-Weibull regression model for oral health-related quality of life

  • Cruz, Jose N. da;Ortega, Edwin M.M.;Cordeiro, Gauss M.;Suzuki, Adriano K.;Mialhe, Fabio L.
    • Communications for Statistical Applications and Methods
    • /
    • 제24권3호
    • /
    • pp.271-290
    • /
    • 2017
  • We study a bivariate response regression model with arbitrary marginal distributions and joint distributions using Frank and Clayton's families of copulas. The proposed model is used for fitting dependent bivariate data with explanatory variables using the log-odd log-logistic Weibull distribution. We consider likelihood inferential procedures based on constrained parameters. For different parameter settings and sample sizes, various simulation studies are performed and compared to the performance of the bivariate odd-log-logistic-Weibull regression model. Sensitivity analysis methods (such as local and total influence) are investigated under three perturbation schemes. The methodology is illustrated in a study to assess changes on schoolchildren's oral health-related quality of life (OHRQoL) in a follow-up exam after three years and to evaluate the impact of caries incidence on the OHRQoL of adolescents.

Comparison of parameter estimation methods for normal inverse Gaussian distribution

  • Yoon, Jeongyoen;Kim, Jiyeon;Song, Seongjoo
    • Communications for Statistical Applications and Methods
    • /
    • 제27권1호
    • /
    • pp.97-108
    • /
    • 2020
  • This paper compares several methods for estimating parameters of normal inverse Gaussian distribution. Ordinary maximum likelihood estimation and the method of moment estimation often do not work properly due to restrictions on parameters. We examine the performance of adjusted estimation methods along with the ordinary maximum likelihood estimation and the method of moment estimation by simulation and real data application. We also see the effect of the initial value in estimation methods. The simulation results show that the ordinary maximum likelihood estimator is significantly affected by the initial value; in addition, the adjusted estimators have smaller root mean square error than ordinary estimators as well as less impact on the initial value. With real datasets, we obtain similar results to what we see in simulation studies. Based on the results of simulation and real data application, we suggest using adjusted maximum likelihood estimates with adjusted method of moment estimates as initial values to estimate the parameters of normal inverse Gaussian distribution.

Statistical tests for biosimilarity based on relative distance between follow-on biologics for ordinal endpoints

  • Yoo, Myung Soo;Kim, Donguk
    • Communications for Statistical Applications and Methods
    • /
    • 제27권1호
    • /
    • pp.1-14
    • /
    • 2020
  • Investigations of biosimilarity between reference drugs and test drugs required statistical tests; in addition, statistical tests to evaluate biosimilarity have been recently proposed. Ordinal outcome data has been observed in research; however, appropriate statistical tests to deal with ordinal endpoints for biosimilar have not yet been proposed. This paper extends existing design for ordinal endpoints. Using measure of nominal-ordinal association and relative distances between drugs are defined so that testing procedures are developed. Through simulation studies, we investigate type I error rate and power to show the performance of our suggested method. Furthermore, a comparison between the statistical tests and other designs is proviede to show significance of ordinal endpoints.