• 제목/요약/키워드: statistical inferences

검색결과 122건 처리시간 0.027초

Association Rule Mining by Environmental Data Fusion

  • Cho, Kwang-Hyun;Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • 제18권2호
    • /
    • pp.279-287
    • /
    • 2007
  • Data fusion is the process of combining multiple data in order to produce information of tactical value to the user. Data fusion is generally defined as the use of techniques that combine data from multiple sources and gather that information in order to achieve inferences. Data fusion is also called data combination or data matching. Data fusion is divided in five branch types which are exact matching, judgemental matching, probability matching, statistical matching, and data linking. In this paper, we develop was macro program for statistical matching which is one of five branch types for data fusion. And then we apply data fusion and association rule techniques to environmental data.

  • PDF

Development of an Item Selection Method for Test-Construction by using a Relationship Structure among Abilities

  • Kim, Sung-Ho;Jeong, Mi-Sook;Kim, Jung-Ran
    • Communications for Statistical Applications and Methods
    • /
    • 제8권1호
    • /
    • pp.193-207
    • /
    • 2001
  • When designing a test set, we need to consider constraints on items that are deemed important by item developers or test specialists. The constraints are essentially on the components of the test domain or abilities relevant to a given test set. And so if the test domain could be represented in a more refined form, test construction would be made in a more efficient way. We assume that relationships among task abilities are representable by a causal model and that the item response theory (IRT) is not fully available for them. In such a case we can not apply traditional item selection methods that are based on the IRT. In this paper, we use entropy as an uncertainty measure for making inferences on task abilities and developed an optimal item selection algorithm which reduces most the entropy of task abilities when items are selected from an item pool.

  • PDF

Applications of Saddlepoint Method to Stress-Strength Model

  • Na, Jong-Hwa;Kim, Woo-Chul
    • Communications for Statistical Applications and Methods
    • /
    • 제2권2호
    • /
    • pp.336-346
    • /
    • 1995
  • In many problems concerned with statistical inferences, it will be of interest to compute tail areas rather than densities. But, it is often hard to calculate the exact tail probability. Saddlepoint approximation formula to the tail probability of a smooth function of random cector is developed by DiCiccio and Martin(1991). Applications of this method to stress-strength model are considered in this paper. To obtain the generalized p-values suggested by Tsui and Weerahandi(1989), we need to calculate complicated multiple integration. However, DiCiccio and Martin's(1991) results offer a convenient method to approximate these very accurately. For many artificial data sets, we access the accuracy of DiCiccio and Martin's by comparing the approximate value with the exact one.

  • PDF

Statistical Inferences on the Lognormal Hazard Function under Type I Censored Data

  • Kil Ho Cho;In Suk Lee;Jeen Kap Choi
    • Communications for Statistical Applications and Methods
    • /
    • 제1권1호
    • /
    • pp.20-26
    • /
    • 1994
  • The hazard function is a non-negative function that measures the propensity of failure in the immediate furture, and is frequently used as a decision criterion, especially in replacement decisions. In this paper, we compute approximate confidence intervals for the lognormal hazard function under Type I censored data, and show how to choose the sample size needed to estimate a point on the hazard function with a specified degree of precision. Also we provide a table that can be used to compute the sample size.

  • PDF

On the maximum likelihood estimators for parameters of a Weibull distribution under random censoring

  • Kim, Namhyun
    • Communications for Statistical Applications and Methods
    • /
    • 제23권3호
    • /
    • pp.241-250
    • /
    • 2016
  • In this paper, we consider statistical inferences on the estimation of the parameters of a Weibull distribution when data are randomly censored. Maximum likelihood estimators (MLEs) and approximate MLEs are derived to estimate the parameters. We consider two cases for the censoring model: the assumption that the censoring distribution does not involve any parameters of interest and a censoring distribution that follows a Weibull distribution. A simulation study is conducted to compare the performances of the estimators. The result shows that the MLEs and the approximate MLEs are similar in terms of biases and mean square errors; in addition, the assumption of the censoring model has a strong influence on the estimation of scale parameter.

On the maximum likelihood estimation for a normal distribution under random censoring

  • Kim, Namhyun
    • Communications for Statistical Applications and Methods
    • /
    • 제25권6호
    • /
    • pp.647-658
    • /
    • 2018
  • In this paper, we study statistical inferences on the maximum likelihood estimation of a normal distribution when data are randomly censored. Likelihood equations are derived assuming that the censoring distribution does not involve any parameters of interest. The maximum likelihood estimators (MLEs) of the censored normal distribution do not have an explicit form, and it should be solved in an iterative way. We consider a simple method to derive an explicit form of the approximate MLEs with no iterations by expanding the nonlinear parts of the likelihood equations in Taylor series around some suitable points. The points are closely related to Kaplan-Meier estimators. By using the same method, the observed Fisher information is also approximated to obtain asymptotic variances of the estimators. An illustrative example is presented, and a simulation study is conducted to compare the performances of the estimators. In addition to their explicit form, the approximate MLEs are as efficient as the MLEs in terms of variances.

Statistical Estimates from Black Non-Hispanic Female Breast Cancer Data

  • Khan, Hafiz Mohammad Rafiqullah;Ibrahimou, Boubakari;Saxena, Anshul;Gabbidon, Kemesha;Abdool-Ghany, Faheema;Ramamoorthy, Venkataraghavan;Ullah, Duff;Stewart, Tiffanie Shauna-Jeanne
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제15권19호
    • /
    • pp.8371-8376
    • /
    • 2014
  • Background: The use of statistical methods has become an imperative tool in breast cancer survival data analysis. The purpose of this study was to develop the best statistical probability model using the Bayesian method to predict future survival times for the black non-Hispanic female breast cancer patients diagnosed during 1973-2009 in the U.S. Materials and Methods: We used a stratified random sample of black non-Hispanic female breast cancer patient data from the Surveillance Epidemiology and End Results (SEER) database. Survival analysis was performed using Kaplan-Meier and Cox proportional regression methods. Four advanced types of statistical models, Exponentiated Exponential (EE), Beta Generalized Exponential (BGE), Exponentiated Weibull (EW), and Beta Inverse Weibull (BIW) were utilized for data analysis. The statistical model building criteria, Akaike Information Criteria (AIC), Bayesian Information Criteria (BIC), and Deviance Information Criteria (DIC) were used to measure the goodness of fit tests. Furthermore, we used the Bayesian approach to obtain the predictive survival inferences from the best-fit data based on the exponentiated Weibull model. Results: We identified the highest number of black non-Hispanic female breast cancer patients in Michigan and the lowest in Hawaii. The mean (SD), of age at diagnosis (years) was 58.3 (14.43). The mean (SD), of survival time (months) for black non-Hispanic females was 66.8 (30.20). Non-Hispanic blacks had a significantly increased risk of death compared to Black Hispanics (Hazard ratio: 1.96, 95%CI: 1.51-2.54). Compared to other statistical probability models, we found that the exponentiated Weibull model better fits for the survival times. By making use of the Bayesian method predictive inferences for future survival times were obtained. Conclusions: These findings will be of great significance in determining appropriate treatment plans and health-care cost allocation. Furthermore, the same approach should contribute to build future predictive models for any health related diseases.

중도절단(中途切斷)된 데이터를 이용한 와이블회귀모형(回歸模型)의 통계적(統計的) 추론(推論)에 관한 연구(硏究) (Statistical Inferences in the Weibull Regression Model based on Censored Data)

  • 조길호
    • Journal of the Korean Data and Information Science Society
    • /
    • 제4권
    • /
    • pp.13-30
    • /
    • 1993
  • We propose the ordered least squares estimators(OLSE's) of the parameters and the p-th quantiles for the two-parameter Weibull regression model under the Type II censoring, The Monte Carlo simulations are performed to compare the proposed estimators with the maximum likelihood estimators(MLE's), and it is shown that the proposed estimators are slightly better than MLE's as the censoring rate goes up.

  • PDF

Unified jackknife estimation for parameter changes in an exponential distribution

  • Woo, Jung-Soo
    • 대한수학회지
    • /
    • 제32권1호
    • /
    • pp.77-84
    • /
    • 1995
  • Many authors have utilized an exponential distribution because of its wide applicability in reliability engineering and statistical inferences (see Bain & Engelhart(1987) and Saunders & Mann(1985)). Here we are considering the parametric estimation in an exponential distribution when its scale and location parametes are linear functions of a known exposure level t, which often occurs in the engineering and physical phenomena.

  • PDF

Empirical Bayes Inferences in the Burr Distribution by the Bootstrap Methods

  • Cho, Kil-Ho;Cho, Jang-Sik;Jeong, Seong-Hwa;Shin, Jae-Seock
    • Journal of the Korean Data and Information Science Society
    • /
    • 제15권3호
    • /
    • pp.625-632
    • /
    • 2004
  • We consider the empirical Bayes confidence intervals that attain a specified level of EB coverage for the scale parameter in the Burr distribution under type II censoring data. Also, we compare the coverage probabilities and the expected confidence interval lengths for these confidence intervals through simulation study.

  • PDF