• Title/Summary/Keyword: Statistical decision

Multivariate Decision Tree for High-dimensional Response Vector with Its Application

  • Lee, Seong-Keon
    • Communications for Statistical Applications and Methods / v.11 no.3 / pp.539-551 / 2004
  • Multiple responses are often observed in many application fields, such as customers' time-of-day patterns of internet use. Many researchers have constructed decision trees for multiple responses. However, if the response is a high-dimensional vector that can be thought of as a discretized function, then fitting a multivariate decision tree may be unsuccessful. Yu and Lambert (1999) suggested the spline tree and the principal component tree to analyze high-dimensional response vectors using dimension reduction techniques. In this paper, we propose the factor tree, which would be more interpretable and competitive. Furthermore, using data from a Korean internet company, we analyze the time-of-day patterns of internet users.

A Data Mining Approach for a Dynamic Development of an Ontology-Based Statistical Information System

  • Mohamed Hachem Kermani;Zizette Boufaida;Amel Lina Bensabbane;Besma Bourezg
    • Journal of Information Science Theory and Practice / v.11 no.2 / pp.67-81 / 2023
  • This paper presents a dynamic development of an ontology-based statistical information system supporting the collection, storage, processing, analysis, and the presentation of statistical knowledge at the national scale. To accomplish this, we propose a data mining technique to dynamically collect data relating to citizens from publicly available data sources; the collected data will then be structured, classified, categorized, and integrated into an ontology. Moreover, an intelligent platform is proposed in order to generate quantitative and qualitative statistical information based on the knowledge stored in the ontology. The main aims of our proposed system are to digitize administrative tasks and to provide reliable statistical information to governmental, economic, and social actors. The authorities will use the ontology-based statistical information system for strategic decision-making as it easily collects, produces, analyzes, and provides both quantitative and qualitative knowledge that will help to improve the administration and management of national political, social, and economic life.

Decision Tree Based Context Clustering with Cross Likelihood Ratio for HMM-based TTS (HMM 기반의 TTS를 위한 상호유사도 비율을 이용한 결정트리 기반의 문맥 군집화)

  • Jung, Chi-Sang;Kang, Hong-Goo
    • The Journal of the Acoustical Society of Korea / v.32 no.2 / pp.174-180 / 2013
  • This paper proposes a decision tree based context clustering algorithm for HMM-based speech synthesis systems using the cross likelihood ratio with a hierarchical prior (CLRHP). Conventional algorithms tie the context-dependent HMM states that have similar statistical characteristics, but they do not consider the statistical similarity of split child nodes, so they do not guarantee a statistical difference between the final leaf nodes. The proposed CLRHP algorithm improves the reliability of model parameters by adopting the criterion of minimizing the statistical similarity of split child nodes. Experimental results verify the superiority of the proposed approach over conventional ones.

A Study on Methods for Determining the Number of Duplicate Copies in University Libraries (대학도서관의 복본수 결정기법에 관한 연구)

  • 양재한
    • Journal of Korean Library and Information Science Society / v.13 / pp.131-166 / 1986
  • This study reviews methods for deciding the number of duplicate copies in the academic library. In this thesis, I surveyed queueing and Markov models, statistical models, and simulation models. The contents of the study can be summarized as follows: 1) The queueing and Markov model is one method for duplicate copies decision-making. This model was suggested by Leimkuhler, Morse, Chen, and others. Leimkuhler proposed a growth model, a storage model, and an availability model using the systems analysis method. Queueing theory is applied to Leimkuhler's availability model. Morse and Chen applied queueing and Markov models to their theory. They used queueing theory for measuring satisfaction level and the Markov model for predicting user demand. 2) Another method for duplicate copies decision-making is the statistical model. This model was suggested by Grant and by Sohn, Jung Pyo. Grant suggested a model with a formula for satisfying more than 95% of user demand. Sohn, Jung Pyo suggested a model with two formulas: one for duplicate copies decision-making using the standard deviation and the other for predicting duplicate copies using the coefficient of variation. 3) The simulation model is a further method for duplicate copies decision-making. This model was suggested by Buckland and Arms. Buckland considered both loan period and duplicate copies simultaneously in his simulation model. Arms suggested a computer-simulation model as one method for duplicate copies decision-making. These methods can help improve the efficiency of collection development and solve some problems (space, staff, budget, etc.) of Korean academic libraries today.

Application of Quality Statistical Techniques Based on the Review and the Interpretation of Medical Decision Metrics (의학적 의사결정 지표의 고찰 및 해석에 기초한 품질통계기법의 적용)

  • Choi, Sungwoon
    • Journal of the Korea Safety Management & Science / v.15 no.2 / pp.243-253 / 2013
  • This research paper introduces the application and implementation of medical decision metrics that classify medical decision-making into four different metrics using statistical diagnostic tools, such as the confusion matrix, the normal distribution, Bayesian prediction, and the Receiver Operating Characteristic (ROC) curve. In this study, the metrics are developed based on cross-sectional, cohort, and case-control studies gathered through a systematic literature review, and the structure of type I error, type II error, confidence level, and power of detection is reformulated. The study proposes implementation strategies for 10 quality improvement activities via 14 medical decision metrics which consider specificity and sensitivity in terms of ${\alpha}$ and ${\beta}$. Examples of ROC implications are depicted in this paper with useful guidelines for implementing continuous quality improvement, not only in variable acceptance sampling in Quality Control (QC) but also in a supplier grading score chart in Supply Chain Management (SCM) quality. This research paper is the first to apply and implement medical decision-making tools as quality improvement activities. The proposed models will help quality practitioners to enhance the process and product quality level.
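
As a rough illustration of how sensitivity and specificity relate to ${\alpha}$ and ${\beta}$, the following minimal sketch computes all four from the counts of a 2x2 confusion matrix. It is not taken from the paper: the function name `decision_metrics`, the example counts, and the convention that "condition absent" is the null hypothesis (so a false positive is a type I error) are assumptions made for illustration.

```python
def decision_metrics(tp, fp, fn, tn):
    """Compute sensitivity, specificity, alpha, and beta from 2x2 counts.

    Assumed convention (not from the paper): the null hypothesis is
    "condition absent", so a false positive is a type I error.
    """
    sensitivity = tp / (tp + fn)  # true positive rate, equals 1 - beta
    specificity = tn / (tn + fp)  # true negative rate, equals 1 - alpha
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "alpha": 1 - specificity,  # false positive rate (type I error)
        "beta": 1 - sensitivity,   # false negative rate (type II error)
    }


# Hypothetical counts: 100 cases with the condition, 200 without.
metrics = decision_metrics(tp=90, fp=20, fn=10, tn=180)
print(metrics)
```

Under this convention, the power of detection is the sensitivity and the confidence level is the specificity, which is one way to read the reformulation the abstract describes.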

Statistical Decision making of Association Threshold in Association Rule Data Mining

  • Park, Hee-Chang;Song, Geum-Min
    • Journal of the Korean Data and Information Science Society / v.13 no.2 / pp.115-128 / 2002
  • One of the well-studied problems in data mining is the search for association rules. In this paper we consider statistical decision making for the association threshold in association rules. A chi-squared statistic is used to find the minimum association threshold. We calculate the range of the number of times that two item sets occur simultaneously, and find the minimum confidence threshold values.
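
The abstract does not give the authors' exact procedure, so the following is only a generic sketch of the underlying idea: a Pearson chi-squared test of independence between two item sets A and B, computed from transaction counts. The function name `chi_squared_2x2`, the example counts, and the 5% critical value are assumptions chosen for illustration.

```python
def chi_squared_2x2(n, n_a, n_b, n_ab):
    """Pearson chi-squared statistic for independence of item sets A and B.

    n    : total number of transactions
    n_a  : transactions containing A
    n_b  : transactions containing B
    n_ab : transactions containing both A and B
    """
    # Rows: A present / A absent. Columns: B present / B absent.
    observed = [
        [n_ab, n_a - n_ab],
        [n_b - n_ab, n - n_a - n_b + n_ab],
    ]
    row_totals = [n_a, n - n_a]
    col_totals = [n_b, n - n_b]
    stat = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed[i][j] - expected) ** 2 / expected
    return stat


CRITICAL_5PCT_1DF = 3.841  # chi-squared critical value, 1 d.f., 5% level

# Hypothetical counts for illustration only.
stat = chi_squared_2x2(n=1000, n_a=200, n_b=300, n_ab=90)
confidence = 90 / 200  # confidence of the rule A -> B
print(stat, confidence, stat > CRITICAL_5PCT_1DF)
```

With the marginal counts held fixed, varying `n_ab` until the statistic crosses the critical value gives a range of co-occurrence counts, and hence a confidence value, at which the association becomes statistically significant.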

Correspondence between Error Probabilities and Bayesian Wrong Decision Losses in Flexible Two-stage Plans

  • Ko, Seoung-gon
    • Journal of the Korean Statistical Society / v.29 no.4 / pp.435-441 / 2000
  • Ko (1998, 1999) proposed certain flexible two-stage plans that could serve as a one-step interim analysis in ongoing clinical trials. The proposed plans are optimal simultaneously in both a Bayes and a Neyman-Pearson sense. The Neyman-Pearson interpretation is that the average expected sample size is minimized, subject only to the two overall error rates $\alpha$ and $\beta$, of the first and second kind respectively. The Bayes interpretation is that the Bayes risk, involving both sampling cost and wrong decision losses, is minimized. An example of this correspondence is given using a binomial setting.

Development of Discriminant Analysis System by Graphical User Interface of Visual Basic

  • Lee, Yong-Kyun;Shin, Young-Jae;Cha, Kyung-Joon
    • Journal of the Korean Data and Information Science Society / v.18 no.2 / pp.447-456 / 2007
  • Recently, multivariate statistical analysis has been used to extract meaningful information from various data. In this paper, we develop a multivariate statistical analysis system that combines Fisher discriminant analysis, logistic regression, neural networks, and decision trees, using Visual Basic 6.0.

Statistical Decision making of Association Threshold in Association Rule Data Mining

  • Park, Hee-Chang;Song, Geum-Min
    • Korean Data and Information Science Society: Conference Proceedings (한국데이터정보과학회:학술대회논문집) / 2002.06a / pp.169-182 / 2002
  • One of the well-studied problems in data mining is the search for association rules. In this paper we consider statistical decision making for the association threshold in association rules. A chi-squared statistic is used to find the minimum association threshold. We can calculate the range of the number of times that two item sets occur simultaneously, and can find the minimum confidence threshold values.

Estimating the Difference of Two Normal Means

  • M. Aimahmeed;M. S. Son;H. I. Hamdy
    • Communications for Statistical Applications and Methods / v.7 no.1 / pp.297-312 / 2000
  • A three-stage sampling procedure designed to estimate the difference between two normal means is proposed and evaluated within a unified decision-theoretic framework. Both point and fixed-width confidence interval estimation are combined in a single decision rule to make full use of the available data. Adjustments to previous solutions focusing on only one of the latter objectives are indicated. The sensitivity of the confidence interval for detecting shifts in the true mean difference is also investigated. Numerical and simulation studies are presented to supplement the theoretical results.
