• Title/Summary/Keyword: statistical tools and methods

검색결과 209건 처리시간 0.058초

식스시그마 DMAIC 프로세스에서 모집단의 수와 데이터 종류에 따른 품질개선 기법의 오적용 유형 및 이해 (Understanding and Misuse Type of Quality Improvement Tools According to the Kind of Data and the Number of Population in DMAIC Process of Six Sigma)

  • 최성운
    • 대한안전경영과학회:학술대회논문집
    • /
    • 대한안전경영과학회 2010년도 춘계학술대회
    • /
    • pp.509-517
    • /
    • 2010
  • The paper proposes the misuse types of statistical quality tools according to the kind of data and the number of population in DMAIC process of six sigma. The result presented in this paper can be extended to the QC story 15 steps of QC circle. The study also provides the improvement methods about control chart, measurement system analysis, statistical difference, and practical equivalence.

  • PDF

A Comparison of Capabilities of Data Mining Tools

  • Choi, Youn-Seok;Kim, Jong-Geoun;Lee, Jong-Hee
    • Communications for Statistical Applications and Methods
    • /
    • 제8권2호
    • /
    • pp.531-541
    • /
    • 2001
  • In this study, we compare the capabilities of the data mining tools of the most updated version objectively and provide the useful information in which enterprises and universities chose them. In particular, we compare the SAS/Enterprise Miner 3.0, SPSS/Clementine 5.2 and IBM/Intelligent Miner 6.1 which are well known and easily gotten.

  • PDF

Zooming Statistics: Inference across scales

  • Hannig, Jan;Marron, J.S.;Riedi, R.H.
    • Journal of the Korean Statistical Society
    • /
    • 제30권2호
    • /
    • pp.327-345
    • /
    • 2001
  • New statistical methods are ended to analyzed data in a multi-scale way. Some multi-scale extensions of stand methods, including novel visualization using dynamic graphics are proposed. These tools are used to explore non-standard structure in internet traffic data.

  • PDF

Selecting the Number and Location of Knots for Presenting Densities

  • Ahn, JeongYong;Moon, Gill Sung;Han, Kyung Soo;Han, Beom Soo
    • Communications for Statistical Applications and Methods
    • /
    • 제11권3호
    • /
    • pp.609-617
    • /
    • 2004
  • To present graph of probability densities, many softwares and graphical tools use methods that link points or straight lines. However, the methods can't display exactly and smoothly the graph and are not efficient from the viewpoint of process time. One method to overcome these shortcomings is utilizing interpolation methods. In these methods, selecting the number and location of knots is an important factor. This article proposes an algorithm to select knots for graphically presenting densities and implements graph components based on the algorithm.

Variable Arrangement for Data Visualization

  • Huh, Moon Yul;Song, Kwang Ryeol
    • Communications for Statistical Applications and Methods
    • /
    • 제8권3호
    • /
    • pp.643-650
    • /
    • 2001
  • Some classical plots like scatterplot matrices and parallel coordinates are valuable tools for data visualization. These tools are extensively used in the modern data mining softwares to explore the inherent data structure, and hence to visually classify or cluster the database into appropriate groups. However, the interpretation of these plots are very sensitive to the arrangement of variables. In this work, we introduce two methods to arrange the variables for data visualization. First method is based on the work of Wegman (1999), and this is to arrange the variables using minimum distance among all the pairwise permutation of the variables. Second method is using the idea of principal components. We Investigate the effectiveness of these methods with parallel coordinates using real data sets, and show that each of the two proposed methods has its own strength from different aspects respectively.

  • PDF

데이터마이닝을 위한 동적 결정나무 (Dynamic Decision Tree for Data Mining)

  • 최병수;차운옥
    • Communications for Statistical Applications and Methods
    • /
    • 제16권6호
    • /
    • pp.959-969
    • /
    • 2009
  • 결정나무는 데이터마이닝에서 데이터를 분류하는 기법으로 가장 많이 사용되고 있으며, 데이터 탐색 소프트웨어 DAVIS에서는 동적 기능을 사용하여 데이터 시각화를 하는 것이 가능하다. 본 논문에서는 동적 데이터 분석의 기본 원리와 이를 결정나무에 적용하는 방법을 소개하고, 생성되는 동적 결정나무의 효율성과 유용성을 실제 데이터를 사용하여 분석한다.

Predictive analysis in insurance: An application of generalized linear mixed models

  • Rosy Oh;Nayoung Woo;Jae Keun Yoo;Jae Youn Ahn
    • Communications for Statistical Applications and Methods
    • /
    • 제30권5호
    • /
    • pp.437-451
    • /
    • 2023
  • Generalized linear models and generalized linear mixed models (GLMMs) are fundamental tools for predictive analyses. In insurance, GLMMs are particularly important, because they provide not only a tool for prediction but also a theoretical justification for setting premiums. Although thousands of resources are available for introducing GLMMs as a classical and fundamental tool in statistical analysis, few resources seem to be available for the insurance industry. This study targets insurance professionals already familiar with basic actuarial mathematics and explains GLMMs and their linkage with classical actuarial pricing tools, such as the Buhlmann premium method. Focus of the study is mainly on the modeling aspect of GLMMs and their application to pricing, while avoiding technical issues related to statistical estimation, which can be automatically handled by most statistical software.

A Comparison of Influence Diagnostics in Linear Mixed Models

  • Lee, Jang-Taek
    • Communications for Statistical Applications and Methods
    • /
    • 제10권1호
    • /
    • pp.125-134
    • /
    • 2003
  • Standard estimation methods for linear mixed models are sensitive to influential observations. However, tools and concepts for linear mixed model diagnostics are rudimentary until now and research is heavily demanded in linear mixed models. In this paper, we consider two diagnostics to evaluate the effects of individual observations in the estimation of fixed effects for linear mixed models. Those are Cook's distance and COVRATIO. Results of our limited simulation study suggest that the Cook's distance is not good statistical quantity in linear mixed models. Also calibration point for COVRATIO seems to be quite conservative.

Improving Bagging Predictors

  • Kim, Hyun-Joong;Chung, Dong-Jun
    • 한국통계학회:학술대회논문집
    • /
    • 한국통계학회 2005년도 추계 학술발표회 논문집
    • /
    • pp.141-146
    • /
    • 2005
  • Ensemble method has been known as one of the most powerful classification tools that can improve prediction accuracy. Ensemble method also has been understood as ‘perturb and combine’ strategy. Many studies have tried to develop ensemble methods by improving perturbation. In this paper, we propose two new ensemble methods that improve combining, based on the idea of pattern matching. In the experiment with simulation data and with real dataset, the proposed ensemble methods peformed better than bagging. The proposed ensemble methods give the most accurate prediction when the pruned tree was used as the base learner.

  • PDF

On-Line Analytical Processing and Research Problems for Statisticians

  • Ahn, JeongYong;Han, Kyung Soo
    • Communications for Statistical Applications and Methods
    • /
    • 제7권2호
    • /
    • pp.457-463
    • /
    • 2000
  • Recently, statistical analysis tools have been changed to the applications on the World Wide Web that access data stored in databases. On-line analytical processing(OLAP) is a class of technologies that give users statistical information with multidimensional views of data in databases. In this paper, we introduce the concept and requisites of OLAP system, and we propose some research issues.

  • PDF