• Title/Summary/Keyword: Statistical classification

Search Result 1,432, Processing Time 0.03 seconds

A study of constitution diagnosis using decision tree method (의사결정나무법을 이용한 체질진단에 관한 연구)

  • Lee, Yong-Seop;Park, Seong-Sik;Park, Eun-Kyung
    • Journal of Sasang Constitutional Medicine
    • /
    • v.13 no.2
    • /
    • pp.144-155
    • /
    • 2001
  • By the increasing concern about Sasang Constitution Medicine, its practical use is considered very important in disease prevention and medical treatment. However, the method of constitution classification is depending on the doctor's clinical trials because of the lack of the objective test criteria. This study is trying to improve the objectiveness of diagnosis using a new statistical method, decision tree. Decision tree method-a classification technique in the statistical analysis- was used to analyze the result of QSCCII instead of using discriminant analysis. As a result, 16 among 121 QSCCII questions was selected as important questions and 21 terminal nodes was built to classify the constitution. Using only 16 questions shown in the result of decision tree, we can diagnose and interpret the constitution easily and effectively.

  • PDF

Classification of Welding Defects in Austenitic Stainless Steel by Neural Pattern Recognition of Ultrasonic Signal (초음파신호의 신경망 형상인식법을 이용한 오스테나이트 스테인레스강의 용접부결함 분류에 관한 연구)

  • Lee, Gang-Yong;Kim, Jun-Seop
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.20 no.4
    • /
    • pp.1309-1319
    • /
    • 1996
  • The research for the classification of the natural defects in welding zone is performd using the neuro-pattern recognition technology. The signal pattern recognition package including the user's defined function is developed to perform the digital signal processing, feature extraction, feature selection and classifier selection, The neural network classifier and the statistical classifiers such as the linear discriminant function classifier and the empirical Bayesian calssifier are compared and discussed. The neuro-pattern recognition technique is applied to the classificaiton of such natural defects as root crack, incomplete penetration, lack of fusion, slag inclusion, porosity, etc. If appropriately learned, the neural network classifier is concluded to be better than the statistical classifiers in the classification of the natural welding defects.

Motion classification using distributional features of 3D skeleton data

  • Woohyun Kim;Daeun Kim;Kyoung Shin Park;Sungim Lee
    • Communications for Statistical Applications and Methods
    • /
    • v.30 no.6
    • /
    • pp.551-560
    • /
    • 2023
  • Recently, there has been significant research into the recognition of human activities using three-dimensional sequential skeleton data captured by the Kinect depth sensor. Many of these studies employ deep learning models. This study introduces a novel feature selection method for this data and analyzes it using machine learning models. Due to the high-dimensional nature of the original Kinect data, effective feature extraction methods are required to address the classification challenge. In this research, we propose using the first four moments as predictors to represent the distribution of joint sequences and evaluate their effectiveness using two datasets: The exergame dataset, consisting of three activities, and the MSR daily activity dataset, composed of ten activities. The results show that the accuracy of our approach outperforms existing methods on average across different classifiers.

Genetic classification of various familial relationships using the stacking ensemble machine learning approaches

  • Su Jin Jeong;Hyo-Jung Lee;Soong Deok Lee;Ji Eun Park;Jae Won Lee
    • Communications for Statistical Applications and Methods
    • /
    • v.31 no.3
    • /
    • pp.279-289
    • /
    • 2024
  • Familial searching is a useful technique in a forensic investigation. Using genetic information, it is possible to identify individuals, determine familial relationships, and obtain racial/ethnic information. The total number of shared alleles (TNSA) and likelihood ratio (LR) methods have traditionally been used, and novel data-mining classification methods have recently been applied here as well. However, it is difficult to apply these methods to identify familial relationships above the third degree (e.g., uncle-nephew and first cousins). Therefore, we propose to apply a stacking ensemble machine learning algorithm to improve the accuracy of familial relationship identification. Using real data analysis, we obtain superior relationship identification results when applying meta-classifiers with a stacking algorithm rather than applying traditional TNSA or LR methods and data mining techniques.

AUTOMATED ELECTROFACIES DETERMINATION USING MULTIVARIATE STATISTICAL ANALYSIS

  • Kim Jungwhan;Lim Jong-Se
    • 한국석유지질학회:학술대회논문집
    • /
    • spring
    • /
    • pp.10-14
    • /
    • 1998
  • A systematic methodology is developed for the electrofacies determination from wireline log data using multivariate statistical analysis. To consider corresponding contribution of each log and reduce the computational dimension, multivariate logs are transformed into a single variable through principal components analysis. Resultant principal components logs are segmented using the statistical zonation method to enhance the efficiency and quality of the interpreted results. Hierarchical cluster analysis is then used to group the segments into electrofacies. Optimal number of groups is determined on the basis of the ratio of within-group variance to total variance and core data. This technique is applied to the wells in the Korea Continental Shelf. The results of field application demonstrate that the prediction of lithology based on the electrofacies classification matches well to the core and the cutting data with high reliability This methodology for electrofacies classification can be used to define the reservoir characteristics which are helpful to the reservoir management.

  • PDF

Sparse Multinomial Kernel Logistic Regression

  • Shim, Joo-Yong;Bae, Jong-Sig;Hwang, Chang-Ha
    • Communications for Statistical Applications and Methods
    • /
    • v.15 no.1
    • /
    • pp.43-50
    • /
    • 2008
  • Multinomial logistic regression is a well known multiclass classification method in the field of statistical learning. More recently, the development of sparse multinomial logistic regression model has found application in microarray classification, where explicit identification of the most informative observations is of value. In this paper, we propose a sparse multinomial kernel logistic regression model, in which the sparsity arises from the use of a Laplacian prior and a fast exact algorithm is derived by employing a bound optimization approach. Experimental results are then presented to indicate the performance of the proposed procedure.

A Wrist-Type Fall Detector with Statistical Classifier for the Elderly Care

  • Park, Chan-Kyu;Kim, Jae-Hong;Sohn, Joo-Chan;Choi, Ho-Jin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.5 no.10
    • /
    • pp.1751-1768
    • /
    • 2011
  • Falls are one of the most concerned accidents for elderly people and often result in serious physical and psychological consequences. Many researchers have studied fall detection techniques in various domain, however none released to a commercial product satisfying user requirements. We present a systematic modeling and evaluating procedure for best classification performance and then do experiments for comparing the performance of six procedures to get a statistical classifier based wrist-type fall detector to prevent dangerous consequences from falls. Even though the wrist may be the most difficult measurement location on the body to discern a fall event, the proposed feature deduction process and fall classification procedures shows positive results by using data sets of fall and general activity as two classes.

Robust inference with order constraint in microarray study

  • Kang, Joonsung
    • Communications for Statistical Applications and Methods
    • /
    • v.25 no.5
    • /
    • pp.559-568
    • /
    • 2018
  • Gene classification can involve complex order-restricted inference. Examining gene expression pattern across groups with order-restriction makes standard statistical inference ineffective and thus, requires different methods. For this problem, Roy's union-intersection principle has some merit. The M-estimator adjusting for outlier arrays in a microarray study produces a robust test statistic with distribution-insensitive clustering of genes. The M-estimator in conjunction with a union-intersection principle provides a nonstandard robust procedure. By exact permutation distribution theory, a conditionally distribution-free test based on the proposed test statistic generates corresponding p-values in a small sample size setup. We apply a false discovery rate (FDR) as a multiple testing procedure to p-values in simulated data and real microarray data. FDR procedure for proposed test statistics controls the FDR at all levels of ${\alpha}$ and ${\pi}_0$ (the proportion of true null); however, the FDR procedure for test statistics based upon normal theory (ANOVA) fails to control FDR.

On Nonparametric Estimation of Data Edges

  • Park, Byeong U.
    • Journal of the Korean Statistical Society
    • /
    • v.30 no.2
    • /
    • pp.265-280
    • /
    • 2001
  • Estimation of the edge of a distribution has many important applications. It is related to classification, cluster analysis, neural network, and statistical image recovering. The problem also arises in measuring production efficiency in economic systems. Three most promising nonparametric estimators in the existing literature are introduced. Their statistical properties are provided, some of which are new. Themes of future study are also discussed.

  • PDF

Support Vector Machine for Linear Regression

  • Hwang, Changha;Seok, Kyungha
    • Communications for Statistical Applications and Methods
    • /
    • v.6 no.2
    • /
    • pp.337-344
    • /
    • 1999
  • Support vector machine(SVM) is a new and very promising regression and classification technique developed by Vapnik and his group at AT&T Bell laboratories. This article provides a brief overview of SVM focusing on linear regression. We explain from statistical point of view why SVM might be attractive and how this could be compared with other linear regression techniques. Furthermore. we explain model selection based on VC-theory.

  • PDF