• Title/Summary/Keyword: misclassification

Search Result 226, Processing Time 0.038 seconds

Study on the fast nearest-neighbor searching classifier using distance approximation (거리 근사를 이용하는 고속 최근 이웃 탐색 분류기에 관한 연구)

  • 이일완;채수익
    • Journal of the Korean Institute of Telematics and Electronics C
    • /
    • v.34C no.2
    • /
    • pp.71-79
    • /
    • 1997
  • In this paper, we propose a new nearest-neighbor classifier with reduced computational complexity in search process. In the proposed classifier, the classes are divided into two sets: reference and non-reference sets. It reduces computational requriement by approximating the distance between the input and a class iwth the information of distances among the calsses. It calculates only the distance between the input and the reference classes. We convert a given classifier into RCC (reduced computational complexity but smal lincrease in misclassification probability of its corresponding RCC classifier. We designed RCC classifiers for the recognition of digits from the NIST database. We obtained an RCC classifier with 60% reduction in the computational complexity with the cost of 0.5% increase in misclassification probability.

  • PDF

Modifying linearly non-separable support vector machine binary classifier to account for the centroid mean vector

  • Mubarak Al-Shukeili;Ronald Wesonga
    • Communications for Statistical Applications and Methods
    • /
    • v.30 no.3
    • /
    • pp.245-258
    • /
    • 2023
  • This study proposes a modification to the objective function of the support vector machine for the linearly non-separable case of a binary classifier yi ∈ {-1, 1}. The modification takes into account the position of each data item xi from its corresponding class centroid. The resulting optimization function involves the centroid mean vector, and the spread of data besides the support vectors, which should be minimized by the choice of hyper-plane β. Theoretical assumptions have been tested to derive an optimal separable hyperplane that yields the minimal misclassification rate. The proposed method has been evaluated using simulation studies and real-life COVID-19 patient outcome hospitalization data. Results show that the proposed method performs better than the classical linear SVM classifier as the sample size increases and is preferred in the presence of correlations among predictors as well as among extreme values.

A Study on Modulation Classification of PSK Signals Based on Statistical Moments (통계적 모먼트에 의한 PSK 신호의 변조분류에 관한 연구)

  • 이원철;한영열
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.19 no.6
    • /
    • pp.1004-1015
    • /
    • 1994
  • Modulation type classifier based on statistical moments has been successfully employed to classify PSK signals. Previously, the classifier developed utilizes the statistical moment of samples of the received signal phase, which may be difficult to extract from received signal. In this paper we propose a new moments-based classifier to classify PSK signals by using the moments of the demodulated signal for PSK. THe demodulated signal can be easily extracted from the conventional demodulation of PSK. The evaluation of the performance of the proposed classifier for PSK signals has been investigated in additive white Gaussian noise environment using the exact distribution of the demodulated signal. The performances of classifier in terms of probability of misclassification were evaluated. We found that the coherent system classifier gave 4dB improvement for BPSK and 3dB for QPSK over noncoherent system classifier, when the probability of misclassification is 10 and m equals to 4.

  • PDF

A Study on Aggregate Waste Separation Efficiency Using Adsorption System with Rotating Separation Net (회전분리망 흡착선별기의 순환 굵은골재 이물질 제거효율에 관한 연구)

  • Cho, Sungkwang;Kim, Gyuyong;Kim, Kyungwuk;Seon, Sangwon;Park, Jinyoung
    • Journal of the Korean Recycled Construction Resources Institute
    • /
    • v.9 no.1
    • /
    • pp.85-91
    • /
    • 2021
  • Aggregate waste separator with rotating separating net was designed for applying classification process of construction waste. In order to evaluate the performance of the aggregate waste separator, according to the type of waste, standardized waste samples are prepared using acrylic. The appropriate operating point was evaluated by the classification efficiency and misclassification rate of recycled aggregate according to the control frequency of the blower operating and inlet position of the separating net. The classification efficiency at the operating point of the aggregate waste separator was evaluated through flow analysis assuming recycled aggregate and waste sample as particles. As a result of the performance test, when the distance. between the conveyor belt and the inlet was 0.2m, the classification efficiency was 95%, but the misclassification rate of recycled aggregate was 2% or more, which satisfies the classification efficiency and the misclassification rate of less than 2%. The operating point was shown at a control frequency of 58Hz at a suction distance of 0.254m. As a resu lt of flow analysis, there was no misclassification of recycled aggregate. In order to redu ce constru ction waste in the existing recycled aggregate production process, adsorption system using a rotating separating net that can be operated as an installation type was built.

Bivariate ROC Curve and Optimal Classification Function

  • Hong, C.S.;Jeong, J.A.
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.4
    • /
    • pp.629-638
    • /
    • 2012
  • We propose some methods to obtain optimal thresholds and classification functions by using various cutoff criterion based on the bivariate ROC curve that represents bivariate cumulative distribution functions. The false positive rate and false negative rate are calculated with these classification functions for bivariate normal distributions.

Biomarkers available in workplaces

  • Maeng, Eung-Hee
    • Proceedings of the Korean Society of Toxicology Conference
    • /
    • 2003.05a
    • /
    • pp.31-34
    • /
    • 2003
  • The monitoring of genotoxic effect or oxidative DNA damage in workers exposed to hazardous materials is increasingly applied for hazard identification or risk assessment purposes in workplaces. The current generation of biomarkers has the potential to allow for the earlier detection of occupational disease, for the reduction of misclassification of exposure and outcome. (omitted)

  • PDF

유전자 알고리즘을 활용한 데이터 불균형 해소 기법의 조합적 활용

  • Jang, Yeong-Sik;Kim, Jong-U;Heo, Jun
    • Proceedings of the Korea Inteligent Information System Society Conference
    • /
    • 2007.05a
    • /
    • pp.309-320
    • /
    • 2007
  • The data imbalance problem which can be uncounted in data mining classification problems typically means that there are more or less instances in a class than those in other classes. It causes low prediction accuracy of the minority class because classifiers tend to assign instances to major classes and ignore the minor class to reduce overall misclassification rate. In order to solve the data imbalance problem, there has been proposed a number of techniques based on resampling with replacement, adjusting decision thresholds, and adjusting the cost of the different classes. In this paper, we study the feasibility of the combination usage of the techniques previously proposed to deal with the data imbalance problem, and suggest a combination method using genetic algorithm to find the optimal combination ratio of the techniques. To improve the prediction accuracy of a minority class, we determine the combination ratio based on the F-value of the minority class as the fitness function of genetic algorithm. To compare the performance with those of single techniques and the matrix-style combination of random percentage, we performed experiments using four public datasets which has been generally used to compare the performance of methods for the data imbalance problem. From the results of experiments, we can find the usefulness of the proposed method.

  • PDF

Prediction Approaches of Personal Exposure from Ambient Air Pollution Using Spatial Analysis: A Pilot Study Using Ulsan Cohort Data (공간분석 기법을 이용한 대기오염 개인노출추정 방안 소개 및 적용의 사례)

  • Son, Ji-Young;Kim, Yoon-Shin;Cho, Yong-Sung;Lee, Jong-Tae
    • Journal of Korean Society for Atmospheric Environment
    • /
    • v.25 no.4
    • /
    • pp.339-346
    • /
    • 2009
  • The objectives of this study were to introduce spatial interpolation methods which have been applied in recent papers, to apply three methods (nearest monitor, inverse distance weighting, kriging) to domestic data (Ulsan cohort) as an example of estimating the personal exposure levels. We predicted the personal exposure estimates of 2,102 participants in Ulsan cohort using spatial interpolation methods based on information of their residential address. We found that there was a similar tendency among the estimates of each method. The correlation coefficients between predictions from pairs of interpolation methods (except for the correlation coefficient between nearest montitor and kriging of CO and $SO_2$) were generally high (r=0.84 to 0.96). Even if there are some limitations such as location and density of monitoring station, spatial interpolation methods can reflect spatial aspects of air pollutant and spatial heterogeneity in individual level so that they provide more accurate estimates than monitor data alone. But they may still result in misclassification of exposure. To minimize misclassification for better estimates, we need to consider individual characteristics such as daily activity pattern.