• Title/Summary/Keyword: K means clustering

Search Result 1,118, Processing Time 0.03 seconds

Fault Diagnosis for Rotating Machine Using Feature Extraction and Minimum Detection Error Algorithm (특징 추출과 검출 오차 최소화 알고리듬을 이용한 회전기계의 결함 진단)

  • Chong, Ui-pil;Cho, Sang-jin;Lee, Jae-yeal
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.16 no.1 s.106
    • /
    • pp.27-33
    • /
    • 2006
  • Fault diagnosis and condition monitoring for rotating machines are important for efficiency and accident prevention. The process of fault diagnosis is to extract the feature of signals and to classify each state. Conventionally, fault diagnosis has been developed by combining signal processing techniques for spectral analysis and pattern recognition, however these methods are not able to diagnose correctly for certain rotating machines and some faulty phenomena. In this paper, we add a minimum detection error algorithm to the previous method to reduce detection error rate. Vibration signals of the induction motor are measured and divided into subband signals. Each subband signal is processed to obtain the RMS, standard deviation and the statistic data for constructing the feature extraction vectors. We make a study of the fault diagnosis system that the feature extraction vectors are applied to K-means clustering algorithm and minimum detection error algorithm.

VAD By Neural Network Under Wireless Communication Systems (Neural Network을 이용한 무선 통신시스템에서의 VAD)

  • Lee Hosun;Kim Sukyung;Park Sung-Kwon
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.30 no.12C
    • /
    • pp.1262-1267
    • /
    • 2005
  • Elliptical basis function (EBF) neural network works stably under high-level background noise environment and makes the nonlinear processing possible. It can be adapted real time VAD with simple design. This paper introduces VAD implementation using EBF and the experimental results show that EBF VAD outperforms G729 Annex B and RBF neural networks. The best error rates achieved by the EBF networks were improved more than $70\%$ in speech and $50\%$ in silence while that achieved by G.729 Annex B and RBF networks respectively.

Prediction System Design based on An Interval Type-2 Fuzzy Logic System using HCBKA (HCBKA를 이용한 Interval Type-2 퍼지 논리시스템 기반 예측 시스템 설계)

  • Bang, Young-Keun;Lee, Chul-Heui
    • Journal of Industrial Technology
    • /
    • v.30 no.A
    • /
    • pp.111-117
    • /
    • 2010
  • To improve the performance of the prediction system, the system should reflect well the uncertainty of nonlinear data. Thus, this paper presents multiple prediction systems based on Type-2 fuzzy sets. To construct each prediction system, an Interval Type-2 TSK Fuzzy Logic System and difference data were used, because, in general, it has been known that the Type-2 Fuzzy Logic System can deal with the uncertainty of nonlinear data better than the Type-1 Fuzzy Logic System, and the difference data can provide more steady information than that of original data. Also, to improve each rule base of the fuzzy prediction systems, the HCBKA (Hierarchical Correlation Based K-means clustering Algorithm) was applied because it can consider correlationship and statistical characteristics between data at a time. Subsequently, to alleviate complexity of the proposed prediction system, a system selection method was used. Finally, this paper analyzed and compared the performances between the Type-1 prediction system and the Interval Type-2 prediction system using simulations of three typical time series examples.

  • PDF

Voice Activity Detection Algorithm base on Radial Basis Function Networks with Dual Threshold (Radial Basis Function Networks를 이용한 이중 임계값 방식의 음성구간 검출기)

  • Kim Hong lk;Park Sung Kwon
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.29 no.12C
    • /
    • pp.1660-1668
    • /
    • 2004
  • This paper proposes a Voice Activity Detection (VAD) algorithm based on Radial Basis Function (RBF) network using dual threshold. The k-means clustering and Least Mean Square (LMS) algorithm are used to upade the RBF network to the underlying speech condition. The inputs for RBF are the three parameters in a Code Exited Linear Prediction (CELP) coder, which works stably under various background noise levels. Dual hangover threshold applies in BRF-VAD for reducing error, because threshold value has trade off effect in VAD decision. The experimental result show that the proposed VAD algorithm achieves better performance than G.729 Annex B at any noise level.

Analysis of the Types of Dementia Patients for Development of Clothes for Dementia Patient in Nursing Homes (요양시설 치매환자복 디자인 개발을 위한 치매환자의 유형 분석)

  • Park, Kwang Ae;Yang, Chung Eun;Lee, Jae Hyang;Kim, Hee-Jung
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.45 no.5
    • /
    • pp.788-803
    • /
    • 2021
  • This study aims to obtain basic data to develop clothes for dementia patients by classifying types of dementia patients. Data was collected from those dementia patients who entered a nursing home. This study analyzed a total of 221 sheets. Furthermore, descriptive statistics, cross-tabulation, and K-means clustering were performed for statistical processing using Minitab 14. As a result, dementia patients were classified into four types: inactive-dependent, active-problematic behavior, activity-autonomy, and inactive-offensive. Inactive-dependent type was a group with the most severe disability in cognitive and daily activity functions; however, they lacked behavioral and psychological symptoms and problematic behavior with clothes. Active-problematic behavior type showed the most behavioral and psychological problems and problematic behavior with clothes. Activity-autonomy type was a group without any problematic behaviors. Moreover, the inactive-offensive type had very good cognitive function toward humans. The study imply that it is necessary to provide clothes with proper functions based on the types of patients rather than providing them uniform clothes because clinical and clothes behaviors differ significantly depending on the types of dementia patients.

A Implementation of Optimal Multiple Classification System using Data Mining for Genome Analysis

  • Jeong, Yu-Jeong;Choi, Gwang-Mi
    • Journal of the Korea Society of Computer and Information
    • /
    • v.23 no.12
    • /
    • pp.43-48
    • /
    • 2018
  • In this paper, more efficient classification result could be obtained by applying the combination of the Hidden Markov Model and SVM Model to HMSV algorithm gene expression data which simulated the stochastic flow of gene data and clustering it. In this paper, we verified the HMSV algorithm that combines independently learned algorithms. To prove that this paper is superior to other papers, we tested the sensitivity and specificity of the most commonly used classification criteria. As a result, the K-means is 71% and the SOM is 68%. The proposed HMSV algorithm is 85%. These results are stable and high. It can be seen that this is better classified than using a general classification algorithm. The algorithm proposed in this paper is a stochastic modeling of the generation process of the characteristics included in the signal, and a good recognition rate can be obtained with a small amount of calculation, so it will be useful to study the relationship with diseases by showing fast and effective performance improvement with an algorithm that clusters nodes by simulating the stochastic flow of Gene Data through data mining of BigData.

EDGE: An Enticing Deceptive-content GEnerator as Defensive Deception

  • Li, Huanruo;Guo, Yunfei;Huo, Shumin;Ding, Yuehang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.5
    • /
    • pp.1891-1908
    • /
    • 2021
  • Cyber deception defense mitigates Advanced Persistent Threats (APTs) with deploying deceptive entities, such as the Honeyfile. The Honeyfile distracts attackers from valuable digital documents and attracts unauthorized access by deliberately exposing fake content. The effectiveness of distraction and trap lies in the enticement of fake content. However, existing studies on the Honeyfile focus less on this perspective. In this work, we seek to improve the enticement of fake text content through enhancing its readability, indistinguishability, and believability. Hence, an enticing deceptive-content generator, EDGE, is presented. The EDGE is constructed with three steps: extracting key concepts with a semantics-aware K-means clustering algorithm, searching for candidate deceptive concepts within the Word2Vec model, and generating deceptive text content under the Integrated Readability Index (IR). Furthermore, the readability and believability performance analyses are undertaken. The experimental results show that EDGE generates indistinguishable deceptive text content without decreasing readability. In all, EDGE proves effective to generate enticing deceptive text content as deception defense against APTs.

Morphological Characterization of small, dumpy, and long Phenotypes in Caenorhabditis elegans

  • Cho, Joshua Young;Choi, Tae-Woo;Kim, Seung Hyun;Ahnn, Joohong;Lee, Sun-Kyung
    • Molecules and Cells
    • /
    • v.44 no.3
    • /
    • pp.160-167
    • /
    • 2021
  • The determinant factors of an organism's size during animal development have been explored from various angles but remain partially understood. In Caenorhabditis elegans, many genes affecting cuticle structure, cell growth, and proliferation have been identified to regulate the worm's overall morphology, including body size. While various mutations in those genes directly result in changes in the morphological phenotypes, there is still a need for established, clear, and distinct standards to determine the apparent abnormality in a worm's size and shape. In this study, we measured the body length, body width, terminal bulb length, and head size of mutant worms with reported Dumpy (Dpy), Small (Sma) or Long (Lon) phenotypes by plotting and comparing their respective ratios of various parameters. These results show that the Sma phenotypes are proportionally smaller overall with mild stoutness, and Dpy phenotypes are significantly stouter and have disproportionally small head size. This study provides a standard platform for determining morphological phenotypes designating and annotating mutants that exhibit body shape variations, defining the morphological phenotype of previously unexamined mutants.

Method for Estimating Intramuscular Fat Percentage of Hanwoo(Korean Traditional Cattle) Using Convolutional Neural Networks in Ultrasound Images

  • Kim, Sang Hyun
    • International journal of advanced smart convergence
    • /
    • v.10 no.1
    • /
    • pp.105-116
    • /
    • 2021
  • In order to preserve the seeds of excellent Hanwoo(Korean traditional cattle) and secure quality competitiveness in the infinite competition with foreign imported beef, production of high-quality Hanwoo beef is absolutely necessary. %IMF (Intramuscular Fat Percentage) is one of the most important factors in evaluating the value of high-quality meat, although standards vary according to food culture and industrial conditions by country. Therefore, it is required to develop a %IMF estimation algorithm suitable for Hanwoo. In this study, we proposed a method of estimating %IMF of Hanwoo using CNN in ultrasound images. First, the proposed method classified the chemically measured %IMF into 10 classes using k-means clustering method to apply CNN. Next, ROI images were obtained at regular intervals from each ultrasound image and used for CNN training and estimation. The proposed CNN model is composed of three stages of convolution layer and fully connected layer. As a result of the experiment, it was confirmed that the %IMF of Hanwoo was estimated with an accuracy of 98.2%. The correlation coefficient between the estimated %IMF and the real %IMF by the proposed method is 0.97, which is about 10% better than the 0.88 of the previous method.

Machine Learning based Optimal Location Modeling for Children's Smart Pedestrian Crosswalk: A Case Study of Changwon-si (머신러닝을 활용한 어린이 스마트 횡단보도 최적입지 선정 - 창원시 사례를 중심으로 -)

  • Lee, Suhyeon;Suh, Youngwon;Kim, Sein;Lee, Jaekyung;Yun, Wonjoo
    • Journal of KIBIM
    • /
    • v.12 no.2
    • /
    • pp.1-11
    • /
    • 2022
  • Road traffic accidents (RTAs) are the leading cause of accidental death among children. RTA reduction is becoming an increasingly important social issue among children. Municipalities aim to resolve this issue by introducing "Smart Pedestrian Crosswalks" that help prevent traffic accidents near children's facilities. Nonetheless such facilities tend to be installed in relatively limited number of areas, such as the school zone. In order for budget allocation to be efficient and policy effects maximized, optimal location selection based on machine learning is needed. In this paper, we employ machine learning models to select the optimal locations for smart pedestrian crosswalks to reduce the RTAs of children. This study develops an optimal location index using variable importance measures. By using k-means clustering method, the authors classified the crosswalks into three types after the optimal location selection. This study has broadened the scope of research in relation to smart crosswalks and traffic safety. Also, the study serves as a unique contribution by integrating policy design decisions based on public and open data.