• 제목/요약/키워드: k-means algorithms

검색결과 402건 처리시간 0.023초

강화된 유전알고리즘을 이용한 이중 동조 기반 퍼지 예측시스템 설계 및 응용 (Design of Fuzzy Prediction System based on Dual Tuning using Enhanced Genetic Algorithms)

  • 방영근;이철희
    • 전기학회논문지
    • /
    • 제59권1호
    • /
    • pp.184-191
    • /
    • 2010
  • Many researchers have been considering genetic algorithms to system optimization problems. Especially, real-coded genetic algorithms are very effective techniques because they are simpler in coding procedures than binary-coded genetic algorithms and can reduce extra works that increase the length of chromosome for wide search space. Thus, this paper presents a fuzzy system design technique to improve the performance of the fuzzy system. The proposed system consists of two procedures. The primary tuning procedure coarsely tunes fuzzy sets of the system using the k-means clustering algorithm of which the structure is very simple, and then the secondary tuning procedure finely tunes the fuzzy sets using enhanced real-coded genetic algorithms based on the primary procedure. In addition, this paper constructs multiple fuzzy systems using a data preprocessing procedure which is contrived for reflecting various characteristics of nonlinear data. Finally, the proposed fuzzy system is applied to the field of time series prediction and the effectiveness of the proposed techniques are verified by simulations of typical time series examples.

K-평균 군집화 기반 WSN에서 클러스터 헤드 선택 방법 제안 (Proposal of Cluster Head Election Method in K-means Clustering based WSN)

  • 윤대열;박세영;황치곤
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국정보통신학회 2021년도 춘계학술대회
    • /
    • pp.447-449
    • /
    • 2021
  • 에너지 소비를 최소화하여 네트워크를 오랫동안 유지하기 위해 다양한 무선 센서 네트워크 프로토콜이 제안되었다. K-평균 군집화 알고리즘을 사용하면 최종 군집이 설정될 때까지 중심점을 반복적으로 이동해야 하기 때문에 기존 계층형 알고리즘보다 군집화에 시간이 더 오래 걸린다. K-평균 클러스터링 기반 프로토콜의 경우 클러스터 헤드가 선택되었을 때 클러스터 중심점 근처의 노드 또는 노드의 잔류 에너지만 고려된다. 본 논문에서는 앞서 언급한 문제를 개선하면서 에너지 효율을 개선하기 위해 K-평균 클러스터링을 기반으로 하는 새로운 무선 센서 네트워크 프로토콜을 제안한다.

  • PDF

Nonlinear damage detection using linear ARMA models with classification algorithms

  • Chen, Liujie;Yu, Ling;Fu, Jiyang;Ng, Ching-Tai
    • Smart Structures and Systems
    • /
    • 제26권1호
    • /
    • pp.23-33
    • /
    • 2020
  • Majority of the damage in engineering structures is nonlinear. Damage sensitive features (DSFs) extracted by traditional methods from linear time series models cannot effectively handle nonlinearity induced by structural damage. A new DSF is proposed based on vector space cosine similarity (VSCS), which combines K-means cluster analysis and Bayesian discrimination to detect nonlinear structural damage. A reference autoregressive moving average (ARMA) model is built based on measured acceleration data. This study first considers an existing DSF, residual standard deviation (RSD). The DSF is further advanced using the VSCS, and then the advanced VSCS is classified using K-means cluster analysis and Bayes discriminant analysis, respectively. The performance of the proposed approach is then verified using experimental data from a three-story shear building structure, and compared with the results of existing RSD. It is demonstrated that combining the linear ARMA model and the advanced VSCS, with cluster analysis and Bayes discriminant analysis, respectively, is an effective approach for detection of nonlinear damage. This approach improves the reliability and accuracy of the nonlinear damage detection using the linear model and significantly reduces the computational cost. The results indicate that the proposed approach is potential to be a promising damage detection technique.

데이터와 적용되는 알고리즘의 연관성을 이용한 클러스터링 기법 (Clustering Technique Using Relevance of Data and Applied Algorithms)

  • 한우연;남미영;이필규
    • 정보처리학회논문지B
    • /
    • 제12B권5호
    • /
    • pp.577-586
    • /
    • 2005
  • 영상 처리와 패턴 인식 그리고 컴퓨터 비젼 분야의 가장 성공적인 응용들 중 하나인 얼굴 인식을 위해 많은 알고리즘이 제안되었고, 최근에는 얼굴의 어떤 속성이 대상을 인식하는 것을 더 쉽거나 어렵게 만드는지에 대한 연구가 진행되고 있다. 본 논문에서는 얼굴의 속성(조명, 표정)에 따라 각각의 알고리즘의 인식 성능이 달라지는 점에 착안해서, 얼굴 데이터와 적용된 알고리즘과의 연관성을 이용하여 인식 성능을 높이는 클러스터링 방법을 제안하였다. 실험에서는 인식 알고리즘으로 n-tuple, PCA 그리고 가보 웨이블릿이 사용되었고, 세 가지 벡터화 방법이 제안되었다. 우선 학습 데이터를 k-means 알고리즘을 이용하여 클러스터링하고 각각의 클러스터에 대한 세 가지 인식 알고리즘의 적합도를 평가한 후, 같은 알고리즘을 선택한 클러스터들을 통합하여 새로운 클러스터를 구성한다. 그리고 테스트 데이터에서 새로운 클러스터에 대한 유사도를 평가하여 가장 가까운 클러스터가 선택한 알고리즘으로 인식을 수행한다. 그 결과 클러스터링 과정을 거치지 않고 단일 알고리즘을 사용하여 인식했을 때보다 인식 성능이 향상된 것을 관찰할 수 있다.

A Classification Algorithm Based on Data Clustering and Data Reduction for Intrusion Detection System over Big Data

  • Wang, Qiuhua;Ouyang, Xiaoqin;Zhan, Jiacheng
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제13권7호
    • /
    • pp.3714-3732
    • /
    • 2019
  • With the rapid development of network, Intrusion Detection System(IDS) plays a more and more important role in network applications. Many data mining algorithms are used to build IDS. However, due to the advent of big data era, massive data are generated. When dealing with large-scale data sets, most data mining algorithms suffer from a high computational burden which makes IDS much less efficient. To build an efficient IDS over big data, we propose a classification algorithm based on data clustering and data reduction. In the training stage, the training data are divided into clusters with similar size by Mini Batch K-Means algorithm, meanwhile, the center of each cluster is used as its index. Then, we select representative instances for each cluster to perform the task of data reduction and use the clusters that consist of representative instances to build a K-Nearest Neighbor(KNN) detection model. In the detection stage, we sort clusters according to the distances between the test sample and cluster indexes, and obtain k nearest clusters where we find k nearest neighbors. Experimental results show that searching neighbors by cluster indexes reduces the computational complexity significantly, and classification with reduced data of representative instances not only improves the efficiency, but also maintains high accuracy.

Genetically Optimized Fuzzy Polynomial Neural Network and Its Application to Multi-variable Software Process

  • Lee In-Tae;Oh Sung-Kwun;Kim Hyun-Ki;Pedrycz Witold
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제6권1호
    • /
    • pp.33-38
    • /
    • 2006
  • In this paper, we propose a new architecture of Fuzzy Polynomial Neural Networks(FPNN) by means of genetically optimized Fuzzy Polynomial Neuron(FPN) and discuss its comprehensive design methodology involving mechanisms of genetic optimization, especially Genetic Algorithms(GAs). The conventional FPNN developed so far are based on mechanisms of self-organization and evolutionary optimization. The design of the network exploits the extended Group Method of Data Handling(GMDH) with some essential parameters of the network being provided by the designer and kept fixed throughout the overall development process. This restriction may hamper a possibility of producing an optimal architecture of the model. The proposed FPNN gives rise to a structurally optimized network and comes with a substantial level of flexibility in comparison to the one we encounter in conventional FPNNs. It is shown that the proposed advanced genetic algorithms based Fuzzy Polynomial Neural Networks is more useful and effective than the existing models for nonlinear process. We experimented with Medical Imaging System(MIS) dataset to evaluate the performance of the proposed model.

Clustering Algorithm for Time Series with Similar Shapes

  • Ahn, Jungyu;Lee, Ju-Hong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제12권7호
    • /
    • pp.3112-3127
    • /
    • 2018
  • Since time series clustering is performed without prior information, it is used for exploratory data analysis. In particular, clusters of time series with similar shapes can be used in various fields, such as business, medicine, finance, and communications. However, existing time series clustering algorithms have a problem in that time series with different shapes are included in the clusters. The reason for such a problem is that the existing algorithms do not consider the limitations on the size of the generated clusters, and use a dimension reduction method in which the information loss is large. In this paper, we propose a method to alleviate the disadvantages of existing methods and to find a better quality of cluster containing similarly shaped time series. In the data preprocessing step, we normalize the time series using z-transformation. Then, we use piecewise aggregate approximation (PAA) to reduce the dimension of the time series. In the clustering step, we use density-based spatial clustering of applications with noise (DBSCAN) to create a precluster. We then use a modified K-means algorithm to refine the preclusters containing differently shaped time series into subclusters containing only similarly shaped time series. In our experiments, our method showed better results than the existing method.

Colorectal Cancer Staging Using Three Clustering Methods Based on Preoperative Clinical Findings

  • Pourahmad, Saeedeh;Pourhashemi, Soudabeh;Mohammadianpanah, Mohammad
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제17권2호
    • /
    • pp.823-827
    • /
    • 2016
  • Determination of the colorectal cancer stage is possible only after surgery based on pathology results. However, sometimes this may prove impossible. The aim of the present study was to determine colorectal cancer stage using three clustering methods based on preoperative clinical findings. All patients referred to the Colorectal Research Center of Shiraz University of Medical Sciences for colorectal cancer surgery during 2006 to 2014 were enrolled in the study. Accordingly, 117 cases participated. Three clustering algorithms were utilized including k-means, hierarchical and fuzzy c-means clustering methods. External validity measures such as sensitivity, specificity and accuracy were used for evaluation of the methods. The results revealed maximum accuracy and sensitivity values for the hierarchical and a maximum specificity value for the fuzzy c-means clustering methods. Furthermore, according to the internal validity measures for the present data set, the optimal number of clusters was two (silhouette coefficient) and the fuzzy c-means algorithm was more appropriate than the k-means clustering approach by increasing the number of clusters.

New Map-Matching Algorithm Using Virtual Track for Pedestrian Dead Reckoning

  • Shin, Seung-Hyuck;Park, Chan-Gook;Choi, Sang-On
    • ETRI Journal
    • /
    • 제32권6호
    • /
    • pp.891-900
    • /
    • 2010
  • In this paper, a map-matching (MM) algorithm which combines an estimated position with digital road data is proposed. The presented algorithm using a virtual track is appropriate for a MEMS-based pedestrian dead reckoning (PDR) system, which can be used in mobile devices. Most of the previous MM algorithms are for car navigation systems and GPS-based navigation system, so existing MM algorithms are not appropriate for the pure DR-based pedestrian navigation system. The biggest problem of previous MM algorithms is that they cannot determine the correct road segment (link) due to the DR characteristics. In DR-based navigation system, the current position is propagated from the previous estimated position. This means that the MM result can be placed on a wrong link when MM algorithm fails to decide the correct link at once. It is a critical problem. Previous algorithms never overcome this problem because they did not consider pure DR characteristics. The MM algorithm using the virtual track is proposed to overcome this problem with improved accuracy. Performance of the proposed MM algorithm was verified by experiments.

등각원형배열을 고려한 코히어런트 다중신호 방향탐지 기법 연구 (The Study of Direction Finding Algorithms for Coherent Multiple Signals in Uniform Circular Array)

  • 박철순;이호주;장원
    • 한국군사과학기술학회지
    • /
    • 제12권1호
    • /
    • pp.97-105
    • /
    • 2009
  • In this paper, the performance of AP(Alternating Projection) and EM(Expectation Maximization) algorithms is investigated in terms of detection of multiple signals, resolvability of coherent signals and the efficiency of sensor array processing. The basic idea of these algorithms is utilization of relaxation technique of successive 1D maximization to solve a direction finding problem by maximizing the multidimensional likelihood function. It means that the function is maximized over only for a single parameter while the other parameters are fixed at each step of the iteration. According to simulation results, the algorithms showed good performance for both incoherent and coherent multiple signals. Moreover, some advantages are identified for direction finding with very small samples and fast convergence. The performance of AP algorithm is compared with that of EM using multiple criteria such as the number of sensor, SNR, the number of samples, and convergence speed over uniform circular array. It is resulted AP algorithm is superior to EM overally except for one criterion, convergence speed. Especially, for EM algorithm there is no performance difference between incoherent and coherent case. In conclusion, AP and EM are viable and practical alternatives, which can be applied to a direction under due to the resolvability of multi-path signals, reliable performance and no troublesome eigen-decomposition of the sample-covariance matrix.