통합 검색 | Korea Science

Comparison of time series clustering methods and application to power consumption pattern clustering

Kim, Jaehwi;Kim, Jaehee
- Communications for Statistical Applications and Methods
- /
- 제27권6호
- /
- pp.589-602
- /
- 2020
The development of smart grids has enabled the easy collection of a large amount of power data. There are some common patterns that make it useful to cluster power consumption patterns when analyzing s power big data. In this paper, clustering analysis is based on distance functions for time series and clustering algorithms to discover patterns for power consumption data. In clustering, we use 10 distance measures to find the clusters that consider the characteristics of time series data. A simulation study is done to compare the distance measures for clustering. Cluster validity measures are also calculated and compared such as error rate, similarity index, Dunn index and silhouette values. Real power consumption data are used for clustering, with five distance measures whose performances are better than others in the simulation.
https://doi.org/10.29220/CSAM.2020.27.6.589 인용 PDF KSCI

방향성을 고려한 밀도 기반 클러스터링 기법에 관한 연구 (A Study on Density-Based Clustering Method Considering Directionality)

김진만;국중진
- 반도체디스플레이기술학회지
- /
- 제23권2호
- /
- pp.38-44
- /
- 2024
This research proposed DBSCAN-D, which is a clustering technique for locating POI based on existing density-based clustering research, such as GPS data, generated by moving objects. This method is designed based on 'staying time' and 'directionality' extracted from the relationship between GPS data. The staying time can be extracted through the difference in the reception time between data using the time at which the GPS data is received. Directionality can be expressed by moving the area of data generated later in the direction of the position of the previously generated data by concentrating on the point where the GPS data is sequentially generated. Through these two properties, it is possible to perform clustering suitable for the data set generated by the moving object.
PDF

A Determination of an Optimal Clustering Method Based on Data Characteristics

Kim, Jeong-Hun;Yoo, Kwan-Hee;Nasridinov, Aziz
- 예술인문사회 융합 멀티미디어 논문지
- /
- 제7권8호
- /
- pp.305-314
- /
- 2017
Clustering is a method that collects data objects into groups based on their similary. Performance of the state-of-the-art clustering methods is different according to the data characteristics. There have been numerous studies that performed experiments to compare the accuracy of the state-of-the-art clustering methods by applying various kinds of datasets. A common problem of these studies is that they only consider clustering algorithms that yield the most accurate results for a particular dataset. They do not consider what factors affect the execution time of each clustering method and how they are affected. Nevertheless, execution time is an important factor in clustering performance if there is no significant difference in accuracy. In order to solve the problems of the existing research, through a series of experiments using various types of datasets, we compare the accuracy of four representative clustering methods. In addition, we perform practical clustering performance comparisons by deriving time complexity and identifying factors that influences to its performance.
https://doi.org/10.14257/ajmahs.2017.08.09 인용

스마트 그리드에서의 시계열 군집분석을 통한 전력수요 예측 연구 (A study on electricity demand forecasting based on time series clustering in smart grid)

손흥구;정상욱;김삼용
- 응용통계연구
- /
- 제29권1호
- /
- pp.193-203
- /
- 2016
본 논문은 ICT기반 시장에서의 수요관리시스템에서의 핵심 요소인 전력 수요 예측을 위하여, 전체 사용량을 기반으로 예측 하는 방식이 아닌, 시계열 기반 군집분석을 통한 군집별 예측량의 결합을 실시하였다. 시계열 군집 분석 방법으로서 Periodogram 기반의 정규화 군집분석, 예측 기반의 군집분석, DTW(Dynamic Time Warping)를 이용하여 군집화를 시도하였으며, 군집 별 수요예측 모형으로서 DSHW(Double Seasonal Holt-Winters) 모형, TBATS(Trigonometric, Box-Cox transform, ARMA errors, Trend and Seasonal components) 모형, FARIMA(Fractional ARIMA) 모형을 사용하여 예측을 실시하였다. 전체 사용량을 기반으로 예측 하는 방식이 아닌, 군집분석을 통한 군집별 예측량의 결합이 더 낮은 MAPE로 나타남에 따라 우수한 예측 방법으로 판단되었다.
https://doi.org/10.5351/KJAS.2016.29.1.193 인용 PDF KSCI

Consensus Clustering for Time Course Gene Expression Microarray Data

Kim, Seo-Young;Bae, Jong-Sung
- Communications for Statistical Applications and Methods
- /
- 제12권2호
- /
- pp.335-348
- /
- 2005
The rapid development of microarray technologies enabled the monitoring of expression levels of thousands of genes simultaneously. Recently, the time course gene expression data are often measured to study dynamic biological systems and gene regulatory networks. For the data, biologists are attempting to group genes based on the temporal pattern of their expression levels. We apply the consensus clustering algorithm to a time course gene expression data in order to infer statistically meaningful information from the measurements. We evaluate each of consensus clustering and existing clustering methods with various validation measures. In this paper, we consider hierarchical clustering and Diana of existing methods, and consensus clustering with hierarchical clustering, Diana and mixed hierachical and Diana methods and evaluate their performances on a real micro array data set and two simulated data sets.
https://doi.org/10.5351/CKSS.2005.12.2.335 인용 PDF KSCI

Path based K-means Clustering for RFID Data Sets

Yun, Hong-Won
- Journal of information and communication convergence engineering
- /
- 제6권4호
- /
- pp.434-438
- /
- 2008
Massive data are continuously produced with a data rate of over several terabytes every day. These applications need effective clustering algorithms to achieve an overall high performance computation. In this paper, we propose ancestor as cluster center based approach to clustering, the K-means algorithm using ancestor. We modify the K-means algorithm. We present a clustering architecture and a clustering algorithm that minimize of I/Os and show a performance with excellent. In our experimental performance evaluation, we present that our algorithm can improve the I/O speed and the query processing time.
PDF KSCI

Clustering non-stationary advanced metering infrastructure data

Kang, Donghyun;Lim, Yaeji
- Communications for Statistical Applications and Methods
- /
- 제29권2호
- /
- pp.225-238
- /
- 2022
In this paper, we propose a clustering method for advanced metering infrastructure (AMI) data in Korea. As AMI data presents non-stationarity, we consider time-dependent frequency domain principal components analysis, which is a proper method for locally stationary time series data. We develop a new clustering method based on time-varying eigenvectors, and our method provides a meaningful result that is different from the clustering results obtained by employing conventional methods, such as K-means and K-centres functional clustering. Simulation study demonstrates the superiority of the proposed approach. We further apply the clustering results to the evaluation of the electricity price system in South Korea, and validate the reform of the progressive electricity tariff system.
https://doi.org/10.29220/CSAM.2022.29.2.225 인용 PDF KSCI

Improving Real-Time Efficiency of Case Retrieving Process for Case-Based Reasoning

Park, Yoon-Joo
- Asia pacific journal of information systems
- /
- 제25권4호
- /
- pp.626-641
- /
- 2015
Conventional case-based reasoning (CBR) does not perform efficiently for high-volume datasets because of case retrieval time. To overcome this problem, previous research suggested clustering a case base into several small groups and retrieving neighbors within a corresponding group to a target case. However, this approach generally produces less accurate predictive performance than the conventional CBR. This paper proposes a new case-based reasoning method called the clustering-merging CBR (CM-CBR). The CM-CBR method dynamically indexes a search pool to retrieve neighbors considering the distance between a target case and the centroid of a corresponding cluster. This method is applied to three real-life medical datasets. Results show that the proposed CM-CBR method produces similar or better predictive performance than the conventional CBR and clustering-CBR methods in numerous cases with significantly less computational cost.
https://doi.org/10.14329/apjis.2015.25.4.626 인용 PDF

AN EFFICIENT DENSITY BASED ANT COLONY APPROACH ON WEB DOCUMENT CLUSTERING

M. REKA
- Journal of applied mathematics & informatics
- /
- 제41권6호
- /
- pp.1327-1339
- /
- 2023
World Wide Web (WWW) use has been increasing recently due to users needing more information. Lately, there has been a growing trend in the document information available to end users through the internet. The web's document search process is essential to find relevant documents for user queries.As the number of general web pages increases, it becomes increasingly challenging for users to find records that are appropriate to their interests. However, using existing Document Information Retrieval (DIR) approaches is time-consuming for large document collections. To alleviate the problem, this novel presents Spatial Clustering Ranking Pattern (SCRP) based Density Ant Colony Information Retrieval (DACIR) for user queries based DIR. The proposed first stage is the Term Frequency Weight (TFW) technique to identify the query weightage-based frequency. Based on the weight score, they are grouped and ranked using the proposed Spatial Clustering Ranking Pattern (SCRP) technique. Finally, based on ranking, select the most relevant information retrieves the document using DACIR algorithm.The proposed method outperforms traditional information retrieval methods regarding the quality of returned objects while performing significantly better in run time.
https://doi.org/10.14317/jami.2023.1327 인용 PDF

제조 시계열 데이터를 위한 진화 연산 기반의 하이브리드 클러스터링 기법 (Evolutionary Computation-based Hybird Clustring Technique for Manufacuring Time Series Data)

오상헌;안창욱
- 스마트미디어저널
- /
- 제10권3호
- /
- pp.23-30
- /
- 2021
제조 시계열 데이터 클러스터링 기법은 제조 대용량 데이터 기반 군집화를 통한 설비 및 공정 이상 탐지 분류를 위한 중요한 솔루션이지만 기존 정적 데이터 대상 클러스터링 기법을 시계열 데이터에 적용함에 있어 낮은 정확도를 가지는 단점이 있다. 본 논문에서는 진화 연산 기반 시계열 군집 분석 접근 방식을 제시하여 기존 클러스터링 기술에 대한 정합성 향상하고자 한다. 이를 위하여 먼저 제조 공정 결과 이미지 형상을 선형 스캐닝을 활용하여 1차원 시계열 데이터로 변환하고 해당 변환 데이터 대상으로 Pearson 거리 매트릭을 기반으로 계층적 군집 분석 및 분할 군집 분석에 대한 최적 하위클러스터를 도출한다. 해당 최적 하위클러스터 대상 유전 알고리즘을 활용하여 유사도가 최소화되는 최적의 군집 조합을 도출한다. 그리고 실제 제조 과정 이미지 대상으로 기존 클러스터링 기법과 성능 비교를 통하여 제안된 클러스터링 기법의 성능 우수성을 검증한다.
https://doi.org/10.30693/SMJ.2021.10.3.23 인용 PDF KSCI

검색결과 716건 처리시간 0.021초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)