Search | Korea Science

Approximate Clustering on Data Streams Using Discrete Cosine Transform

Yu, Feng;Oyana, Damalie;Hou, Wen-Chi;Wainer, Michael
- Journal of Information Processing Systems
- /
- v.6 no.1
- /
- pp.67-78
- /
- 2010
In this study, a clustering algorithm that uses DCT transformed data is presented. The algorithm is a grid density-based clustering algorithm that can identify clusters of arbitrary shape. Streaming data are transformed and reconstructed as needed for clustering. Experimental results show that DCT is able to approximate a data distribution efficiently using only a small number of coefficients and preserve the clusters well. The grid based clustering algorithm works well with DCT transformed data, demonstrating the viability of DCT for data stream clustering applications.
https://doi.org/10.3745/JIPS.2010.6.1.067 인용 PDF KSCI

Clustering Algorithm by Grid-based Sampling

Park, Hee-Chang;Ryu, Jee-Hyun
- 한국데이터정보과학회:학술대회논문집
- /
- 2003.05a
- /
- pp.97-108
- /
- 2003
Cluster analysis has been widely used in many applications, such that pattern analysis or recognition, data analysis, image processing, market research on on-line or off-line and so on. Clustering can identify dense and sparse regions among data attributes or object attributes. But it requires many hours to get clusters that we want, because of clustering is more primitive, explorative and we make many data an object of cluster analysis. In this paper we propose a new method of clustering using sample based on grid. It is more fast than any traditional clustering method and maintains its accuracy. It reduces running time by using grid-based sample. And other clustering applications can be more effective by using this methods with its original methods.
PDF

K-means Clustering using a Center Of Gravity for grid-based sample

Park, Hee-Chang;Lee, Sun-Myung
- 한국데이터정보과학회:학술대회논문집
- /
- 2004.04a
- /
- pp.51-60
- /
- 2004
K-means clustering is an iterative algorithm in which items are moved among sets of clusters until the desired set is reached. K-means clustering has been widely used in many applications, such as market research, pattern analysis or recognition, image processing, etc. It can identify dense and sparse regions among data attributes or object attributes. But k-means algorithm requires many hours to get k clusters that we want, because it is more primitive, explorative. In this paper we propose a new method of k-means clustering using a center of gravity for grid-based sample. It is more fast than any traditional clustering method and maintains its accuracy.
PDF

K-means Clustering using Grid-based Representatives

Park, Hee-Chang;Lee, Sun-Myung
- Journal of the Korean Data and Information Science Society
- /
- v.16 no.4
- /
- pp.759-768
- /
- 2005
K-means clustering has been widely used in many applications, such that pattern analysis, data analysis, market research and so on. It can identify dense and sparse regions among data attributes or object attributes. But k-means algorithm requires many hours to get k clusters, because it is more primitive and explorative. In this paper we propose a new method of k-means clustering using the grid-based representative value(arithmetic and trimmed mean) for sample. It is more fast than any traditional clustering method and maintains its accuracy.
PDF

An Optimization Method for the Calculation of SCADA Main Grid's Theoretical Line Loss Based on DBSCAN

Cao, Hongyi;Ren, Qiaomu;Zou, Xiuguo;Zhang, Shuaitang;Qian, Yan
- Journal of Information Processing Systems
- /
- v.15 no.5
- /
- pp.1156-1170
- /
- 2019
In recent years, the problem of data drifted of the smart grid due to manual operation has been widely studied by researchers in the related domain areas. It has become an important research topic to effectively and reliably find the reasonable data needed in the Supervisory Control and Data Acquisition (SCADA) system has become an important research topic. This paper analyzes the data composition of the smart grid, and explains the power model in two smart grid applications, followed by an analysis on the application of each parameter in density-based spatial clustering of applications with noise (DBSCAN) algorithm. Then a comparison is carried out for the processing effects of the boxplot method, probability weight analysis method and DBSCAN clustering algorithm on the big data driven power grid. According to the comparison results, the performance of the DBSCAN algorithm outperforming other methods in processing effect. The experimental verification shows that the DBSCAN clustering algorithm can effectively screen the power grid data, thereby significantly improving the accuracy and reliability of the calculation result of the main grid's theoretical line loss.
https://doi.org/10.3745/JIPS.02.0119 인용 PDF KSCI

K-means Clustering using a Grid-based Sampling

Park, Hee-Chang;Lee, Sun-Myung
- 한국데이터정보과학회:학술대회논문집
- /
- 2003.10a
- /
- pp.249-258
- /
- 2003
K-means clustering has been widely used in many applications, such that pattern analysis or recognition, data analysis, image processing, market research and so on. It can identify dense and sparse regions among data attributes or object attributes. But k-means algorithm requires many hours to get k clusters that we want, because it is more primitive, explorative. In this paper we propose a new method of k-means clustering using the grid-based sample. It is more fast than any traditional clustering method and maintains its accuracy.
PDF

Clustering Algorithm using a Center Of Gravity for Grid-based Sample

Park, Hee-Chang;Ryu, Jee-Hyun
- 한국데이터정보과학회:학술대회논문집
- /
- 2003.05a
- /
- pp.77-88
- /
- 2003
Cluster analysis has been widely used in many applications, such that data analysis, pattern recognition, image processing, etc. But clustering requires many hours to get clusters that we want, because it is more primitive, explorative and we make many data an object of cluster analysis. In this paper we propose a new clustering method, 'Clustering algorithm using a center of gravity for grid-based sample'. It is more fast than any traditional clustering method and maintains accuracy. It reduces running time by using grid-based sample and keeps accuracy by using representative point, a center of gravity.
PDF

Clustering Algorithm Using a Center of Gravity for Grid-based Sample

Park, Hee-Chang;Ryu, Jee-Hyun
- Journal of the Korean Data and Information Science Society
- /
- v.16 no.2
- /
- pp.217-226
- /
- 2005
Cluster analysis has been widely used in many applications, such as data analysis, pattern recognition, image processing, etc. But clustering requires many hours to get clusters that we want, because it is more primitive, explorative and we make many data an object of cluster analysis. In this paper we propose a new clustering method, 'Clustering algorithm using a center of gravity for grid-based sample'. It reduces running time by using grid-based sample and keeps accuracy by using representative point, a center of gravity.
PDF

K-means clustering using a center of gravity for grid-based sample (그리드 기반 표본의 무게중심을 이용한 케이-평균군집화)

Lee, Sun-Myung;Park, Hee-Chang
- Journal of the Korean Data and Information Science Society
- /
- v.21 no.1
- /
- pp.121-128
- /
- 2010
K-means clustering is an iterative algorithm in which items are moved among sets of clusters until the desired set is reached. K-means clustering has been widely used in many applications, such as market research, pattern analysis or recognition, image processing, etc. It can identify dense and sparse regions among data attributes or object attributes. But k-means algorithm requires many hours to get k clusters that we want, because it is more primitive, explorative. In this paper we propose a new method of k-means clustering using a center of gravity for grid-based sample. It is more fast than any traditional clustering method and maintains its accuracy.
PDF KSCI

Dynamic Subspace Clustering for Online Data Streams (온라인 데이터 스트림에서의 동적 부분 공간 클러스터링 기법)

Park, Nam Hun
- Journal of Digital Convergence
- /
- v.20 no.2
- /
- pp.217-223
- /
- 2022
Subspace clustering for online data streams requires a large amount of memory resources as all subsets of data dimensions must be examined. In order to track the continuous change of clusters for a data stream in a finite memory space, in this paper, we propose a grid-based subspace clustering algorithm that effectively uses memory resources. Given an n-dimensional data stream, the distribution information of data items in data space is monitored by a grid-cell list. When the frequency of data items in the grid-cell list of the first level is high and it becomes a unit grid-cell, the grid-cell list of the next level is created as a child node in order to find clusters of all possible subspaces from the grid-cell. In this way, a maximum n-level grid-cell subspace tree is constructed, and a k-dimensional subspace cluster can be found at the k^th level of the subspace grid-cell tree. Through experiments, it was confirmed that the proposed method uses computing resources more efficiently by expanding only the dense space while maintaining the same accuracy as the existing method.
https://doi.org/10.14400/JDC.2022.20.2.217 인용 PDF KSCI

Search Result 72, Processing Time 0.02 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)