• Title/Summary/Keyword: Time-based Clustering

Search Result 721, Processing Time 0.033 seconds

GGenre Pattern based User Clustering for Performance Improvement of Collaborative Filtering System (협업적 여과 시스템의 성능 향상을 위한 장르 패턴 기반 사용자 클러스터링)

  • Choi, Ja-Hyun;Ha, In-Ay;Hong, Myung-Duk;Jo, Geun-Sik
    • Journal of the Korea Society of Computer and Information
    • /
    • v.16 no.11
    • /
    • pp.17-24
    • /
    • 2011
  • Collaborative filtering system is the clustering about user is built and then based on that clustering results will recommend the preferred item to the user. However, building user clustering is time consuming and also once the users evaluate and give feedback about the film then rebuilding the system is not simple. In this paper, genre pattern of movie recommendation systems is being used and in order to simplify and reduce time of rebuilding user clustering. A Frequent pattern networks is used and then extracts user preference genre patterns and through that extracted patterns user clustering will be built. Through built the clustering for all neighboring users to collaborative filtering is applied and then recommends movies to the user. When receiving user information feedback, traditional collaborative filtering is to rebuild the clustering for all neighbouring users to research and do the clustering. However by using frequent pattern Networks, through user clustering based on genre pattern, collaborative filtering is applied and when rebuilding user clustering inquiry limited by search time can be reduced. After receiving user information feedback through proposed user clustering based on genre pattern, the time that need to spent on re-establishing user clustering can be reduced and also enable the possibility of traditional collaborative filtering systems and recommendation of a similar performance.

Cluster ing for Analysis of Raman Hyper spectral Dental Data

  • Jung, Sung-Hwan
    • Journal of Korea Multimedia Society
    • /
    • v.16 no.1
    • /
    • pp.19-28
    • /
    • 2013
  • In this research, we presented an effective clustering method based on ICA for the analysis of huge Raman hyperspectral dental data. The hyperspectral dataset captured by HR800 micro Raman spectrometer at UMKC-CRISP(University of Missouri-Kansas City Center for Research on Interfacial Structure and Properties), has 569 local points. Each point has 1,005 hyperspectal dentin data. We compared the clustering effectiveness and the clustering time for the case of using all dataset directly and the cases of using the scores after PCA and ICA. As the result of experiment, the cases of using the scores after PCA and ICA showed, not only more detailed internal dentin information in the aspect of medical analysis, but also about 7~19 times much shorter processing times for clustering. ICA based approach also presented better performance than that of PCA, in terms of the detailed internal information of dentin and the clustering time. Therefore, we could confirm the effectiveness of ICA for the analysis of Raman hyperspectral dental data.

Clustering Algorithm for Time Series with Similar Shapes

  • Ahn, Jungyu;Lee, Ju-Hong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.7
    • /
    • pp.3112-3127
    • /
    • 2018
  • Since time series clustering is performed without prior information, it is used for exploratory data analysis. In particular, clusters of time series with similar shapes can be used in various fields, such as business, medicine, finance, and communications. However, existing time series clustering algorithms have a problem in that time series with different shapes are included in the clusters. The reason for such a problem is that the existing algorithms do not consider the limitations on the size of the generated clusters, and use a dimension reduction method in which the information loss is large. In this paper, we propose a method to alleviate the disadvantages of existing methods and to find a better quality of cluster containing similarly shaped time series. In the data preprocessing step, we normalize the time series using z-transformation. Then, we use piecewise aggregate approximation (PAA) to reduce the dimension of the time series. In the clustering step, we use density-based spatial clustering of applications with noise (DBSCAN) to create a precluster. We then use a modified K-means algorithm to refine the preclusters containing differently shaped time series into subclusters containing only similarly shaped time series. In our experiments, our method showed better results than the existing method.

Determining on Model-based Clusters of Time Series Data (시계열데이터의 모델기반 클러스터 결정)

  • Jeon, Jin-Ho;Lee, Gye-Sung
    • The Journal of the Korea Contents Association
    • /
    • v.7 no.6
    • /
    • pp.22-30
    • /
    • 2007
  • Most real word systems such as world economy, stock market, and medical applications, contain a series of dynamic and complex phenomena. One of common methods to understand these systems is to build a model and analyze the behavior of the system. In this paper, we investigated methods for best clustering over time series data. As a first step for clustering, BIC (Bayesian Information Criterion) approximation is used to determine the number of clusters. A search technique to improve clustering efficiency is also suggested by analyzing the relationship between data size and BIC values. For clustering, two methods, model-based and similarity based methods, are analyzed and compared. A number of experiments have been performed to check its validity using real data(stock price). BIC approximation measure has been confirmed that it suggests best number of clusters through experiments provided that the number of data is relatively large. It is also confirmed that the model-based clustering produces more reliable clustering than similarity based ones.

Curve Clustering in Microarray

  • Lee, Kyeong-Eun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.15 no.3
    • /
    • pp.575-584
    • /
    • 2004
  • We propose a Bayesian model-based approach using a mixture of Dirichlet processes model with discrete wavelet transform, for curve clustering in the microarray data with time-course gene expressions.

  • PDF

Novel Techniques for Real Time Computing Critical Clearing Time SIME-B and CCS-B

  • Dinh, Hung Nguyen;Nguyen, Minh Y.;Yoon, Yong Tae
    • Journal of Electrical Engineering and Technology
    • /
    • v.8 no.2
    • /
    • pp.197-205
    • /
    • 2013
  • Real time transient stability assessment mainly depends on real-time prediction. Unfortunately, conventional techniques based on offline analysis are too slow and unreliable in complex power systems. Hence, fast and reliable stability prediction methods and simple stability criterions must be developed for real time purposes. In this paper, two new methods for real time determining critical clearing time based on clustering identification are proposed. This article is covering three main sections: (i) clustering generators and recognizing critical group; (ii) replacing the multi-machine system by a two-machine dynamic equivalent and eventually, to a one-machine-infinite-bus system; (iii) presenting a new method to predict post-fault trajectory and two simple algorithms for calculating critical clearing time, respectively established upon two different transient stability criterions. The performance is expected to figure out critical clearing time within 100ms-150ms and with an acceptable accuracy.

Adaptive Event Clustering for Personalized Photo Browsing (사진 사용 이력을 이용한 이벤트 클러스터링 알고리즘)

  • Kim, Kee-Eung;Park, Tae-Suh;Park, Min-Kyu;Lee, Yong-Beom;Kim, Yeun-Bae;Kim, Sang-Ryong
    • 한국HCI학회:학술대회논문집
    • /
    • 2006.02a
    • /
    • pp.711-716
    • /
    • 2006
  • Since the introduction of digital camera to the mass market, the number of digital photos owned by an individual is growing at an alarming rate. This phenomenon naturally leads to the issues of difficulties while searching and browsing in the personal digital photo archive. Traditional approach typically involves content-based image retrieval using computer vision algorithms. However, due to the performance limitations of these algorithms, at least on the casual digital photos taken by non-professional photographers, more recent approaches are centered on time-based clustering algorithms, analyzing the shot times of photos. These time-based clustering algorithms are based on the insight that when these photos are clustered according to the shot-time similarity, we have "event clusters" that will help the user browse through her photo archive. It is also reported that one of the remaining problems with the time-based approach is that people perceive events in different scales. In this paper, we present an adaptive time-based clustering algorithm that exploits the usage history of digital photos in order to infer the user's preference on the event granularity. Experiments show significant performance improvements in the clustering accuracy.

  • PDF

Improved Image Clustering Algorithm based on Weighted Sub-sampling (Weighted subsampling 기반의 향상된 영상 클러스터링 알고리즘)

  • Choi, Byung-In;Nam, Sang-Hoon;Joung, Shi-Chang;Youn, Jung-Su;Yang, Yu-Kyung
    • Proceedings of the IEEK Conference
    • /
    • 2008.06a
    • /
    • pp.939-940
    • /
    • 2008
  • In this paper, we propose a novel image clustering method based on weighted sub-sampling to reduce clustering time and the number of clusters for target detection and tracking. Our proposed method first obtain sub-sampling image with specific weights which is the number of target pixels in sampling region. After performing clustering procedure, the cluster center position is properly obtained using weights of target pixels in the cluster. Therefore, our proposed method can not only reduce clustering time, but also obtain proper cluster center.

  • PDF

A Study on Representative Skyline Using Connected Component Clustering

  • Choi, Jong-Hyeok;Nasridinov, Aziz
    • Journal of Multimedia Information System
    • /
    • v.6 no.1
    • /
    • pp.37-42
    • /
    • 2019
  • Skyline queries are used in a variety of fields to make optimal decisions. However, as the volume of data and the dimension of the data increase, the number of skyline points increases with the amount of time it takes to discover them. Mainly, because the number of skylines is essential in many real-life applications, various studies have been proposed. However, previous researches have used the k-parameter methods such as top-k and k-means to discover representative skyline points (RSPs) from entire skyline point set, resulting in high query response time and reduced representativeness due to k dependency. To solve this problem, we propose a new Connected Component Clustering based Representative Skyline Query (3CRS) that can discover RSP quickly even in high-dimensional data through connected component clustering. 3CRS performs fast discovery and clustering of skylines through hash indexes and connected components and selects RSPs from each cluster. This paper proves the superiority of the proposed method by comparing it with representative skyline queries using k-means and DBSCAN with the real-world dataset.

Clustering Analysis with Spring Discharge Data and Evaluation of Groundwater System in Jeju Island (용천수 유출량 클러스터링 해석을 이용한 제주도 지하수 순환 해석)

  • Kim Tae-Hui;Mun Deok-Cheol;Park Won-Bae;Park Gi-Hwa;Go Gi-Won
    • Proceedings of the Korean Society of Soil and Groundwater Environment Conference
    • /
    • 2005.04a
    • /
    • pp.296-299
    • /
    • 2005
  • Time series of spring discharge data in Jeju island can provide abundant information on the spatial groundwater system. In this study, the classification based on time series of spring discharge was performed with clustering analysis: discharge rate and EC. Peak discharges are mainly observed in august or september. However, double peaks and late peaks of discharge are also observed at a plenty of springs. Based on results of clustering analysis, it can be deduced that GH model is not appropriate for the conceptual model of Groundwater system in Jeju island. EC distributions in dry season are also support the conclusion.

  • PDF