• Title/Summary/Keyword: Time-based Clustering

Search Result 721, Processing Time 0.027 seconds

Derivation of Digital Music's Ranking Change Through Time Series Clustering (시계열 군집분석을 통한 디지털 음원의 순위 변화 패턴 분류)

  • Yoo, In-Jin;Park, Do-Hyung
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.3
    • /
    • pp.171-191
    • /
    • 2020
  • This study focused on digital music, which is the most valuable cultural asset in the modern society and occupies a particularly important position in the flow of the Korean Wave. Digital music was collected based on the "Gaon Chart," a well-established music chart in Korea. Through this, the changes in the ranking of the music that entered the chart for 73 weeks were collected. Afterwards, patterns with similar characteristics were derived through time series cluster analysis. Then, a descriptive analysis was performed on the notable features of each pattern. The research process suggested by this study is as follows. First, in the data collection process, time series data was collected to check the ranking change of digital music. Subsequently, in the data processing stage, the collected data was matched with the rankings over time, and the music title and artist name were processed. Each analysis is then sequentially performed in two stages consisting of exploratory analysis and explanatory analysis. First, the data collection period was limited to the period before 'the music bulk buying phenomenon', a reliability issue related to music ranking in Korea. Specifically, it is 73 weeks starting from December 31, 2017 to January 06, 2018 as the first week, and from May 19, 2019 to May 25, 2019. And the analysis targets were limited to digital music released in Korea. In particular, digital music was collected based on the "Gaon Chart", a well-known music chart in Korea. Unlike private music charts that are being serviced in Korea, Gaon Charts are charts approved by government agencies and have basic reliability. Therefore, it can be considered that it has more public confidence than the ranking information provided by other services. The contents of the collected data are as follows. Data on the period and ranking, the name of the music, the name of the artist, the name of the album, the Gaon index, the production company, and the distribution company were collected for the music that entered the top 100 on the music chart within the collection period. Through data collection, 7,300 music, which were included in the top 100 on the music chart, were identified for a total of 73 weeks. On the other hand, in the case of digital music, since the cases included in the music chart for more than two weeks are frequent, the duplication of music is removed through the pre-processing process. For duplicate music, the number and location of the duplicated music were checked through the duplicate check function, and then deleted to form data for analysis. Through this, a list of 742 unique music for analysis among the 7,300-music data in advance was secured. A total of 742 songs were secured through previous data collection and pre-processing. In addition, a total of 16 patterns were derived through time series cluster analysis on the ranking change. Based on the patterns derived after that, two representative patterns were identified: 'Steady Seller' and 'One-Hit Wonder'. Furthermore, the two patterns were subdivided into five patterns in consideration of the survival period of the music and the music ranking. The important characteristics of each pattern are as follows. First, the artist's superstar effect and bandwagon effect were strong in the one-hit wonder-type pattern. Therefore, when consumers choose a digital music, they are strongly influenced by the superstar effect and the bandwagon effect. Second, through the Steady Seller pattern, we confirmed the music that have been chosen by consumers for a very long time. In addition, we checked the patterns of the most selected music through consumer needs. Contrary to popular belief, the steady seller: mid-term pattern, not the one-hit wonder pattern, received the most choices from consumers. Particularly noteworthy is that the 'Climbing the Chart' phenomenon, which is contrary to the existing pattern, was confirmed through the steady-seller pattern. This study focuses on the change in the ranking of music over time, a field that has been relatively alienated centering on digital music. In addition, a new approach to music research was attempted by subdividing the pattern of ranking change rather than predicting the success and ranking of music.

Parallelization of Raster GIS Operations Using PC Clusters (PC 클러스터를 이용한 래스터 GIS 연산의 병렬화)

  • 신윤호;박수홍
    • Spatial Information Research
    • /
    • v.11 no.3
    • /
    • pp.213-226
    • /
    • 2003
  • With the increasing demand of processing massive geographic data, conventional GISs based on the single processor architecture appear to be problematic. Especially, performing complex GIS operations on the massive geographic data is very time consuming and even impossible. This is due to the processor speed development does not keep up with the data volume to be processed. In the field of GIS, this PC clustering is one of the emerging technology for handling massive geographic data effectively. In this study, a MPI(Message Passing Interface)-based parallel processing approach was conducted to implement the existing raster GIS operations that typically requires massive geographic data sets in order to improve the processing capabilities and performance. Specially for this research, four types of raster CIS operations that Tomlin(1990) has introduced for systematic analysis of raster GIS operation. A data decomposition method was designed and implemented for selected raster GIS operations.

  • PDF

Advanced Freeway Traffic Safety Warning Information System based on Surrogate Safety Measures (SSM): Information Processing Methods (Surrogate Safety Measures(SSM)기반 고속도로 교통안전 경고정보 처리 및 가공기법)

  • O, Cheol;O, Ju-Taek;Song, Tae-Jin;Park, Jae-Hong;Kim, Tae-Jin
    • Journal of Korean Society of Transportation
    • /
    • v.27 no.3
    • /
    • pp.59-70
    • /
    • 2009
  • This study presents a novel traffic information system which is capable of detecting unsafe traffic events leading to accident occurrence and providing warning information to drivers for safer driving. Unsafe traffic events are captured by a vehicle image processing-based detection system in real time. Surrogate safety measures (SSM) representing quantitative accident potentials were derived, and further utilized to develop a data processing algorithm and analysis techniques in the proposed system. This study also defined 'emergency warning area' and 'general warning area' for more effective provision of warning information. In addition, methodologies for determining thresholds to trigger warning information were presented. Technical issues and further studies to fully exploit the benefits of the proposed system were discussed. It is expected that the proposed system would be effective for better management of traffic flow to prevent traffic accidents on freeways.

Identification of Fuzzy Inference Systems Using a Multi-objective Space Search Algorithm and Information Granulation

  • Huang, Wei;Oh, Sung-Kwun;Ding, Lixin;Kim, Hyun-Ki;Joo, Su-Chong
    • Journal of Electrical Engineering and Technology
    • /
    • v.6 no.6
    • /
    • pp.853-866
    • /
    • 2011
  • We propose a multi-objective space search algorithm (MSSA) and introduce the identification of fuzzy inference systems based on the MSSA and information granulation (IG). The MSSA is a multi-objective optimization algorithm whose search method is associated with the analysis of the solution space. The multi-objective mechanism of MSSA is realized using a non-dominated sorting-based multi-objective strategy. In the identification of the fuzzy inference system, the MSSA is exploited to carry out parametric optimization of the fuzzy model and to achieve its structural optimization. The granulation of information is attained using the C-Means clustering algorithm. The overall optimization of fuzzy inference systems comes in the form of two identification mechanisms: structure identification (such as the number of input variables to be used, a specific subset of input variables, the number of membership functions, and the polynomial type) and parameter identification (viz. the apexes of membership function). The structure identification is developed by the MSSA and C-Means, whereas the parameter identification is realized via the MSSA and least squares method. The evaluation of the performance of the proposed model was conducted using three representative numerical examples such as gas furnace, NOx emission process data, and Mackey-Glass time series. The proposed model was also compared with the quality of some "conventional" fuzzy models encountered in the literature.

Health State Clustering and Prediction Based on Bayesian HMM (Bayesian HMM 기반의 건강 상태 분류 및 예측)

  • Sin, Bong-Kee
    • Journal of KIISE
    • /
    • v.44 no.10
    • /
    • pp.1026-1033
    • /
    • 2017
  • In this paper a Bayesian modeling and duration-based prediction method is proposed for health clinic time series data using the Hierarchical Dirichlet Process Hidden Markov Model (HDP-HMM). HDP-HMM is a Bayesian extension of HMM which can find the optimal number of health states, a number which is highly uncertain and even difficult to estimate under the context of health dynamics. Test results of HDP-HMM using simulated data and real health clinic data have shown interesting modeling behaviors and promising prediction performance over the span of up to five years. The future of health change is uncertain and its prediction is inherently difficult, but experimental results on health clinic data suggests that practical long-term prediction is possible and can be made useful if we present multiple hypotheses given dynamic contexts as defined by HMM states.

A Study on Gesture Recognition using Edge Orientation Histogram and HMM (에지 방향성 히스토그램과 HMM을 이용한 제스처 인식에 관한 연구)

  • Lee, Kee-Jun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.15 no.12
    • /
    • pp.2647-2654
    • /
    • 2011
  • In this paper, the algorithm that recognizes the gesture by configuring the feature information obtained through edge orientation histogram and principal component analysis as low dimensional gesture symbol was described. Since the proposed method doesn't require a lot of computations compared to the existing geometric feature based method or appearance based methods and it can maintain high recognition rate by using the minimum information, it is very well suited for real-time system establishment. In addition, to reduce incorrect recognition or recognition errors that occur during gesture recognition, the model feature values projected in the gesture space is configured as a particular status symbol through clustering algorithm to be used as input symbol of hidden Markov models. By doing so, any input gesture will be recognized as the corresponding gesture model with highest probability.

An Energy Efficient Cluster-Based Local Multi-hop Routing Protocol for Wireless Sensor Networks (무선 센서 네트워크를 위한 에너지 효율적인 클러스터 기반 지역 멀티홉 라우팅 프로토콜)

  • Kim, Kyung-Tae;Youn, Hee-Yong
    • The KIPS Transactions:PartC
    • /
    • v.16C no.4
    • /
    • pp.495-504
    • /
    • 2009
  • Wireless sensor networks (WSN) consisting of a largenumber of sensors aims to gather data in a variety of environments and is beingused and applied in many different fields. The sensor nodes composing a sensornetwork operate on battery of limited power and as a result, high energyefficiency and long network lifetime are major goals of research in the WSN. Inthis paper we propose a novel cluster-based local multi-hop routing protocolthat enhances the overall energy efficiency and guarantees reliability in thesystem. The proposed protocol minimizes energy consumption for datatransmission among sensor nodes by forming a multi-hop in the cluster.Moreover, through local cluster head rotation scheme, it efficiently manageswaste of energy caused by frequent formation of clusters which was an issue inthe existing methods. Simulation results show that our scheme enhances energyefficiency and ensure longer network time in the sensor network as comparedwith existing schemes such as LEACH, LEACH-C and PEACH.

Development of a Tailored Analysis System for Korean Working Conditions Survey

  • Seo, Hwa Jeong
    • Safety and Health at Work
    • /
    • v.7 no.3
    • /
    • pp.201-207
    • /
    • 2016
  • Background: Korean Working Conditions Surveys (KWCS), referencing European Working Conditions Surveys, have been conducted three times in order to survey working condition and develop work-related policies. However, we found three limitations for managing the collected KWCS data: (1) there was no computerized system for managing data; (2) statistical KWCS data were provided by limited one-way communication; and (3) the concept of a one-time provision of information was pursued. We suggest a web-based public service system that enables ordinary people to make greater use of the KWCS data, which can be managed constantly in the future. Methods: After considering data characteristics, we designed a database, which was able to have the result of all pairwise combinations with two extracted data to construct an analysis system. Using the data of the social network for each user, the tailored analysis system was developed. This system was developed with three methods: clustering and classification for building a social network, and an infographic method for improving readability through a friendly user interface. Results: We developed a database including one input entity consisting of the sociodemographic characteristics and one output entity consisting of working condition characteristics, such as working pattern and work satisfaction. A web-based public service system to provide tailored contents was completed. Conclusion: This study aimed to present a customized analysis system to use the KWCS data efficiently, provide a large amount of data in a form that can give users a better understanding, and lay the ground for helping researchers and policy makers understand the characteristics.

Interest-based Customer Segmentation Methodology Using Topic Modeling (토픽 분석을 활용한 관심 기반 고객 세분화 방법론)

  • Hyun, Yoonjin;Kim, Namgyu;Cho, Yoonho
    • Journal of Information Technology Applications and Management
    • /
    • v.22 no.1
    • /
    • pp.77-93
    • /
    • 2015
  • As the range of the customer choice becomes more diverse, the average life span of companies' products and services is becoming shorter. Most companies are striving to maximize the revenue by understanding the customer's needs and providing customized products and services. However, companies had to bear a significant burden, in terms of the time and cost involved in the process of determining each individual customer's needs. Therefore, an alternative method is employed that involves grouping the customers into different categories based on certain criteria and establishing a marketing strategy tailored for each group. In this way, customer segmentation and customer clustering are performed using demographic information and behavioral information. Demographic information included sex, age, income level, and etc., while behavioral information was usually identified indirectly through customers' purchase history and search history. However, there is a limitation regarding companies' customer behavioral information, because the information is usually obtained through the limited data provided by a customer on a company's website. This is because the pattern indicated when a customer accesses a particular site might not be representative of the general tendency of that customer. Therefore, in this study, rather than the pattern indicated through a particular site, a customer's interest is identified using that customer's access record pertaining to external news. Hence, by utilizing this method, we proposed a methodology to perform customer segmentation. In addition, by extracting the main issues through a topic analysis covering approximately 3,000 Internet news articles, the actual experiment applying customer segmentation is performed and the applicability of the proposed methodology is analyzed.

Clustering Asian and North African Countries According to Trend of Colon and Rectum Cancer Mortality Rates: an Application of Growth Mixture Models

  • Zayeri, Farid;Sheidaei, Ali;Mansouri, Anita
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.16 no.9
    • /
    • pp.4115-4121
    • /
    • 2015
  • Background: Colorectal cancer is the second most common cause of cancer death with half a million deaths per year. Incidence and mortality rates have demonstrated notable changes in Asian and African countries during the last few decades. In this study, we first aimed to determine the trend of colorectal cancer mortality rate in each Institute for Health Metrics and Evaluation (IHME) region, and then re-classify them to find more homogenous classes. Materials and Methods: Our study population consisted of 52 countries of Asia and North Africa in six IHME pre-defined regions for both genders and age-standardized groups from 1990 to 2010.We first applied simple growth models for pre-defined IHME regions to estimate the intercepts and slopes of mortality rate trends. Then, we clustered the 52 described countries using the latent growth mixture modeling approach for classifying them based on their colorectal mortality rates over time. Results: Statistical analysis revealed that males and people in high income Asia pacific and East Asia countries were at greater risk of death from colon and rectum cancer. In addition, South Asia region had the lowest rates of mortality due to this cancer. Simple growth modeling showed that majority of IHME regions had decreasing trend in mortality rate of colorectal cancer. However, re-classification these countries based on their mortality trend using the latent growth mixture model resulted in more homogeneous classes according to colorectal mortality trend. Conclusions: In general, our statistical analyses showed that most Asian and North African countries had upward trend in their colorectal cancer mortality. We therefore urge the health policy makers in these countries to evaluate the causes of growing mortality and study the interventional programs of successful countries in managing the consequences of this cancer.