• Title/Summary/Keyword: time series & cluster analysis

Search Result 77, Processing Time 0.03 seconds

Clustering Algorithm for Time Series with Similar Shapes

  • Ahn, Jungyu;Lee, Ju-Hong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.7
    • /
    • pp.3112-3127
    • /
    • 2018
  • Since time series clustering is performed without prior information, it is used for exploratory data analysis. In particular, clusters of time series with similar shapes can be used in various fields, such as business, medicine, finance, and communications. However, existing time series clustering algorithms have a problem in that time series with different shapes are included in the clusters. The reason for such a problem is that the existing algorithms do not consider the limitations on the size of the generated clusters, and use a dimension reduction method in which the information loss is large. In this paper, we propose a method to alleviate the disadvantages of existing methods and to find a better quality of cluster containing similarly shaped time series. In the data preprocessing step, we normalize the time series using z-transformation. Then, we use piecewise aggregate approximation (PAA) to reduce the dimension of the time series. In the clustering step, we use density-based spatial clustering of applications with noise (DBSCAN) to create a precluster. We then use a modified K-means algorithm to refine the preclusters containing differently shaped time series into subclusters containing only similarly shaped time series. In our experiments, our method showed better results than the existing method.

Sensor clustering technique for practical structural monitoring and maintenance

  • Celik, Ozan;Terrell, Thomas;Gul, Mustafa;Catbas, F. Necati
    • Structural Monitoring and Maintenance
    • /
    • v.5 no.2
    • /
    • pp.273-295
    • /
    • 2018
  • In this study, an investigation of a damage detection methodology for global condition assessment is presented. A particular emphasis is put on the utilization of wireless sensors for more practical, less time consuming, less expensive and safer monitoring and eventually maintenance purposes. Wireless sensors are deployed with a sensor roving technique to maintain a dense sensor field yet requiring fewer sensors. The time series analysis method called ARX models (Auto-Regressive models with eXogeneous input) for different sensor clusters is implemented for the exploration of artificially induced damage and their locations. The performance of the technique is verified by making use of the data sets acquired from a 4-span bridge-type steel structure in a controlled laboratory environment. In that, the free response vibration data of the structure for a specific sensor cluster is measured by both wired and wireless sensors and the acceleration output of each sensor is used as an input to ARX model to estimate the response of the reference channel of that cluster. Using both data types, the ARX based time series analysis method is shown to be effective for damage detection and localization along with the interpretations and conclusions.

A spectrum based evaluation algorithm for micro scale weather analysis module with application to time series cluster analysis (스펙트럼분석 기반의 미기상해석모듈 평가알고리즘 제안 및 시계열 군집분석에의 응용)

  • Kim, Hea-Jung;Kwak, Hwa-Ryun;Kim, Yu-Na;Choi, Young-Jean
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.1
    • /
    • pp.41-53
    • /
    • 2015
  • In meteorological field, many researchers have tried to develop micro scale weather analysis modules for providing real-time weather information service in the metropolitan area. This effort enables us to cope with various economic and social harms coming from serious change in the micro meteorology of a metropolitan area due to rapid urbanization such as quantitative expansions in its urban activity, growth of population, and building concentration. The accuracy of the micro scale weather analysis modules (MSWAM) directly related to usefulness and quality of the real-time weather information service in the metropolitan area. This paper design a evaluation system along with verification tools that sufficiently accommodate spatio-temporal characteristics of the outputs of the MSWAM. For this we proposes a test for the equality of mean vectors of the output series of the MSWAM and corresponding observed time series by using a spectral analysis technique. As a byproduct, a time series cluster analysis method, using a function of the test statistic as the distance measure, is developed. A real data application is given to demonstrate the utility of the method.

The Evaluation of Water Quality in Coastal Sea of Incheon Using a Multivariate Analysis (다변량 해석기법을 이용한 인천연안해역의 수질평가)

  • Kim, Jong-Gu
    • Journal of Environmental Science International
    • /
    • v.15 no.11
    • /
    • pp.1017-1025
    • /
    • 2006
  • This study was conducted to evaluate characteristic of water duality in coastal sea of Incheon using a multivariate analysis. The analysis data in coastal sea of Incheon was aquired by the NFRDI data which was surveyed from March 1997 to November 2003. Eleven water quality parameters were determined on each survey The results were summarized as follow : Water quality in Incheon coastal sea could be explained up to 64.62% by three factors which were included in loading of fresh water and nutrients by the land(36.98%), seasonal variation(16.19%), and internal metabolism (11.24%). The results of time series analysis by factor score, in case of factor 1, station 1 influenced by Han river was shown to high factor score and station 3 located by outer sea was shown to low factor score. In case of factor 2, station 1 was appeared to high variation and station 3 was appeared to low variation. The result of cluster analysis by station was classified into three group that has different water quality characteristics. Especially, station 1 which affected by Han river and station 4 which affected by sewage treatment plant was appeared to considerable water quality characteristics against other station. In yearly cluster analysis, three group was classified and water quality in 2003 years due to high precipitation was different to another year. It could be suggested from these results that it is important to control discharge of fresh water by Han rivet and sewage treatment plant for water quality management of coastal sea of Incheon.

Clustering of Seoul Public Parking Lots and Demand Prediction (서울시 공영주차장 군집화 및 수요 예측)

  • Jeongjoon Hwang;Young-Hyun Shin;Hyo-Sub Sim;Dohyun Kim;Dong-Guen Kim
    • Journal of Korean Society for Quality Management
    • /
    • v.51 no.4
    • /
    • pp.497-514
    • /
    • 2023
  • Purpose: This study aims to estimate the demand for various public parking lots in Seoul by clustering similar demand types of parking lots and predicting the demand for new public parking lots. Methods: We examined real-time parking information data and used time series clustering analysis to cluster public parking lots with similar demand patterns. We also performed various regression analyses of parking demand based on diverse heterogeneous data that affect parking demand and proposed a parking demand prediction model. Results: As a result of cluster analysis, 68 public parking lots in Seoul were clustered into four types with similar demand patterns. We also identified key variables impacting parking demand and obtained a precise model for predicting parking demands. Conclusion: The proposed prediction model can be used to improve the efficiency and publicity of public parking lots in Seoul, and can be used as a basis for constructing new public parking lots that meet the actual demand. Future research could include studies on demand estimation models for each type of parking lot, and studies on the impact of parking lot usage patterns on demand.

A Fusion of the Period Characterized and Hierarchical Bayesian Techniques for Efficient Cluster Analysis of Time Series Data (시계열자료의 효율적 군집분석을 위한 구간특징화와 계층적 베이지안 기법의 융합)

  • Jung, Young-Ae;Jeon, Jin-Ho
    • Journal of Digital Convergence
    • /
    • v.13 no.7
    • /
    • pp.169-175
    • /
    • 2015
  • An effective way to understand the dynamic and time series that follows the passage of time, as valuation is to establish a model to analyze the phenomena of the system. Model of the decision process is efficient clustering information of the total mass of the time series data of the relevant population been collected in a particular number of sub-groups than to look at all a time to an understand of the overall data through each community-specific model determination. In this study, a sub-grouping of the group and the first of the two process model of each cluster by determining, in the following in sub-population characterized by a fusion with heuristic Bayesian clustering techniques proposed a process which can reduce calculation time and cost was confirmed by experiments using actual effectiveness valuation.

Classification and Characteristic Comparison of Groundwater Level Variation in Jeju Island Using Principal Component Analysis and Cluster Analysis (주성분분석 및 군집분석을 이용한 제주도 지하수위 변동 유형 분류 및 특성 비교)

  • Lim, Woo-Ri;Hamm, Se-Yeong;Lee, Chung-Mo
    • Journal of Soil and Groundwater Environment
    • /
    • v.27 no.6
    • /
    • pp.22-36
    • /
    • 2022
  • Water resources in Jeju Island are dependent virtually entirely on groundwater. For groundwater resources, drought damage can cause environmental and economic losses because it progresses slowly and occurs for a long time in a large area. Therefore, this study quantitatively evaluated groundwater level fluctuations using principal component and cluster analyses for 42 monitoring wells in Jeju Island, and further identified the types of groundwater fluctuations caused by drought. As a result of principal component analysis for the monthly average groundwater level during 2005-2019 and the daily average groundwater level during the dry season, it was found that the first three principal components account for most of the variance 74.5-93.5% of the total data. In the cluster analysis using these three principal components, most of wells belong to Cluster 1, and seasonal characteristics have a significant impact on groundwater fluctuations. However, wells belonging to Cluster 2 with high factor loadings of components 2 and 3 affected by groundwater pumping, tide levels, and nearby surface water are mainly distributed on the west coast. Based on these results, it is expected that groundwater in the western area will be more vulnerable to saltwater intrusion and groundwater depletion caused by drought.

Surface Wind Regionalization Based on Similarity of Time-series Wind Vectors

  • Kim, Jinsol;Kim, Hyun-Goo;Park, Hyeong-Dong
    • Asian Journal of Atmospheric Environment
    • /
    • v.10 no.2
    • /
    • pp.80-89
    • /
    • 2016
  • In the complex terrain where local wind systems are formed, accurate understanding of regional wind variability is required for wind resource assessment. In this paper, cluster analysis based on the similarity of time-series wind vector was applied to classify wind regions with similar wind characteristics and the meteorological validity of regionalization method was evaluated. Wind regions in Jeju Island and Busan were classified using the wind resource map of Korea created by a mesoscale numerical weather prediction modeling. The evaluation was performed by comparing wind speed, wind direction, and wind variability of each wind region. Wind characteristics, such as mean wind speed and prevailing wind direction, in the same wind region were similar and wind characteristics in different wind regions were meteor-statistically distinct. It was able to identify a singular wind region at the top area of Mt. Halla using the inconsistency of wind direction variability. Furthermore, it was found that the regionalization results correspond with the topographic features of Jeju Island and Busan, showing the validity.

How the domestic industry of Costa Rica became more competitive in the US market. Antecedents and Trends

  • Pena-Vinces, Jesus C.;Castro, Segundo;Espasandin-Bustelo, Francisco
    • Journal of Distribution Science
    • /
    • v.11 no.4
    • /
    • pp.5-11
    • /
    • 2013
  • Purpose - The aim of this work is to study the reorientation that the export industrial sectors in Costa Rica have experienced during the last 20 years. Research design, data, methodology - The study employs the Cluster Analysis with the export data (20 years of cut-off period) from Costa Rica to the U.S-market. To make the predictions, the technique of the time series was used, with official data (from 2001 to 2010) from the U.S. Department of Commerce and the U.S. International Trade Commission. Results - The Cluster Analysis, show how the economic sectors of traditional products exports of Costa Rica have progressively become in exporters of non-traditional products, meanwhile,the time series confirms that this trend will continue, at least during the next five years. Conclusions - The industry of traditional products exports of Costa Rica (dressmaking, vegetables, coffee, mate, species, etc.) will progressively become in exporters of non- traditional products with a high-tech component (i.e., mechanical equipment and devices, electronic devices and medical equipment),as a consequence of the Chinese (Costa Rica's main competitor) economy's presence in the Organization for Economic Co-operation and Development (OCDE). This fact has enabled the potential improvement of Costa Rica's international competitiveness in the U.S. market.

  • PDF

A Statistical Analysis of the Causes of Marine Incidents occurring during Berthing (정박 중 발생한 준해양사고 원인에 대한 통계 분석 연구)

  • Roh, Boem-Seok;Kang, Suk-Young
    • Journal of Navigation and Port Research
    • /
    • v.45 no.3
    • /
    • pp.95-101
    • /
    • 2021
  • Marine Incidents based on Heinrich's law are very important in preventing accidents. However, marine Incident data are mainly qualitative and are used to prevent similar accidents through case sharing rather than statistical analysis, which can be confirmed in the marine Incident-related data posted in the Korea Maritime Safety Tribunal. Therefore, this study derived quantitative results by analyzing the causes of marine incidents during berthing using various methods of statistical analysis. To this end, data involving marine incidents from various shipping companies were collected and reclassified for easy analysis. The main keywords were derived via primary analysis using text mining. Only meaningful words were selected via verification by an expert group, and time series and cluster analysis were performed to predict marine incidents that may occur during berthing. Although the role of an expert group was still required during the analysis, it was confirmed that quantitative analysis of marine incidents was feasible, and iused to provide cause and accident prevention information.