• Title/Summary/Keyword: multivariate data

Multi-sensor data-based anomaly detection and diagnosis of a pumped storage hydropower plant

  • Sojin Shin;Cheolgyu Hyun;Seongpil Cho;Phill-Seung Lee
    • Structural Engineering and Mechanics / v.88 no.6 / pp.569-581 / 2023
  • This paper introduces a system to detect and diagnose anomalies in pumped storage hydropower plants. We collect data from various types of sensors, including those monitoring temperature, vibration, and power. The data are classified according to the operation modes (pump and turbine operation modes) and normalized to remove the influence of the external environment. To detect anomalies and diagnose their types, we adopt a multivariate normal distribution analysis by learning the distribution of the normal data. The feasibility of the proposed system is evaluated using actual monitoring data of a pumped storage hydropower plant. The proposed system can be used to implement condition monitoring systems for other plants through modifications.
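The abstract does not give the details of the multivariate normal analysis, so the following is only a minimal sketch of one common reading: fit a mean vector and covariance matrix to normal-operation data, then flag readings whose squared Mahalanobis distance exceeds a chi-square quantile. The sensor values, the two-variable layout (temperature, vibration), and the 99% level are illustrative assumptions, not values from the paper.

```python
import math

def fit_normal(rows):
    """Estimate the mean vector and sample covariance matrix of normal-operation data."""
    n, d = len(rows), len(rows[0])
    mean = [sum(r[j] for r in rows) / n for j in range(d)]
    cov = [[sum((r[i] - mean[i]) * (r[j] - mean[j]) for r in rows) / (n - 1)
            for j in range(d)] for i in range(d)]
    return mean, cov

def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x

def mahalanobis_sq(x, mean, cov):
    """Squared Mahalanobis distance of reading x from the fitted normal distribution."""
    diff = [xi - mi for xi, mi in zip(x, mean)]
    z = solve(cov, diff)  # z = cov^{-1} diff, without forming the inverse
    return sum(d * zi for d, zi in zip(diff, z))

# Hypothetical normalized readings (temperature, vibration) from one operation mode.
normal_rows = [(60.0, 1.0), (61.0, 1.1), (59.0, 0.9), (62.0, 1.3),
               (58.0, 0.8), (60.5, 1.0), (59.5, 1.05)]
mean, cov = fit_normal(normal_rows)

# For exactly two variables the chi-square quantile has the closed form -2 ln(1 - p).
threshold = -2.0 * math.log(1.0 - 0.99)

def is_anomaly(x):
    return mahalanobis_sq(x, mean, cov) > threshold
```

In a setup like the one described, a separate model of this kind would presumably be fitted per operation mode (pump and turbine), since the data are classified by mode before analysis.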

Integration of Categorical Data using Multivariate Kriging for Spatial Interpolation of Ground Survey Data (현장 조사 자료의 공간 보간을 위한 다변량 크리깅을 이용한 범주형 자료의 통합)

  • Park, No-Wook
    • Spatial Information Research / v.19 no.4 / pp.81-89 / 2011
  • This paper presents a multivariate kriging algorithm that integrates categorical data as secondary data for spatial interpolation of sparsely sampled ground survey data. Instead of using a constant mean value for each attribute of the categorical data, disaggregated local mean values at target grid points are first estimated by area-to-point kriging and are then used as local means in simple kriging with local means. The algorithm is illustrated through a case study of spatial interpolation of geochemical copper data with geological map data. Cross-validation results indicate that the presented algorithm improves prediction capability by 15% and 25% compared with univariate ordinary kriging and conventional simple kriging with constant mean values, respectively. The multivariate kriging algorithm applied in this study is expected to be effective for spatial interpolation with categorical data.

Method for predicting the diagnosis of mastitis in cows using multivariate data and Recurrent Neural Network (다변량 데이터와 순환 신경망을 이용한 젖소의 유방염 진단예측 방법)

  • Park, Gicheol;Lee, Seonghun;Park, Jaehwa
    • Journal of Software Assessment and Valuation / v.17 no.1 / pp.75-82 / 2021
  • Mastitis in cows is a major factor that hinders the dairy productivity of farms, and many attempts have been made to address it. However, research on mastitis has been limited to diagnosis rather than prediction, and even that has mostly relied on a single sensor. In this study, a predictive model was developed using multivariate data including biometric data and environmental data. The data used for the analysis were collected from robot milking machines and sensors installed in farmhouses in Chungcheongnam-do, South Korea. The recurrent neural network model uses three weeks of data to predict whether mastitis will be diagnosed the next day. As a result, mastitis was predicted with an accuracy of 82.9%. The superiority of the model was confirmed by comparing performance across various data collection periods and various models.
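The "three weeks of data, predict the next day" setup can be sketched as a sliding-window dataset builder. This is an illustration under assumptions: the function name, the feature layout, and the synthetic values below are hypothetical, and the abstract does not describe the authors' actual preprocessing or network.

```python
def make_windows(daily_features, daily_labels, window_days=21):
    """Pair each run of `window_days` consecutive days with the label of the following day."""
    samples = []
    for end in range(window_days, len(daily_features)):
        window = daily_features[end - window_days:end]  # days end-21 .. end-1
        samples.append((window, daily_labels[end]))     # diagnosis on day `end`
    return samples

# Hypothetical data: 25 days, two features per day (e.g. one biometric, one environmental).
features = [[float(day), 0.1 * day] for day in range(25)]
labels = [0] * 25
labels[22] = 1  # assumed mastitis diagnosis on day 22

samples = make_windows(features, labels)
```

Each `(window, label)` pair would then be one training example for a recurrent model, with the window supplying the input sequence.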

Use of Multivariate Statistical Approaches for Decoding Chemical Evolution of Groundwater near Underground Storage Caverns (다변량통계기법을 이용한 지하저장시설 주변의 지하수질 변동에 관한 연구)

  • Lee, Jeonghoon
    • Journal of the Korean earth science society / v.35 no.4 / pp.225-236 / 2014
  • Multivariate statistical analyses have been extensively applied to hydrochemical measurements to analyze and interpret the data. This study examines anthropogenic factors obtained by applying correspondence analysis (CA) and principal component analysis (PCA) to a hydrogeochemical data set. The goal was to synthesize the hydrogeochemical information using these multivariate statistical techniques by incorporating hydrogeochemical speciation results calculated by the commonly used program WATEQ4F, which is included in NETPATH. The selected case study site was a set of LPG underground storage caverns located in southeastern Korea. The highly alkaline groundwaters in this study area are an analogue for the repository system. High pH, speciation of Al, and possible precipitation of calcite characterize these groundwaters. Available groundwater quality monitoring data were used to confirm the statistical models. The present study focused on understanding the hydrogeochemical attributes and establishing the changes of phase when two anthropogenic effects (i.e., disinfection activity and cement pore water) were introduced in the study area. Comparisons between the two statistical results presented and the findings of previous investigations highlight the descriptive capabilities of PCA using the calculated saturation index and of CA as exploratory tools in hydrogeochemical research.

Development of Real-Time Water Quality Abnormality Warning System for Using Multivariate Statistical Method (다변량 통계기법을 활용한 실시간 수질이상 유무 판단 시스템 개발)

  • Heo, Tae-Young;Jeon, Hang-Bae;Park, Sang-Min;Lee, Young-Joo
    • Journal of Korean Society of Environmental Engineers / v.37 no.3 / pp.137-144 / 2015
  • The purpose of this study is to develop a warning system that detects water quality abnormalities in real time using a multivariate statistical approach. Among multivariate data analysis methods, we applied principal component analysis, which exploits the correlation between water quality parameters, within a real-time algorithm that determines whether water quality is abnormal. We applied our approach to real field data and demonstrated the use of the algorithm for real-time monitoring of water quality abnormalities. In addition, by combining our approach with the Korea Meteorological Administration database, we identified heavy rain due to climate change as one of the most important factors explaining water quality abnormality.
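As a rough illustration of how PCA can use the correlation between parameters to flag abnormal readings, the sketch below keeps one principal component and scores each reading by its squared prediction error, the part the component cannot explain. The statistic, the three parameter columns, and all numbers are assumptions for illustration; the abstract does not state which PCA-based criterion the authors used.

```python
import math

def fit_scaler(rows):
    """Column means and sample standard deviations of the reference data."""
    n, d = len(rows), len(rows[0])
    mean = [sum(r[j] for r in rows) / n for j in range(d)]
    std = [math.sqrt(sum((r[j] - mean[j]) ** 2 for r in rows) / (n - 1)) for j in range(d)]
    return mean, std

def scale(row, mean, std):
    return [(x - m) / s for x, m, s in zip(row, mean, std)]

def first_component(z_rows, iterations=200):
    """Leading eigenvector of the correlation matrix, found by power iteration."""
    n, d = len(z_rows), len(z_rows[0])
    corr = [[sum(r[i] * r[j] for r in z_rows) / (n - 1) for j in range(d)] for i in range(d)]
    # Uneven start vector, assumed not orthogonal to the leading eigenvector.
    v = [1.0 / (j + 1) for j in range(d)]
    for _ in range(iterations):
        w = [sum(corr[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v

def residual_sq(z, v):
    """Squared prediction error: the part of z not explained by component v."""
    score = sum(a * b for a, b in zip(z, v))
    return sum((a - score * b) ** 2 for a, b in zip(z, v))

# Hypothetical reference readings of three water quality parameters that move together.
reference = [(1.0, 10.0, 5.0), (2.0, 12.1, 5.9), (3.0, 13.9, 7.1), (4.0, 16.2, 8.0),
             (5.0, 18.0, 9.1), (6.0, 19.8, 9.9), (7.0, 22.1, 11.0)]
mean, std = fit_scaler(reference)
z_reference = [scale(r, mean, std) for r in reference]
component = first_component(z_reference)

train_spe = [residual_sq(z, component) for z in z_reference]
# A reading whose parameters break the usual correlation pattern.
abnormal_spe = residual_sq(scale((4.0, 22.0, 5.0), mean, std), component)
```

A real-time system would compare each new reading's error against a limit derived from the reference period; how that limit is chosen is left open here.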

Bayesian inference on multivariate asymmetric jump-diffusion models (다변량 비대칭 라플라스 점프확산 모형의 베이지안 추론)

  • Lee, Youngeun;Park, Taeyoung
    • The Korean Journal of Applied Statistics / v.29 no.1 / pp.99-112 / 2016
  • Asymmetric jump-diffusion models are effectively used to model the dynamic behavior of asset prices with abrupt asymmetric upward and downward changes. However, the estimation of their extension to the multivariate asymmetric jump-diffusion model has been hampered by the analytically intractable likelihood function. This article confronts the problem using a data augmentation method and proposes a new Bayesian method for a multivariate asymmetric Laplace jump-diffusion model. Unlike previous models, the proposed model is rich enough to incorporate all possible correlated jumps as well as individual and common jumps. The proposed model and methodology are illustrated with a simulation study and applied to daily returns of the KOSPI, S&P500, and Nikkei225 indices from January 2005 to September 2015.

A Study on Forest Land Classification Using Multivariate Statistical Methods : A Case Study at Mt. Kwanak (다변수통계방법을 이용한 산지분류에 관한 연구)

  • 정순오
    • Journal of the Korean Institute of Landscape Architecture / v.13 no.1 / pp.43-66 / 1985
  • Korea needs proper and rational public policies on the conservation and use of forest land and other natural resources because of the accelerating expansion of national land development in recent years. Unfortunately, there is no systematic planning system to support these needs. Generally, forest land use planning requires suitability analysis based on an efficient land classification system. The goal of this study was to classify forest land using multivariate statistical methods. A case study was carried out in the winter of 1983 on a mountainous area higher than 100 m above sea level located at Mt. Kwanak in Anyang-city, Kyung-gi-do (province). The study area covered 19.80 km$^2$ and was divided into 1,383 Operational Taxonomic Units (OTUs) by a 120 m $\times$ 120 m grid. Fourteen descriptors were identified and quantified for each OTU from existing national land data: elevation, slope, aspect, terrain form, geologic material, surface soil permeability, topsoil type, depth of the solum, soil acidity, forest cover type, stand size class, stand age class, stand density class, and simple forest soil capability class. For this study, a FORTRAN IV program was written for the input and output of map data, and the statistical packages SPSS and BMD were used to perform the multivariate statistical analysis. The fourteen variables were analyzed to investigate the characteristics of their frequency distributions and to estimate the correlation coefficients among them. Principal component analysis was executed to find the dimensions of forest land characteristics, and factor scores were used to select proper samples of OTUs throughout the study area. In order to develop the classes of forest land classification based on 102 surrogates, cluster and discriminant analyses of the principal descriptor variable matrix were undertaken.
Results obtained through a series of multivariate statistical analyses were as follows: 1) Principal component analysis proved to be a useful tool for data selection and for identifying the principal descriptor variables, which represented the characteristics of forest land and facilitated the selection of samples.

A Case Study on the Establishment of Upper Control Limit to Detect Vessel's Main Engine Failures using Multivariate Control Chart (다변량 관리도를 활용한 선박 메인 엔진의 이상 관리 상한선 결정에 관한 연구)

  • Bae, Young-Mok;Kim, Min-Jun;Kim, Kwang-Jae;Jun, Chi-Hyuck;Byeon, Sang-Su;Park, Kae-Myoung
    • Journal of the Society of Naval Architects of Korea / v.55 no.6 / pp.505-513 / 2018
  • Main engine failures in ship operations can lead to major damage to the vessel itself and to high financial cost. Monitoring the condition of a vessel's main engine is therefore crucial to ensuring the vessel's performance and reducing maintenance cost. With the advent of advanced data collection technologies, collecting large amounts of vessel operational data in the maritime industry has never been easier. Real-time monitoring of the condition of a vessel's main engine has the potential to create significant value in the maritime industry. This study presents a case study on establishing an upper control limit to detect a vessel's main engine failures using a multivariate control chart. The case study uses sample data from an ocean-going vessel operated by a major marine services company in Korea, collected from May to July 2016. This study first reviews various main engine-related variables that are considered to affect the condition of the main engine, and then attempts to detect abnormalities and their patterns via multivariate control charts. This study is expected to help enhance the vessel's availability and to provide a basis for condition-based maintenance that can support proactive management of a vessel's main engine in the future.

TextNAS Application to Multivariate Time Series Data and Hand Gesture Recognition (textNAS의 다변수 시계열 데이터로의 적용 및 손동작 인식)

  • Kim, Gi-duk;Kim, Mi-sook;Lee, Hack-man
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2021.10a / pp.518-520 / 2021
  • In this paper, we propose a hand gesture recognition method that modifies textNAS, originally used for text classification, so that it can be applied to multivariate time series data. Through multivariate time series classification, the method can be applied to various fields such as behavior recognition, emotion recognition, and hand gesture recognition. In addition, it automatically finds a deep learning model suitable for classification through training, thereby reducing the burden on users and achieving high classification accuracy. Applying the proposed method to the DHG-14/28 and Shrec'17 hand gesture recognition datasets yielded higher classification accuracy than existing models. The classification accuracy was 98.72% and 98.16% for DHG-14/28, and 97.82% and 98.39% for Shrec'17 14 class/28 class.

Pattern Recognition for Typification of Whiskies and Brandies in the Volatile Components using Gas Chromatographic Data

  • Myoung, Sungmin;Oh, Chang-Hwan
    • Journal of the Korea Society of Computer and Information / v.21 no.5 / pp.167-175 / 2016
  • The volatile components of 82 commercial liquors (44 samples of single malt whisky, 20 samples of blended whisky, and 18 samples of brandy) were analyzed by gas chromatography after liquid-liquid extraction with dichloromethane. Pattern recognition techniques such as principal component analysis (PCA), cluster analysis (CA), linear discriminant analysis (LDA), and partial least squares discriminant analysis (PLSDA) were applied to discriminate between the liquor categories. Classification rules were validated by considering the sensitivity and specificity of each class. Both LDA and PLSDA gave 100% sensitivity and specificity for all of the categories. These results suggest that the common characteristics and identities that typify whiskies and brandies can be found using multivariate data analysis methods.
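Per-class sensitivity and specificity, the validation criteria named in the abstract, can be computed one-versus-rest from true and predicted labels. The sketch below uses made-up labels with one deliberate misclassification; it does not reproduce the paper's data or its reported 100% results.

```python
def sensitivity_specificity(y_true, y_pred, positive):
    """One-versus-rest sensitivity and specificity for the class `positive`."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical labels: one single malt sample is misclassified as blended.
y_true = ["malt", "malt", "blend", "blend", "brandy", "brandy"]
y_pred = ["malt", "blend", "blend", "blend", "brandy", "brandy"]

results = {c: sensitivity_specificity(y_true, y_pred, c) for c in ("malt", "blend", "brandy")}
```

Here the single error lowers sensitivity for the malt class and specificity for the blended class, which is why both measures are reported for every class.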