• 제목/요약/키워드: Multivariate Data

검색결과 1,980건 처리시간 0.026초

Using Structural Changes to support the Neural Networks based on Data Mining Classifiers: Application to the U.S. Treasury bill rates

  • 오경주
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 한국데이터정보과학회 2003년도 추계학술대회
    • /
    • pp.57-72
    • /
    • 2003
  • This article provides integrated neural network models for the interest rate forecasting using change-point detection. The model is composed of three phases. The first phase is to detect successive structural changes in interest rate dataset. The second phase is to forecast change-point group with data mining classifiers. The final phase is to forecast the interest rate with BPN. Based on this structure, we propose three integrated neural network models in terms of data mining classifier: (1) multivariate discriminant analysis (MDA)-supported neural network model, (2) case based reasoning (CBR)-supported neural network model and (3) backpropagation neural networks (BPN)-supported neural network model. Subsequently, we compare these models with a neural network model alone and, in addition, determine which of three classifiers (MDA, CBR and BPN) can perform better. For interest rate forecasting, this study then examines the predictability of integrated neural network models to represent the structural change.

  • PDF

Discriminant analysis using empirical distribution function

  • Kim, Jae Young;Hong, Chong Sun
    • Journal of the Korean Data and Information Science Society
    • /
    • 제28권5호
    • /
    • pp.1179-1189
    • /
    • 2017
  • In this study, we propose an alternative method for discriminant analysis using a multivariate empirical distribution function to express multivariate data as a simple one-dimensional statistic. This method turns to be the estimation process of the optimal threshold based on classification accuracy measures and an empirical distribution function of data composed of classes. This can also be visually represented on a two-dimensional plane and discussed with some measures in ROC curves, surfaces, and manifolds. In order to explore the usefulness of this method for discriminant analysis in the study, we conducted comparisons between the proposed method and the existing methods through simulations and illustrative examples. It is found that the proposed method may have better performances for some cases.

다가자료에 적합한 다변수 감마-포아송 모델과 파라미터 추정방법 : LCD 화소불량 응용 (Multivariate Gamma-Poisson Model and Parameter Estimation for Polytomous Data : Application to Defective Pixels of LCD)

  • 하정훈
    • 산업경영시스템학회지
    • /
    • 제34권1호
    • /
    • pp.42-51
    • /
    • 2011
  • Poisson model and Gamma-Poisson model are popularly used to analyze statistical behavior from defective data. The methods are based on binary criteria, that is, good or failure. However, manufacturing industries prefer polytomous criteria for classifying manufactured products due to flexibility of marketing. In this paper, I introduce two multivariate Gamma-Poisson(MGP) models and estimation methods of the parameters in the models, which are able to handle polytomous data. The models and estimators are verified on defective pixels of LCD manufacturing. Experimental results show that both the independent MGP model and the multinomial MGP model have excellent performance in terms of mean absolute deviation and the choice of method depends on the purpose of use.

Analysis of Multivariate Financial Time Series Using Cointegration : Case Study

  • Choi, M.S.;Park, J.A.;Hwang, S.Y.
    • Journal of the Korean Data and Information Science Society
    • /
    • 제18권1호
    • /
    • pp.73-80
    • /
    • 2007
  • Cointegration(together with VARMA(vector ARMA)) has been proven to be useful for analyzing multivariate non-stationary data in the field of financial time series. It provides a linear combination (which turns out to be stationary series) of non-stationary component series. This linear combination equation is referred to as long term equilibrium between the component series. We consider two sets of Korean bivariate financial time series and then illustrate cointegration analysis. Specifically estimated VAR(vector AR) and VECM(vector error correction model) are obtained and CV(cointegrating vector) is found for each data sets.

  • PDF

A spatial heterogeneity mixed model with skew-elliptical distributions

  • Farzammehr, Mohadeseh Alsadat;McLachlan, Geoffrey J.
    • Communications for Statistical Applications and Methods
    • /
    • 제29권3호
    • /
    • pp.373-391
    • /
    • 2022
  • The distribution of observations in most econometric studies with spatial heterogeneity is skewed. Usually, a single transformation of the data is used to approximate normality and to model the transformed data with a normal assumption. This assumption is however not always appropriate due to the fact that panel data often exhibit non-normal characteristics. In this work, the normality assumption is relaxed in spatial mixed models, allowing for spatial heterogeneity. An inference procedure based on Bayesian mixed modeling is carried out with a multivariate skew-elliptical distribution, which includes the skew-t, skew-normal, student-t, and normal distributions as special cases. The methodology is illustrated through a simulation study and according to the empirical literature, we fit our models to non-life insurance consumption observed between 1998 and 2002 across a spatial panel of 103 Italian provinces in order to determine its determinants. Analyzing the posterior distribution of some parameters and comparing various model comparison criteria indicate the proposed model to be superior to conventional ones.

Multivariate Time Series Analysis for Rainfall Prediction with Artificial Neural Networks

  • Narimani, Roya;Jun, Changhyun
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2021년도 학술발표회
    • /
    • pp.135-135
    • /
    • 2021
  • In water resources management, rainfall prediction with high accuracy is still one of controversial issues particularly in countries facing heavy rainfall during wet seasons in the monsoon climate. The aim of this study is to develop an artificial neural network (ANN) for predicting future six months of rainfall data (from April to September 2020) from daily meteorological data (from 1971 to 2019) such as rainfall, temperature, wind speed, and humidity at Seoul, Korea. After normalizing these data, they were trained by using a multilayer perceptron (MLP) as a class of the feedforward ANN with 15,000 neurons. The results show that the proposed method can analyze the relation between meteorological datasets properly and predict rainfall data for future six months in 2020, with an overall accuracy over almost 70% and a root mean square error of 0.0098. This study demonstrates the possibility and potential of MLP's applications to predict future daily rainfall patterns, essential for managing flood risks and protecting water resources.

  • PDF

동적 다변량 그래프의 연속적 분석을 위한 질의 모델 설계 및 구현 (A Query Model for Consecutive Analyses of Dynamic Multivariate Graphs)

  • 배예찬;함도영;김태양;정혜진;김동윤
    • 컴퓨터교육학회논문지
    • /
    • 제17권6호
    • /
    • pp.103-113
    • /
    • 2014
  • 본 연구에서는 동적 다변량 그래프 데이터의 연속적 분석이 가능한 질의 모델을 설계 및 구현하였다. 먼저, 질의 모델을 판별함수 설정과 시간에 따른 통합 방법 선택의 두 단계로 설계하고, 질의 패널, 그래프 시각화 패널, 속성 패널로 구성된 질의 시스템으로 구현하였다. 또한, 그래프 표현에는 노드-링크 다이어그램과 Force-Directed Graph Drawing 알고리즘을 이용하였으며, 질의 결과로 선택된 대상들에 효과를 적용하여 사용자가 시각적으로 구분할 수 있도록 처리하였다. 마지막으로, 세계 소형 무기 거래량 데이터를 이용하여, 본 연구에서 설계한 동적 다변량 그래프 질의 모델을 검증하였다. 본 연구는 동적 그래프의 연속적 분석이 가능한 새로운 질의 모델을 설계하는 것을 통해, 기존 모델이 동적 그래프를 시점별로 이산적으로만 분석할 수 있는 한계를 개선하였다는데 의의가 있다. 본 연구는 추세 분석이나, 복잡계 네트워크 해석 등 동적 그래프를 사용하는 연구에 기여할 수 있을 것으로 기대된다.

  • PDF

인공 신경망 회귀 모델을 활용한 인버터 기반 태양광 발전량 예측 알고리즘 (Inverter-Based Solar Power Prediction Algorithm Using Artificial Neural Network Regression Model)

  • 박건하;임수창;김종찬
    • 한국전자통신학회논문지
    • /
    • 제19권2호
    • /
    • pp.383-388
    • /
    • 2024
  • 본 논문은 전라남도에서 측정한 태양광 발전 데이터를 기반으로 발전량 예측값을 도출하기 위한 연구이다. 발전량 측정을 위해 인버터에서 직류, 교류, 환경데이터와 같은 다변량 변수를 측정하였고, 측정값의 안정성과 신뢰성 확보를 위한 전처리 작업을 수행하였다. 상관관계 분석은 부분자기상관함수(PACF: Partial Autocorrelation Function)을 활용하여 시계열 데이터에서 발전량과 상관성이 높은 데이터만을 예측을 위해 사용하였다. 태양광 발전량 예측을 위해 딥러닝 모델을 이용하여 발전량을 측정했고, 예측 정확도를 높이기 위해 각 다변량 변수의 상관관계 분석 결과를 이용하였다. 정제된 데이터를 활용한 학습은 기존 데이터를 그대로 사용했을 때 보다 안정되었고, 상관관계 분석 결과를 반영하여 다변량 변수 중 상관성이 높은 변수만을 활용하여 태양광 발전량 예측 알고리즘을 개선하였다.

A Note on the Chi-Square Test for Multivariate Normality Based on the Sample Mahalanobis Distances

  • Park, Cheolyong
    • Journal of the Korean Statistical Society
    • /
    • 제28권4호
    • /
    • pp.479-488
    • /
    • 1999
  • Moore and Stubblebine(1981) suggested a chi-square test for multivariate normality based on cell counts calculated from the sample Mahalanobis distances. They derived the limiting distribution of the test statistic only when equiprobable cells are employed. Using conditional limit theorems, we derive the limiting distribution of the statistic as well as the asymptotic normality of the cell counts. These distributions are valid even when equiprobable cells are not employed. We finally apply this method to a real data set.

  • PDF

경영정보의 인과구조 구축을 위한 다변량통계기법 적용에 관한 연구 (A study on applying multivariate statistical method for making casual structure in management information)

  • 조성훈;김태성
    • 한국경영과학회:학술대회논문집
    • /
    • 한국경영과학회 1996년도 추계학술대회발표논문집; 고려대학교, 서울; 26 Oct. 1996
    • /
    • pp.117-120
    • /
    • 1996
  • The objective of this study is to suggest modified Covariance Structure Analysis that combine with existing Multivariate Statistical Method which is used Casual Analysis Method in Management Information. For this purpose, we'll consider special feature and limitation about Correlation Analysis, Regression Analysis, Path Analysis and connect Covariance Structure Analysis with Statistical Factor Analysis so that theoretical casual model compare with variables structure in collecting data. A example is also presented to show the practical applicability of this approach.

  • PDF