• Title/Summary/Keyword: Multivariate Data

Multivariate EWMA Control Charts for the Variance-Covariance Matrix with Variable Sampling Intervals

  • 조교영
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 4
    • /
    • pp.31-44
    • /
    • 1993
  • Multivariate exponentially weighted moving average (EWMA) control charts for monitoring the variance-covariance matrix are investigated. A variable sampling interval (VSI) feature is considered in these charts. The charts are compared on the basis of their average time to signal (ATS) performance. The numerical results show that multivariate VSI EWMA control charts are more efficient than the corresponding multivariate fixed sampling interval (FSI) EWMA control charts.

Multivariate Control Charts for Several Related Quality Characteristics

  • Chang, Duk-Joon;Shin, Jae-Kyoung
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 16, No. 2
    • /
    • pp.467-476
    • /
    • 2005
  • Multivariate control charts for monitoring the mean vector of several related quality variables with the combine-accumulate approach and the accumulate-combine approach were investigated. A Shewhart chart is also proposed in order to compare it with the performance of CUSUM and EWMA charts. Numerical comparisons show that CUSUM and EWMA charts are more efficient than the Shewhart chart for small or moderate shifts, and that multivariate charts based on the accumulate-combine approach are more efficient than the corresponding multivariate charts based on the combine-accumulate approach.

An application to Multivariate Zero-Inflated Poisson Regression Model

  • Kim, Kyung-Moo
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 14, No. 2
    • /
    • pp.177-186
    • /
    • 2003
  • The zero-inflated Poisson regression is a model for count data with excess zeros. When correlated response variables are of interest, the univariate zero-inflated regression model has to be extended to a multivariate model. In this paper, we study and simulate the multivariate zero-inflated regression model. The model is applied to a real example. Regression parameters are estimated by maximum likelihood (MLE). We also compare the fit of the multivariate zero-inflated Poisson regression model with that of a decision tree model.

Practical Guide to NMR-based Metabolomics - III : NMR Spectrum Processing and Multivariate Analysis

  • Jung, Young-Sang
    • Journal of the Korean Magnetic Resonance Society
    • /
    • Vol. 22, No. 3
    • /
    • pp.46-53
    • /
    • 2018
  • Elucidating metabolic perturbations with NMR-based metabolomics requires knowledge of several technical areas: NMR experiments, NMR spectrum processing, raw data processing, metabolite identification, statistical analysis, and metabolic pathway analysis. Among them, some concepts of raw data processing and multivariate analysis are not easy to understand but are important for correctly interpreting a metabolic profile. This article introduces NMR spectrum processing, raw data processing, and multivariate analysis.

NONPARAMETRIC ONE-SIDED TESTS FOR MULTIVARIATE AND RIGHT CENSORED DATA

  • Park, Hyo-Il;Na, Jong-Hwa
    • Journal of the Korean Statistical Society
    • /
    • Vol. 32, No. 4
    • /
    • pp.373-384
    • /
    • 2003
  • In this paper, we formulate multivariate one-sided alternatives and propose a class of nonparametric tests for possibly right censored data. We obtain the asymptotic tail probability (or p-value) by showing that our proposed test statistics have asymptotically multivariate normal distributions. Also, we illustrate our procedure with an example and compare it with other procedures in terms of empirical powers for the bivariate case. Finally, we discuss some properties of our test.

Multivariate Time Series Simulation With Component Analysis

  • 이태삼;호세살라스;주하카바넨;노재경
    • Korea Water Resources Association: Conference Proceedings
    • /
    • Proceedings of the Korea Water Resources Association 2008 Conference
    • /
    • pp.694-698
    • /
    • 2008
  • In hydrology, it is a difficult task to deal with multivariate time series, such as modeling the streamflows of an entire complex river system. Normal-distribution-based models such as MARMA (Multivariate Autoregressive Moving Average) have been a major approach for modeling multivariate time series, but they have limitations. One is the unfavorable data transformation needed to force the data to follow the normal distribution. Furthermore, a high-dimensional multivariate model requires a very large parameter matrix. An alternative is to decompose the multivariate data into independent components and model each one individually. In 1985, Lins used Principal Component Analysis (PCA). Five scores, the data decomposed from the original data, were taken and modeled individually. One of the five scores was modeled with an AR(2) model while the others were modeled with AR(1) models. From the time series analysis using the scores of the five components, he noted "principal component time series might provide a relatively simple and meaningful alternative to conventional large MARMA models". This study is inspired by that remark to develop a multivariate simulation model using Principal Component Analysis (PCA) and Independent Component Analysis (ICA). Three modeling steps are applied for simulation. (1) PCA is used to decompose the correlated multivariate data into uncorrelated data, while ICA decomposes the data into independent components. Here, the autocorrelation structure of the decomposed data, inherited from the data in the original domain, is still dominant. (2) Each component is resampled by block bootstrapping or K-nearest neighbor resampling. (3) The resampled components are transformed back to the original domain.
From the suggested approach one might expect that a) the simulated data differ from the historical data, b) no data transformation is required (in the case of ICA), and c) a complex system can be decomposed into independent components that are modeled individually. The models with PCA and ICA are compared using various statistics, such as basic statistics (mean, standard deviation, skewness, autocorrelation), reservoir-related statistics, and kernel density estimates.
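
The three steps in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives no algorithmic details, so the function name `pca_block_bootstrap`, the eigendecomposition-based PCA, the moving-block bootstrap with a fixed block length, and the synthetic input data are all assumptions made for illustration. The ICA and K-nearest neighbor variants are omitted, and the input is assumed to have at least two variables and at least `block_len` time steps.

```python
import numpy as np


def pca_block_bootstrap(data, block_len, rng):
    """Hypothetical helper: simulate a multivariate series of shape (n, d)."""
    n, d = data.shape
    mean = data.mean(axis=0)
    centered = data - mean

    # Step 1: PCA via eigendecomposition of the sample covariance matrix.
    # The scores are mutually uncorrelated but keep their autocorrelation.
    cov = np.cov(centered, rowvar=False)
    _, eigvecs = np.linalg.eigh(cov)
    scores = centered @ eigvecs

    # Step 2: resample each component independently with moving blocks,
    # so that short-range autocorrelation inside a block is preserved.
    n_blocks = -(-n // block_len)  # ceiling division
    sim_scores = np.empty_like(scores)
    for j in range(d):
        starts = rng.integers(0, n - block_len + 1, size=n_blocks)
        blocks = [scores[s:s + block_len, j] for s in starts]
        sim_scores[:, j] = np.concatenate(blocks)[:n]

    # Step 3: rotate the resampled components back to the original domain.
    return sim_scores @ eigvecs.T + mean


rng = np.random.default_rng(0)
# Synthetic correlated and autocorrelated data, for illustration only.
mixing = np.array([[1.0, 0.5, 0.0],
                   [0.0, 1.0, 0.5],
                   [0.0, 0.0, 1.0]])
data = np.cumsum(rng.standard_normal((200, 3)), axis=0) @ mixing
sim = pca_block_bootstrap(data, block_len=10, rng=rng)
print(sim.shape)
```

Because each component is resampled with its own block positions, the simulated series differs from the historical one while the cross-variable structure is reintroduced by the back-rotation in step 3.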

Improving Interpretability of Multivariate Data Through Rotations of Artificial Variates

  • Hwang, S.Y.;Park, A.M.
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 15, No. 2
    • /
    • pp.297-306
    • /
    • 2004
  • Multivariate data analysis usually produces a small number of related artificial variates for data reduction. Examples include MDS (multidimensional scaling), MDPREF (multidimensional preference analysis), CDA (canonical discriminant analysis), CCA (canonical correlation analysis) and FA (factor analysis). Varimax rotation of artificial variates, originally invented in FA for easier interpretation, is applied to the diverse multivariate techniques mentioned above. A real data analysis is performed to demonstrate that rotation improves the interpretation of artificial variates.

Bayesian Analysis of a New Skewed Multivariate Probit for Correlated Binary Response Data

  • Kim, Hea-Jung
    • Journal of the Korean Statistical Society
    • /
    • Vol. 30, No. 4
    • /
    • pp.613-635
    • /
    • 2001
  • This paper proposes a skewed multivariate probit model for analyzing correlated binary response data with covariates. The proposed model is formulated by introducing an asymmetric link based upon a skewed multivariate normal distribution. The model, connected to the asymmetric multivariate link, allows for flexible modeling of the correlation structure among binary responses and straightforward interpretation of the parameters. However, the complex likelihood function of the model prevents us from fitting and analyzing the model analytically. Simulation-based Bayesian inference methodologies are provided to overcome the problem. We examine the suggested methods on two data sets in order to demonstrate their performance.

Diagnosis of Observations after Fit of Multivariate Skew t-Distribution: Identification of Outliers and Edge Observations from Asymmetric Data

  • Kim, Seung-Gu
    • The Korean Journal of Applied Statistics
    • /
    • Vol. 25, No. 6
    • /
    • pp.1019-1026
    • /
    • 2012
  • This paper presents a method for identifying "edge observations" located on a boundary area constructed by a truncation variable, as well as for identifying outliers, after fitting a multivariate skew $t$-distribution (MST) to asymmetric data. The detection of edge observations is important in data analysis because it provides information on a certain critical area in the observation space. The proposed method is applied to an Australian Institute of Sport (AIS) dataset that is well known for asymmetry in the data space.

A GEE approach for the semiparametric accelerated lifetime model with multivariate interval-censored data

  • Maru Kim;Sangbum Choi
    • Communications for Statistical Applications and Methods
    • /
    • Vol. 30, No. 4
    • /
    • pp.389-402
    • /
    • 2023
  • Multivariate or clustered failure time data often occur in medical, epidemiological, and socio-economic studies when survival data are collected from several research centers. If the data are periodically observed as in a longitudinal study, survival times are often subject to various types of interval-censoring, creating multivariate interval-censored data. Then, the event times of interest may be correlated among individuals who come from the same cluster. In this article, we propose a unified linear regression method for analyzing multivariate interval-censored data. We consider a semiparametric multivariate accelerated failure time model as a statistical analysis tool and develop a generalized Buckley-James method to make inferences by imputing interval-censored observations with their conditional mean values. Since the study population consists of several heterogeneous clusters, where the subjects in the same cluster may be related, we propose a generalized estimating equations approach to accommodate potential dependence in clusters. Our simulation results confirm that the proposed estimator is robust to misspecification of the working covariance matrix and that statistical efficiency can increase when the working covariance structure is close to the truth. The proposed method is applied to the dataset from a diabetic retinopathy study.