• Title/Summary/Keyword: multivariate data

Search Result 1,977, Processing Time 0.028 seconds

Selection probability of multivariate regularization to identify pleiotropic variants in genetic association studies

  • Kim, Kipoong;Sun, Hokeun
    • Communications for Statistical Applications and Methods
    • /
    • v.27 no.5
    • /
    • pp.535-546
    • /
    • 2020
  • In genetic association studies, pleiotropy is a phenomenon where a variant or a genetic region affects multiple traits or diseases. There have been many studies identifying cross-phenotype genetic associations. But, most of statistical approaches for detection of pleiotropy are based on individual tests where a single variant association with multiple traits is tested one at a time. These approaches fail to account for relations among correlated variants. Recently, multivariate regularization methods have been proposed to detect pleiotropy in analysis of high-dimensional genomic data. However, they suffer a problem of tuning parameter selection, which often results in either too many false positives or too small true positives. In this article, we applied selection probability to multivariate regularization methods in order to identify pleiotropic variants associated with multiple phenotypes. Selection probability was applied to individual elastic-net, unified elastic-net and multi-response elastic-net regularization methods. In simulation studies, selection performance of three multivariate regularization methods was evaluated when the total number of phenotypes, the number of phenotypes associated with a variant, and correlations among phenotypes are different. We also applied the regularization methods to a wild bean dataset consisting of 169,028 variants and 17 phenotypes.

A Comparison of Univariate and Multivariate AR Models for Monthly River Flow Series (월유량에 대한 일변량 및 다변량 AR모형의 비교)

  • 이원환;심재현
    • Water for future
    • /
    • v.23 no.1
    • /
    • pp.99-107
    • /
    • 1990
  • The statistical analysis based on the past hydrologic data required to set up the water resources development plan and design the hydraulic structres rationally. Because hydrologic events have random factors implied, the sotchastic analysis is necessary. In this paper, same order of stochastic models of monthly runoff data(multivariate AR(1) and AR(2) models, univariate AR(1) and AR(2) models) are applied to compare the statistical characteristics. The other purpose of this paper is to compare the monthly series, which is generated by univariate and multivariate models. By comparing and estimating of each simulated series, it is known that the multivariate models, including the time and spatial colinearity, are better in prediction than univariate models in the analysis of monthly flow at south Han river basin.

  • PDF

Multivariate assessment of the occurrence of compound Hazards at the pan-Asian region

  • Davy Jean Abella;Kuk-Hyun Ahn
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2023.05a
    • /
    • pp.166-166
    • /
    • 2023
  • Compound hazards (CHs) are two or more extreme climate events combined which occur simultaneously in the same region at the same time. Compared to individual hazards, the combination of hazards that cause CHs can result in greater economic losses and deaths. While several extreme climate events have been recorded across Asia for the past decades, many studies have only focused on a single hazard. In this study, we assess the spatiotemporal pattern of dry compound hazards which includes drought, heatwave, fire and wind across Asia for the last 42 years (1980-2021) using the historical data from ERA5 Reanalysis dataset. We utilize a daily spatial data of each climate event to assess the occurrence of such compound hazards on a daily basis. Heatwave, fire and wind hazard occurrences are analyzed using daily percentile-based thresholds while a pre-defined threshold for SPI is applied for drought occurrence. Then, the occurrence of each type of compound hazard is taken from overlapping the map of daily occurrences of a single hazard. Lastly, a multivariate assessment are conducted to quantify the occurrence frequency, hotspots and trends of each type of compound hazard across Asia. By conducting a multivariate analysis of the occurrence of these compound hazards, we identify the relationships and interactions in dry compound hazards including droughts, heatwaves, fires, and winds, ultimately leading to better-informed decisions and strategies in the natural risk management.

  • PDF

A Rao-Robson Chi-Square Test for Multivariate Normality Based on the Mahalanobis Distances

  • Park, Cheolyong
    • Communications for Statistical Applications and Methods
    • /
    • v.7 no.2
    • /
    • pp.385-392
    • /
    • 2000
  • Many tests for multivariate normality are based on the spherical coordinates of the scaled residuals of multivariate observations. Moore and Stubblebine's (1981) Pearson chi-square test is based on the radii of the scaled residuals, or equivalently the sample Mahalanobis distances of the observations from the sample mean vector. The chi-square statistic does not have a limiting chi-square distribution since the unknown parameters are estimated from ungrouped data. We will derive a simple closed form of the Rao-Robson chi-square test statistic and provide a self-contained proof that it has a limiting chi-square distribution. We then provide an illustrative example of application to a real data with a simulation study to show the accuracy in finite sample of the limiting distribution.

  • PDF

A Multivariate Statistical Approach to the Categorization of Body Types for Korean Adults (다변량 통계분석 방법을 이용한 한국인 성인 남녀 체형분류)

  • Seong, Deok-Hyun;Jung, Eui-S.
    • Journal of the Ergonomics Society of Korea
    • /
    • v.24 no.4
    • /
    • pp.39-46
    • /
    • 2005
  • The purpose of the study is to suggest a methodology for properly categorizing the body type of Koreans based on the multivariate statistical analysis. Anthropometric data used in the study were measured from the sampled strata of about fifteen thousand Koreans surveyed through the 5th national anthropometic data measurement project called Size Korea funded by ATS, Korea, during 2003-2004. In order to categorize whole body types, the normalized anthropometric variables, being divided by its stature, were used for obtaining a set of factors that supposedly represent body types through the factor analysis. These factors, which were again clustered, yielded the body types according to the gender. The body types classified are expected to be applied to product design for clothing, furniture, automobile packaging, etc.

A Study of Simple Rock Mass Rating for Tunnel Using Multivariate Analysis (다변량분석을 이용한 터널에서의 간편 RMR에 관한 연구)

  • 위용곤;노상림;윤지선
    • Proceedings of the Korean Geotechical Society Conference
    • /
    • 2000.11a
    • /
    • pp.493-500
    • /
    • 2000
  • Rock Mass Rating has been widely applied to the underground tunnel excavation and many other practical problems in rock engineering. However, Rock Mass Rating is hard to make out because it is difficult to estimate each valuation items through all kind of field situations and items of RMR have interdependence. So the experts of tunnel assessment have problems with rating rock mass. In this study, using multivariate analysis based on domestic data(1011EA) of water conveyance tunnel, we presented rock mass rating system which is objective and easy to use. The constituents of RMR are decided to RQD, condition of discontinuities, groundwater conditions, orientation of discontinuities, intact rock strength, spacing of discontinuities in important order. In each step, we proposed the best multiple regression model for RMR system. And using data which have been collected at other site, we examined that presented multiple regression model was useful.

  • PDF

Effects of Non-normality on the Performance of Univariate and Multivariate CUSUM Control Charts (비정규 모집단에 대한 일변량 및 다변량 누적합 관리도의 성능 분석)

  • Chang, Young-Soon
    • Journal of Korean Society for Quality Management
    • /
    • v.34 no.4
    • /
    • pp.102-109
    • /
    • 2006
  • This paper investigates the effects of non-normality on the performance of univariate and multivariate cumulative sum(CUSUM) control charts for monitoring the process mean. In-control and out-of-control average run lengths of the charts are examined for the univariate/multivariate lognormal and t distributions. The effects of the reference value and the correlation coefficient under the non-normal distributions are also studied. Simulation results show that the CUSUM charts with small reference values are robust to non-normality but those with moderate or large reference values are sensitive to non-normal data especially to process data from skewed distributions. The performance of the chart to detect mean shift of a process is not invariant to the direction of the shift for skewed distributions.

Multivariate Sequential Rectifying Inspection with Applicability to the Motor Vehicle Emission Certified Test (자동차 배출가스보증시험에 다변수 축차검사의 적용에 관한 연구)

  • Jo, Jae-Rip
    • Journal of Korean Society for Quality Management
    • /
    • v.19 no.2
    • /
    • pp.63-77
    • /
    • 1991
  • Currently the problem of air pollution caused by the motor vehicle emission is one of the most serious problems to be solved. Thus we needed the inspection method and technical innovation constraining the motor vehicle emission. In order to establish the more reasonable certified test, the multivariate sequential rectifying inspection plan designed in this paper has been applied to the domestic vehicles by analyzing the statistic characteristics of the emission distribution. This inspection method is designed to satisfy the evaluation measure constraining domestic vehicle emission, and it serves the defect rectifying system and performance certification of catalytic converts. As the prior parameter for the domestic vehicles, we used the data for the catalytic converts which passed the certified test excuted by the EPK. For the case of engine test, we used those data which passed the certified test of domestic vehicles. The multivariate sequential rectifying inspection plan of the vector parameter is able to minimize the average sample number and increase the pass probability of operating characteristic curve.

  • PDF

A change point estimator in monitoring the parameters of a multivariate IMA(1, 1) model

  • Sohn, Sun-Yoel;Cho, Gyo-Young
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.2
    • /
    • pp.525-533
    • /
    • 2015
  • Modern production process is a very complex structure combined observations which are correlated with several factors. When the error signal occurs in the process, it is very difficult to know the root causes of an out-of-control signal because of insufficient information. However, if we know the time of the change, the system can be controlled more easily. To know it, we derive a maximum likelihood estimator (MLE) of the change point in a process when observations are from a multivariate IMA(1,1) process by monitoring residual vectors of the model. In this paper, numerical results show that the MLE of change point is effective in detecting changes in a process.

A Study on the Multivariate Stratified Random Sampling with Multiplicity (중복수가 있는 다변량 층화임의추출에 관한 연구(층별로 독립인 경우의 배분문제))

  • Kim, Ho-Il
    • Journal of the Korean Data and Information Science Society
    • /
    • v.10 no.1
    • /
    • pp.79-89
    • /
    • 1999
  • A counting rule that allows an element to be linked to more than one enumeration unit is called a multiplicity counting rule. Sample designs that use multiplicity counting rules are called network samples. Defining a network to be a set of observation units with a given linkage pattern, a network may be linked with more than one selection unit, and a single selection unit may be linked with more than one network. This paper considers allocation for multivariate stratified random sampling with multiplicity.

  • PDF