• Title/Summary/Keyword: Multivariate Statistical Analysis


Principal component analysis for Hilbertian functional data

  • Kim, Dongwoo;Lee, Young Kyung;Park, Byeong U.
    • Communications for Statistical Applications and Methods / v.27 no.1 / pp.149-161 / 2020
  • In this paper we extend functional principal component analysis for real-valued random functions to the case of Hilbert-space-valued functional random objects. For this, we introduce an autocovariance operator acting on the space of real-valued functions. We establish an eigendecomposition of the autocovariance operator and a Karhunen-Loève expansion. We propose estimators of the eigenfunctions and the functional principal component scores, and investigate the rates of convergence of the estimators to their targets. We detail the implementation of the methodology for the cases of compositional vectors and density functions, and illustrate the method by analyzing time-varying population composition data. We also discuss an extension of the methodology to multivariate cases and develop the corresponding theory.
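
For orientation, the classical real-valued case that the paper generalizes can be written as below. This is the standard textbook form for a mean-square continuous random function $X$ on a compact domain $\mathcal{T}$, not the Hilbert-space-valued version developed in the paper; the symbols $\mu$, $C$, $\lambda_k$, $\phi_k$ and $\xi_k$ are notation assumed here for illustration.

```latex
\begin{align*}
C(s,t) &= \operatorname{Cov}\bigl(X(s), X(t)\bigr)
        = \sum_{k=1}^{\infty} \lambda_k \,\phi_k(s)\,\phi_k(t), \\
X(t)   &= \mu(t) + \sum_{k=1}^{\infty} \xi_k \,\phi_k(t), \qquad
\xi_k   = \int_{\mathcal{T}} \bigl(X(t) - \mu(t)\bigr)\,\phi_k(t)\,dt, \\
\mathbb{E}[\xi_k] &= 0, \qquad
\operatorname{Cov}(\xi_j, \xi_k) = \lambda_k \,\mathbf{1}\{j = k\}.
\end{align*}
```

Here $\mu(t) = \mathbb{E}[X(t)]$, the $\phi_k$ are orthonormal eigenfunctions of the covariance operator with eigenvalues $\lambda_1 \geq \lambda_2 \geq \dots \geq 0$, and the $\xi_k$ are the functional principal component scores.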

Monocyte Count and Systemic Immune-Inflammation Index Score as Predictors of Delayed Cerebral Ischemia after Aneurysmal Subarachnoid Hemorrhage

  • Yeonhu Lee;Yong Cheol Lim
    • Journal of Korean Neurosurgical Society / v.67 no.2 / pp.177-185 / 2024
  • Objective: Delayed cerebral ischemia (DCI) is a major cause of disability in patients who survive aneurysmal subarachnoid hemorrhage (aSAH). Systemic inflammatory markers, such as peripheral leukocyte count and systemic immune-inflammation index (SII) score, have been considered predictors of DCI in previous studies. This study aims to investigate which systemic biomarkers are significant predictors of DCI. Methods: We conducted a retrospective, observational, single-center study of 170 patients with SAH admitted between May 2018 and March 2022. We analyzed the patients' clinical and laboratory parameters within 1 hour and 3-4 and 5-7 days after admission. The DCI and non-DCI groups were compared. Variables showing statistical significance in the univariate logistic analysis (p<0.05) were entered into a multivariate regression model. Results: Hunt-Hess grade "4-5" at admission, modified Fisher scale grade "3-4" at admission, hydrocephalus, intraventricular hemorrhage, and infection showed statistical significance (p<0.05) in the univariate logistic regression. Lymphocyte and monocyte counts at admission, SII scores and C-reactive protein levels on days 3-4, and leukocyte and neutrophil counts on days 5-7 also exhibited statistical significance in the univariate logistic regression. Multivariate logistic regression analysis revealed that monocyte count at admission (odds ratio [OR], 1.64; 95% confidence interval [CI], 1.04-2.65; p=0.036) and SII score at days 3-4 (OR, 1.55; 95% CI, 1.02-2.47; p=0.049) were independent predictors of DCI. Conclusion: Monocyte count at admission and SII score 3-4 days after rupture are independent predictors of clinical deterioration caused by DCI after aSAH. Peripheral monocytosis may be the primer for the innate immune reaction, and the SII score at days 3-4 can promptly represent the propagated systemic immune reaction toward DCI.
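
The link between an odds ratio, its confidence interval and the p-value can be illustrated with a short calculation. The sketch below assumes a Wald-type interval on the log-odds scale, which the abstract does not state; the function name `wald_from_or_ci` is hypothetical. Because the published OR and CI are rounded, the recovered p-values can only be expected to agree with the reported ones to roughly two decimal places.

```python
import math

def wald_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Back out the log-odds coefficient, its standard error, the Wald z
    statistic and a two-sided p-value from an OR and its 95% CI.
    Assumes a symmetric Wald interval on the log scale (an assumption)."""
    beta = math.log(odds_ratio)
    se = (math.log(ci_high) - math.log(ci_low)) / (2.0 * z_crit)
    z = beta / se
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided standard normal tail
    return beta, se, z, p

# Values taken from the abstract above.
monocyte = wald_from_or_ci(1.64, 1.04, 2.65)
sii = wald_from_or_ci(1.55, 1.02, 2.47)
print("monocyte count: z=%.2f, p=%.3f" % (monocyte[2], monocyte[3]))
print("SII score:      z=%.2f, p=%.3f" % (sii[2], sii[3]))
```

An odds ratio above 1 with a confidence interval whose lower bound sits just above 1 corresponds to a p-value close to the 0.05 threshold, which is the pattern seen for both predictors.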

Non-parametric approach for the grouped dissimilarities using the multidimensional scaling and analysis of distance (다차원척도법과 거리분석을 활용한 그룹화된 비유사성에 대한 비모수적 접근법)

  • Nam, Seungchan;Choi, Yong-Seok
    • The Korean Journal of Applied Statistics / v.30 no.4 / pp.567-578 / 2017
  • Grouped multivariate data can be tested for differences between two or more groups using multivariate analysis of variance (MANOVA). However, this method cannot be used if several assumptions of MANOVA are violated. In this case, multidimensional scaling (MDS) and analysis of distance (AOD) can be applied to grouped dissimilarities based on various distances. A permutation test is a non-parametric method that can also be used to test differences between groups. MDS is used to calculate the coordinates of observations from dissimilarities, and AOD is useful for finding group structure using the coordinates. In particular, AOD is mathematically associated with MANOVA when the Euclidean distance is used to compute the dissimilarities. In this paper, we study the between- and within-group structure by applying MDS and AOD to the grouped dissimilarities. In addition, we propose a new test statistic using the group structure for the permutation test. Finally, we investigate the relationship between AOD and MANOVA from dissimilarities based on the Euclidean distance.
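
The general workflow described in this abstract (coordinates from dissimilarities, then a permutation test for group differences) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' method: it uses classical (Torgerson) MDS and a PERMANOVA-style pseudo-F statistic as a stand-in for the new test statistic proposed in the paper, which the abstract does not specify. The function names and the synthetic data are hypothetical.

```python
import numpy as np

def classical_mds(D, k=2):
    """Classical (Torgerson) MDS: k-dimensional coordinates from distances D."""
    n = D.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n          # centering matrix
    B = -0.5 * J @ (D ** 2) @ J                  # double-centered Gram matrix
    vals, vecs = np.linalg.eigh(B)
    order = np.argsort(vals)[::-1][:k]           # indices of k largest eigenvalues
    vals_k = np.clip(vals[order], 0.0, None)     # guard against tiny negatives
    return vecs[:, order] * np.sqrt(vals_k)

def pseudo_f(D, labels):
    """Distance-based pseudo-F: between-group over within-group variation."""
    labels = np.asarray(labels)
    n = len(labels)
    groups = np.unique(labels)
    D2 = D ** 2
    ss_total = D2[np.triu_indices(n, 1)].sum() / n
    ss_within = 0.0
    for g in groups:
        idx = np.where(labels == g)[0]
        sub = D2[np.ix_(idx, idx)]
        ss_within += sub[np.triu_indices(len(idx), 1)].sum() / len(idx)
    ss_between = ss_total - ss_within
    a = len(groups)
    return (ss_between / (a - 1)) / (ss_within / (n - a))

def permutation_test(D, labels, n_perm=999, seed=0):
    """Permutation p-value for the pseudo-F statistic (labels are shuffled)."""
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    observed = pseudo_f(D, labels)
    hits = sum(pseudo_f(D, rng.permutation(labels)) >= observed
               for _ in range(n_perm))
    return observed, (hits + 1) / (n_perm + 1)

# Synthetic example: two groups of 10 points in 3 dimensions.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0.0, 1.0, (10, 3)), rng.normal(3.0, 1.0, (10, 3))])
labels = np.array([0] * 10 + [1] * 10)
D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)  # Euclidean distances

coords = classical_mds(D, k=2)
f_obs, p_value = permutation_test(D, labels, n_perm=199)
print(coords.shape, round(f_obs, 2), p_value)
```

With Euclidean distances, the total and within-group sums of squared distances used here equal the traces of the corresponding MANOVA sums-of-squares matrices, which is one concrete form of the AOD-MANOVA connection mentioned in the abstract.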

Complex sample design effects and inference for Korea National Health and Nutrition Examination Survey data (국민건강영양조사 자료의 복합표본설계효과와 통계적 추론)

  • Chung, Chin-Eun
    • Journal of Nutrition and Health / v.45 no.6 / pp.600-612 / 2012
  • Nutritional researchers worldwide are using large-scale sample survey methods to study nutritional health epidemiology and services utilization in general, non-clinical populations. This article provides a review of important statistical methods and software that apply to descriptive and multivariate analysis of data collected in sample surveys, such as national health and nutrition examination surveys. A comparative data analysis of the Korea National Health and Nutrition Examination Survey (KNHANES) was used to illustrate analytical procedures and design effects for survey estimates of population statistics, model parameters, and test statistics. The article focuses on three points: approaches to analyzing complex sample survey data, the software tools available to perform these analyses, and the correct survey analysis methods needed to interpret survey data. The latest developments in software tools for the analysis of complex sample survey data are covered, and empirical examples are presented that illustrate the impact of survey sample design effects on the parameter estimates, test statistics, and significance probabilities (p values) for univariate and multivariate analyses.
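
A design effect compares the variance of an estimate under the complex design with the variance under simple random sampling of the same size. The sketch below shows two widely used Kish approximations, offered as a rough illustration only: survey software typically estimates variances by linearization or replication rather than with these shortcuts, and all numeric inputs are hypothetical rather than KNHANES values.

```python
def kish_weight_deff(weights):
    """Kish approximation of the design effect due to unequal weights:
    deff_w = n * sum(w^2) / (sum(w))^2."""
    n = len(weights)
    total = sum(weights)
    total_sq = sum(w * w for w in weights)
    return n * total_sq / (total * total)

def cluster_deff(mean_cluster_size, icc):
    """Kish approximation of the design effect due to clustering:
    deff_c = 1 + (m - 1) * rho, with m the mean cluster size."""
    return 1.0 + (mean_cluster_size - 1.0) * icc

def effective_sample_size(n, deff):
    """Sample size of a simple random sample with the same precision."""
    return n / deff

# Hypothetical inputs for illustration.
deff_w = kish_weight_deff([1.0, 1.0, 2.0, 2.0])
deff_c = cluster_deff(mean_cluster_size=20, icc=0.05)
print(deff_w, deff_c, effective_sample_size(1000, deff_c))
```

A design effect above 1 means that standard errors computed as if the data were a simple random sample are too small, which is why ignoring the design can overstate significance.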

Multivariate Analysis on Invertebrate Communities in Litter and Soils of Japanese Red Pine Forests treated by Beauveria bassiana (백강균(白殭菌)을 처리(處理)한 소나무림의 낙엽(落葉)과 토양(土壤)에 서식(棲息)하는 무척주동물(無脊柱動物) 군집(群集)에 대한 다변량분석(多變量分析))

  • Kwon, Tae-Sung;Park, Young-Seuk;Shin, Sang-Chul;Lee, Buom-Young
    • Journal of Korean Society of Forest Science / v.90 no.5 / pp.593-599 / 2001
  • We tested whether treatment with Beauveria bassiana influences invertebrate communities in litter and soils using multivariate analysis. Principal components analysis (PCA) was used for the analysis. Using the distances between communities in the ordination space, we carried out statistical tests of whether any factors influenced the structure of the communities. We did not find any significant effects of the Beauveria treatment on invertebrate communities in either litter or soils.

Synonymous Codon Usage Analysis of the Mycobacteriophage Bxz1 and Its Plating Bacteria M. smegmatis: Identification of Highly and Lowly Expressed Genes of Bxz1 and the Possible Function of Its tRNA Species

  • Sahu, Keya;Gupta, Sanjib Kumar;Ghosh, Tapash Chandra;Sau, Subrata
    • BMB Reports / v.37 no.4 / pp.487-492 / 2004
  • Codon usage in the protein-coding genes of the mycobacteriophage Bxz1 and its plating bacterium, M. smegmatis, was determined, and codons ending with G and/or C were found to be predominant in both organisms. Multivariate statistical analysis showed that in both organisms, the genes were separated along the first major explanatory axis according to their expression levels and their genomic GC content at the synonymous third positions of the codons. The second major explanatory axis differentiates the genes according to their genome type. A comparison of the relative synonymous codon usage between 20 highly and 20 lowly expressed genes from Bxz1 identified 21 codons that are statistically overrepresented in the former group of genes. Further analysis found that the Bxz1-specific tRNA species could recognize 13 of the 21 overrepresented synonymous codons, which incorporated 13 amino acid residues preferentially into the highly expressed proteins of Bxz1. In contrast, seven amino acid residues were preferentially incorporated into the lowly expressed proteins by 10 other tRNA species of Bxz1. This analysis predicts for the first time that the Bxz1-specific tRNA species modulate the optimal expression of its proteins during development.
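
Relative synonymous codon usage (RSCU), the quantity compared between the two gene groups, is the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. The sketch below is a minimal illustration using the standard genetic code; the function name `rscu` and the toy sequence are hypothetical, and the paper's own pipeline and statistical test are not reproduced.

```python
from collections import Counter, defaultdict
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order ('*' marks stop codons).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def rscu(sequences):
    """RSCU per codon: count * (number of synonyms) / (total for that amino acid).
    Stop codons and single-codon amino acids (Met, Trp) are skipped."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper().replace("U", "T")
        for pos in range(0, len(seq) - len(seq) % 3, 3):
            codon = seq[pos:pos + 3]
            if codon in CODON_TABLE:        # ignores codons with ambiguous bases
                counts[codon] += 1
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families[aa].append(codon)
    result = {}
    for aa, codons in families.items():
        total = sum(counts[c] for c in codons)
        if total == 0 or len(codons) == 1:
            continue
        for c in codons:
            result[c] = counts[c] * len(codons) / total
    return result

# Toy coding sequence: four alanine codons (GCT, GCT, GCC, GCA).
example = rscu(["GCTGCTGCCGCA"])
print({c: example[c] for c in ("GCT", "GCC", "GCA", "GCG")})
# GCT -> 2.0, GCC -> 1.0, GCA -> 1.0, GCG -> 0.0
```

An RSCU above 1 indicates a codon used more often than expected under equal usage, so comparing RSCU values between highly and lowly expressed genes highlights candidate preferred codons.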

HPLC-tandem Mass Spectrometric Analysis of the Marker Compounds in Forsythiae Fructus and Multivariate Analysis

  • Cho, Hwang-Eui;Ahn, Su-Youn;Son, In-Seop;Hwang, Gyung-Hwa;Kim, Sun-Chun;Woo, Mi-Hee;Lee, Seung-Ho;Son, Jong-Keun;Hong, Jin-Tae;Moon, Dong-Cheul
    • Natural Product Sciences / v.17 no.2 / pp.147-159 / 2011
  • A high-performance liquid chromatography-electrospray ionization-tandem mass spectrometric method was developed to simultaneously determine eight marker constituents of Forsythiae fructus, and was subsequently applied to classify its two botanical origins. The marker compounds of Forsythia suspensa were phillyrin, pinoresinol, phillygenin, lariciresinol and forsythiaside; those of F. viridissima were arctiin, arctigenin and matairesinol. Separation of the eight analytes was achieved on a phenyl-hexyl column (150 $\times$ 2.0 mm i.d., 3 $\mu$m) using gradient elution with the mobile phase: (A) 10% acetonitrile in 0.5% acetic acid, (B) 40% aqueous acetonitrile. A few fragment ions specific to the types of lignans, among the product ions generated by collisionally induced dissociation (CID) of molecular ion clusters such as [M-H]$^-$ or [M+OAc]$^-$, were used not only for fingerprinting analysis but also for the quantification of each epimer in multiple-reaction monitoring mode. Good linearity ($r^2 \geq 0.9998$) was shown over a wide range for all analytes; intra- and inter-day precisions (RSD, %) were within 9.14% and the accuracy ranged from 84.3 to 115.1%. The analytical results of 40 drug samples, combined with multivariate statistical analyses (principal component analysis (PCA) and hierarchical cluster analysis (HCA)), clearly demonstrated the classification of the test samples according to their botanical origins. This method would provide a practical strategy for assessing the authenticity or quality of the herbal drug.

School Safety Education Factors Predicting Injury Prevalence Among Korean Adolescence (학교의 안전교육 관련 특성이 청소년의 사고발생 예측에 미치는 영향)

  • 이명선;박경옥
    • Korean Journal of Health Education and Promotion / v.21 no.2 / pp.147-165 / 2004
  • Injury is a leading cause of death among children and adolescents. In particular, more than 80% of unintentional injuries were related to risk-taking behaviors involved in diverse accidents around school and home. Therefore, educational approaches should be provided for children and adolescents, and schools are essential and appropriate sites for safety education. This study was conducted to identify injury prevalence and safety education at schools among middle and high school students in Korea. About 1,034 middle and high school students in 28 schools participated in a self-administered survey. The target schools were selected by stratified random sampling from schools in seven metropolitan cities in Korea. The questionnaires were delivered to the vice-principals by ground mail, and the vice-principals administered the survey data collection. The questionnaire asked about safety education provided in schools, injury experience in the last year, the need for injury prevention classes in school, and demographics. All survey responses were entered into an SPSS worksheet. Multivariate analysis of variance (MANOVA) and descriptive discriminant analysis (DDA) were used in the statistical analysis with SPSS software 11.1. MANOVA was conducted as a preliminary analysis for DDA. According to the MANOVA results, gender (male), grade (poor), living with both parents, and displaying injury prevention messages on the school news board differed significantly between the injured and uninjured student groups (p= .00). These four factors also had significant effects on students' injury experience in DDA, although the correlation of the four factors with injury experience was weak overall based on their canonical function coefficients. All structure coefficients of the four factors were greater than .30, which means the four factors have discriminant effects on injury prevalence. In decreasing order, the largest discriminant effects came from gender, grade, living with both parents, and safety message display on school news boards.

Evaluation of Water Quality and Phytoplankton Community Using a Multivariate Analysis in Bukhan River (다변량 통계분석을 이용한 북한강의 수질 및 식물플랑크톤 군집 특성 평가)

  • Kim, Hun Nyun;Youn, Seok Jea;Byeon, Myeong Seop;Yu, Soon Ju;Im, Jong Kwon
    • Journal of Korean Society on Water Environment / v.35 no.1 / pp.19-27 / 2019
  • The purpose of this study is to evaluate the water quality and phytoplankton community in the Bukhan River, which accounts for 44.4 % of the total inflow into Lake Paldang, using multivariate statistical techniques (i.e., correlation analysis and principal component analysis (PCA)/factor analysis (FA)). Water samples were collected from March to November 2015 and the following parameters were measured: water temperature, pH, DO, EC, SS, BOD, Chl-a, COD, TN, $NO_3-N$, $NH_3-N$, TP, DTP, $PO_4-P$, and phytoplankton community. The water quality of the main stream and the tributaries was not significantly different apart from the relatively high concentrations of BOD, COD and nutrients recorded in MH. The highest cell density of phytoplankton dominated by Stephanodiscus hantzschii and Merismopedia glauca was observed in PD. Based on the correlation analysis, total phytoplankton and Cyanophyceae were highly correlated with BOD, COD and nutrients. PCA/FA resulted in four main factors accounting for 82.240 % of the total variance in the water quality dataset. Component 1 (TN, DTN, DO, $NO_3-N$, water temperature) and component 2 ($PO_4-P$, TP, DTP, SS) were classified as nutrient factors, whereas component 3 (Chl-a, COD, BOD, $NH_3-N$, pH) was related to organic substances. Hence, the identification of the main potential environmental pollution factors in the Bukhan River will help policy makers make better and more informed decisions on how to improve the water quality.
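
A statement such as "four factors accounting for 82.240 % of the total variance" comes from the eigenvalues of the correlation matrix of the standardized variables. The sketch below shows that computation on synthetic data. It is a plain PCA without the factor rotation often used in PCA/FA studies, and the eigenvalue-greater-than-one retention rule is an assumption, since the abstract does not state how the factors were selected. The function name and data are hypothetical.

```python
import numpy as np

def pca_explained_variance(X):
    """PCA on standardized columns via SVD.
    Returns eigenvalues, explained-variance ratios and component loadings."""
    X = np.asarray(X, dtype=float)
    Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)   # z-scores per variable
    _, s, vt = np.linalg.svd(Z, full_matrices=False)
    eigvals = s ** 2 / (Z.shape[0] - 1)                # eigenvalues of corr. matrix
    ratio = eigvals / eigvals.sum()
    loadings = vt.T * np.sqrt(eigvals)                 # variable-component correlations
    return eigvals, ratio, loadings

# Synthetic stand-in for a samples-by-parameters water quality table.
rng = np.random.default_rng(0)
factors = rng.normal(size=(60, 2))
weights = rng.normal(size=(2, 6))
X = factors @ weights + 0.3 * rng.normal(size=(60, 6))

eigvals, ratio, loadings = pca_explained_variance(X)
cumulative = np.cumsum(ratio)
n_keep = int(np.sum(eigvals > 1.0))   # assumed retention rule (eigenvalue > 1)
print(n_keep, np.round(100 * cumulative, 1))
```

Variables with large absolute loadings on the same component are the ones grouped together when a component is interpreted, for example as a nutrient or organic-matter factor.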

Statistical Studies on the Formularies of Oriental Medicine(II) -Statistical Analyses of Ginseng Prescription- (한방 처방의 통계적 연구( II ) -인삼배합 한방처방의 통계적 연구-)

  • Hong, Moon-Wha
    • Korean Journal of Pharmacognosy / v.3 no.4 / pp.187-197 / 1972
  • Although the system of oriental medicine still remains in the realm of 'unproven methods of treatment', no one can deny that oriental medicine is a rich source of ideas and motivation for the discovery of new drugs from natural sources. However, the non-scientific, mystic hypothetical system of oriental medicine resists scientific elucidation. To draw useful parameters for inductive reasoning about the system, a new approach comprising statistical analyses of prescriptions was attempted in this study. One hundred and thirty-two ginseng-containing prescriptions in 'Bang-Yak-Hap-Pyon', one of the most popular formularies of oriental medicine in Korea, were analysed by a multivariate analysis technique. The results characterized ginseng from many points of view, e.g., therapeutic indications, dose, and compatibility. Among these, the most striking coincidence with the achievements of modern pharmacology is that oriental medicine has, since ancient times, characterized ginseng as neither a specific curative nor an aphrodisiac, but a non-specific adaptogenic drug for general infirmity.
