• Title/Summary/Keyword: Multivariate Statistical Analysis

Search Result 639, Processing Time 0.025 seconds

A Study on Pollution Levels and Source of Polychlorinated Biphenyl (PCB) in the Ambient Air of Korea and Japan (한국과 일본의 환경대기 중 폴리염화비페닐(PCB)의 농도수준 및 발생원 해석에 관한 연구)

  • Kim, Kyoung-Soo;Song, Byung-Joo;Kim, Jong-Guk;Kim, Kyeo-Keun
    • Journal of Korean Society of Environmental Engineers
    • /
    • v.27 no.2
    • /
    • pp.170-176
    • /
    • 2005
  • This study was conducted to investigate the level of PCBs and distribution of PCB congeners in the ambient air of Korea and Japan. The source of PCBs were also studied by a statistical method. The TEQ concentration of PCB in the ambient air of Korea and Japan were between 0.003 and $1.01\;pgTEQ/m^3$(mean value : $0.22\;pgTEQ/m^3$) and between 0.002 and $0.014\;pgTEQ/m^3$ (mean value: $0.007\;pgTEQ/m^3$), respectively. The ambient air of industrial area of Korea showed a fluctuation in PCB concentration than other sampling area. The isomer distribution patterns in the ambient air was more or less similar in all sampling places. In addition, highly chlorinated homologues ($7{\sim}10CB$) were detected in the only Korea industrial area. This observation suggests that there is a possibility of specific source of PCBs in the industrial area. The source identification of PCB in ambient air was performed using multivariate statistical analysis(principal component analysis). As a result, it is estimated that the Korean ambient air was more influenced by combustion process than the ambient air of Japan and also the effect of PCB commercial products was relatively a small.

Water Quality Assessment and Turbidity Prediction Using Multivariate Statistical Techniques: A Case Study of the Cheurfa Dam in Northwestern Algeria

  • ADDOUCHE, Amina;RIGHI, Ali;HAMRI, Mehdi Mohamed;BENGHAREZ, Zohra;ZIZI, Zahia
    • Applied Chemistry for Engineering
    • /
    • v.33 no.6
    • /
    • pp.563-573
    • /
    • 2022
  • This work aimed to develop a new equation for turbidity (Turb) simulation and prediction using statistical methods based on principal component analysis (PCA) and multiple linear regression (MLR). For this purpose, water samples were collected monthly over a five year period from Cheurfa dam, an important reservoir in Northwestern Algeria, and analyzed for 12 parameters, including temperature (T°), pH, electrical conductivity (EC), turbidity (Turb), dissolved oxygen (DO), ammonium (NH4+), nitrate (NO3-), nitrite (NO2-), phosphate (PO43-), total suspended solids (TSS), biochemical oxygen demand (BOD5) and chemical oxygen demand (COD). The results revealed a strong mineralization of the water and low dissolved oxygen (DO) content during the summer period. High levels of TSS and Turb were recorded during rainy periods. In addition, water was charged with phosphate (PO43-) in the whole period of study. The PCA results revealed ten factors, three of which were significant (eigenvalues >1) and explained 75.5% of the total variance. The F1 and F2 factors explained 36.5% and 26.7% of the total variance, respectively and indicated anthropogenic pollution of domestic agricultural and industrial origin. The MLR turbidity simulation model exhibited a high coefficient of determination (R2 = 92.20%), indicating that 92.20% of the data variability can be explained by the model. TSS, DO, EC, NO3-, NO2-, and COD were the most significant contributing parameters (p values << 0.05) in turbidity prediction. The present study can help with decision-making on the management and monitoring of the water quality of the dam, which is the primary source of drinking water in this region.

Evaluation of Water Quality Characteristics at Kyeongan Stream Using the Flow-Loading Equation and Factor Analysis (유량-오염부하량 관계식과 요인분석을 이용한 경안천의 수질특성 평가)

  • Kwon, Phil-Sang;Park, Min-Ji;Lee, Young-Joon;Cho, Yong-Chul;Noh, Chang-Wan;Jung, Woo-Seok;Kim, Ji-Ho;Yu, Soon-Ju
    • Ecology and Resilient Infrastructure
    • /
    • v.4 no.4
    • /
    • pp.226-236
    • /
    • 2017
  • In this study, we aimed to analyze the characteristics of water quality variation at Kyeongan Stream for a decade and to investigate by the flow-loading equation. The correlation analysis of water quality parameters and the influence factors were examined by statistical analysis. The characteristics of water quality variation showed that the fluctuations in $BOD_5$, $COD_{Mn}$ and TOC were repeated from year to year. TN and TP were decreased by year. By the flow-loading equation, the concentrations of $BOD_5$, $COD_{Mn}$, TOC and TN were decreased when the flow rate was on the rise. However, the flow did not affect the concentration of TP. According to correlation analysis, $BOD_5$ was highly correlated with $COD_{Mn}$ and TOC with the correlation coefficients of 0.890 (p<0.01) and 0.721 (p<0.01). The result of factor analysis, we identified that the water quality in Kyeongan Stream has been highly influenced by the organic matter index, followed by nitrogenous substance depending on the seasonal variations and the influx of suspended solid in accordance with the increase of flow.

Transmission of $Toxocara$ $canis$ via Ingestion of Raw Cow Liver: A Cross-Sectional Study in Healthy Adults

  • Choi, Dong-Il;Lim, Jae-Hoon;Choi, Dong-Chull;Lee, Kyung-Soo;Paik, Seung-Woon;Kim, Sun-Hee;Choi, Yoon-Ho;Huh, Sun
    • Parasites, Hosts and Diseases
    • /
    • v.50 no.1
    • /
    • pp.23-27
    • /
    • 2012
  • The aim of this study is to ascertain the relationship between ingestion of raw cow liver and $Toxocara$ $canis$ infection. A total of 150 apparently healthy adults were divided into 2 groups; 1 group consisted of 86 adults with positive results of Toxocara ELISA, and the other group of 64 adults with negative results. One researcher collected the history of ingestion of raw cow liver within 1 year and recent history of keeping dogs. Among 86 seropositive adults for $T.$ $canis$, 68 (79.1%) had a recent history of ingestion of raw cow liver. Multivariate statistical analysis showed that a recent ingestion of raw cow liver and keeping dogs were related to an increased risk of toxocariasis (odds ratios, 4.4 and 3.7; and 95% confidence intervals, 1.9-10.2 and 1.2-11.6, respectively). A recent history of ingestion of raw cow liver and keeping dogs was significantly associated with toxocariasis.

Roles of E-cadherin and Cyclooxygenase Enzymes in Predicting Different Survival Patterns of Optimally Cytoreduced Serous Ovarian Cancer Patients

  • Taskin, Salih;Dunder, Ilkkan;Erol, Ebru;Taskin, Elif Aylin;Kiremitci, Saba;Oztuna, Derya;Sertcelik, Ayse
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.13 no.11
    • /
    • pp.5715-5719
    • /
    • 2012
  • The relation between cyclooxygenase enzymes and E-cadherin, along with the roles of these markers in the prediction of survival in optimally cytoreduced serous ovarian cancer patients was investigated. Individuals who underwent primary staging surgery and achieved optimal cytoreduction (largest residual tumor volume <1 cm) constituted the study population. Specimens of 32 cases were immunohistochemically examined for cyclooxygenase-1, cyclooxygenase-2, and E-cadherin. Two could not be evaluated for E-cadherin and cyclooxygenase-1. Overall, 14/30, 19/30, and 15/32 cases were positive for E-cadherin, cyclooxygenase-1, and cyclooxygenase-2, respectively. The expressions of E-cadherin and cyclooxygenase-2 were inversely correlated (p:0.02). E-cadherin expression was related with favorable survival (p<0.001). The relation between the expression of cyclooxygenase enzymes and poor survival did not reach statistical significance. On multivariate analysis, E-cadherin appeared as an independent prognostic factor for survival. In conclusion, E-cadherin expression is strongly linked with favorable survival. E-cadherin and cyclooxygenase 2 may interact with each other during the carcinogenesis-invasion process. Further studies clarifying the relation between E-cadherin and cyclooxygenase enzymes may lead to new preventive and therapeutic targets in ovarian cancer.

The extension of the largest generalized-eigenvalue based distance metric Dij1) in arbitrary feature spaces to classify composite data points

  • Daoud, Mosaab
    • Genomics & Informatics
    • /
    • v.17 no.4
    • /
    • pp.39.1-39.20
    • /
    • 2019
  • Analyzing patterns in data points embedded in linear and non-linear feature spaces is considered as one of the common research problems among different research areas, for example: data mining, machine learning, pattern recognition, and multivariate analysis. In this paper, data points are heterogeneous sets of biosequences (composite data points). A composite data point is a set of ordinary data points (e.g., set of feature vectors). We theoretically extend the derivation of the largest generalized eigenvalue-based distance metric Dij1) in any linear and non-linear feature spaces. We prove that Dij1) is a metric under any linear and non-linear feature transformation function. We show the sufficiency and efficiency of using the decision rule $\bar{{\delta}}_{{\Xi}i}$(i.e., mean of Dij1)) in classification of heterogeneous sets of biosequences compared with the decision rules min𝚵iand median𝚵i. We analyze the impact of linear and non-linear transformation functions on classifying/clustering collections of heterogeneous sets of biosequences. The impact of the length of a sequence in a heterogeneous sequence-set generated by simulation on the classification and clustering results in linear and non-linear feature spaces is empirically shown in this paper. We propose a new concept: the limiting dispersion map of the existing clusters in heterogeneous sets of biosequences embedded in linear and nonlinear feature spaces, which is based on the limiting distribution of nucleotide compositions estimated from real data sets. Finally, the empirical conclusions and the scientific evidences are deduced from the experiments to support the theoretical side stated in this paper.

Estimation and Performance Analysis of Risk Measures using Copula and Extreme Value Theory (코퓰러과 극단치이론을 이용한 위험척도의 추정 및 성과분석)

  • Yeo, Sung-Chil
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.3
    • /
    • pp.481-504
    • /
    • 2006
  • VaR, a tail-related risk measure is now widely used as a tool for a measurement and a management of financial risks. For more accurate measurement of VaR, recently we are particularly concerned about the approach based on extreme value theory rather than the traditional method based on the assumption of normal distribution. However, many studies about the approaches using extreme value theory was done only for the univariate case. In this paper, we discuss portfolio risk measurements with modelling multivariate extreme value distributions by combining copulas and extreme value theory. We also discuss the estimation of ES together with VaR as portfolio risk measures. Finally, we investigate the relative superiority of EVT-copula approach than variance-covariance method through the back-testing of an empirical data.

Multidimensional scaling of categorical data using the partition method (분할법을 활용한 범주형자료의 다차원척도법)

  • Shin, Sang Min;Chun, Sun-Kyung;Choi, Yong-Seok
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.1
    • /
    • pp.67-75
    • /
    • 2018
  • Multidimensional scaling (MDS) is an exploratory analysis of multivariate data to represent the dissimilarity among objects in the geometric low-dimensional space. However, a general MDS map only shows the information of objects without any information about variables. In this study, we used MDS based on the algorithm of Torgerson (Theory and Methods of Scaling, Wiley, 1958) to visualize some clusters of objects in categorical data. For this, we convert given data into a multiple indicator matrix. Additionally, we added the information of levels for each categorical variable on the MDS map by applying the partition method of Shin et al. (Korean Journal of Applied Statistics, 28, 1171-1180, 2015). Therefore, we can find information on the similarity among objects as well as find associations among categorical variables using the proposed MDS map.

Time-varying modeling of the composite LN-GPD (시간에 따라 변화하는 로그-정규분포와 파레토 합성 분포의 모형 추정)

  • Park, Sojin;Baek, Changryong
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.1
    • /
    • pp.109-122
    • /
    • 2018
  • The composite lognormal-generalized Pareto distribution (LN-GPD) is a mixture of right-truncated lognormal and GPD for a given threshold value. Scollnik (Scandinavian Actuarial Journal, 2007, 20-33, 2007) shows that the composite LN-GPD is adequate to describe body distribution and heavy-tailedness. This paper considers time-varying modeling of the LN-GPD based on local polynomial maximum likelihood estimation. Time-varying model provides significant detailed information of time dependent data, hence it can be applied to disciplines such as service engineering for staffing and resources management. Our work also extends to Beirlant and Goegebeur (Journal of Multivariate Analysis, 89, 97-118, 2004) in the sense of losing no data by including truncated lognormal distribution. Our proposed method is shown to perform adequately in simulation. Real data application to the service time of the Israel bank call center shows interesting findings on the staffing policy.

National perioperative outcomes of flap coverage for pressure ulcers from 2005 to 2015 using American College of Surgeons National Surgical Quality Improvement Program

  • Tran, Bao Ngoc N.;Chen, Austin D.;Kamali, Parisa;Singhal, Dhruv;Lee, Bernard T.;Fukudome, Eugene Y.
    • Archives of Plastic Surgery
    • /
    • v.45 no.5
    • /
    • pp.418-424
    • /
    • 2018
  • Background Complication rates after flap coverage for pressure ulcers have been high historically. These patients have multiple risk factors associated with poor wound healing and complications including marginal nutritional status, prolonged immobilization, and a high comorbidities index. This study utilizes the National Surgical Quality Improvement Program (NSQIP) to examine perioperative outcomes of flap coverage for pressure ulcers. Methods Data from the NSQIP database (2005-2015) for patient undergoing flap coverage for pressure ulcers was identified. Demographic, perioperative information, and complications were reviewed. One-way analysis of variance and Pearson chi-square were used to assess differences for continuous variables and nominal variables, respectively. Multivariate logistic regression was performed to identify independent risk factors for complications. Results There were 755 cases identified: 365 (48.3%) sacral ulcers, 321 (42.5%) ischial ulcers, and 69 (9.1%) trochanteric ulcers. Most patients were older male, with some degree of dependency, neurosensory impairment, high functional comorbidities score, and American Society of Anesthesiologists class 3 or above. The sacral ulcer group had the highest incidence of septic shock and bleeding, while the trochanteric ulcer group had the highest incidence of superficial surgical site infection. There was an overall complication rate of 25% at 30-day follow-up. There was no statistical difference in overall complication among groups. Total operating time, diabetes, and non-elective case were independent risk factors for overall complications. Conclusions Despite patients with poor baseline functional status, flap coverage for pressure ulcer patients is safe with acceptable postoperative complications. This type of treatment should be considered for properly selected patients.