• Title/Summary/Keyword: 정준상관

Search Result 100, Processing Time 0.028 seconds

A comparison study of canonical methods: Application to -Omics data (오믹스 자료를 이용한 정준방법 비교)

  • Seungsoo Lee;Eun Jeong Min
    • The Korean Journal of Applied Statistics
    • /
    • v.37 no.2
    • /
    • pp.157-176
    • /
    • 2024
  • Integrative analysis for better understanding of complex biological systems gains more attention. Observing subjects from various perspectives and conducting integrative analysis of those multiple datasets enables a deeper understanding of the subject. In this paper, we compared two methods that simultaneously consider two datasets gathered from the same objects, canonical correlation analysis (CCA) and co-inertia analysis (CIA). Since CCA cannot handle the case when the data exhibit high-dimensionality, two strategies were considered instead: Utilization of a ridge constant (CCA-ridge) and substitution of covariance matrices of each data to identity matrix and then applying penalized singular value decomposition (CCA-PMD). To illustrate CIA and CCA, both extensions of CCA and CIA were applied to NCI60 cell line data. It is shown that both methods yield biologically meaningful and significant results by identifying important genes that enhance our comprehension of the data. Their results shows some dissimilarities arisen from the different criteria used to measure the relationship between two sets of data in each method. Additionally, CIA exhibits variations dependent on the weight matrices employed.

Relationships between Soil-Site Properties and Bamboo (Phyllostachys bambusoides) Growth (토양(土壤)의 이화학적(理化學的) 특성(特性)과 대나무 생장(生長)과의 관계(關係))

  • Chung, Young Gwan;Ramm, Carl W.
    • Journal of Korean Society of Forest Science
    • /
    • v.79 no.1
    • /
    • pp.16-20
    • /
    • 1990
  • Canonical correlation analysis was used to relate 17 soil-site variables to bamboo diameter, height, and internodal characteristics. The first canonical correlation was highly significant, explained much of the variance in both sets of variables, and the canonical variates made sense biologically. Surface soil depth, total nitrogen and percent organic matter had high positive correlations with the first soil-site canonical variate. Clay content (%) and cation exchange capacity were negatively correlated with the first soil-site canonical variate. Only 8 of predictor variables were considered relevant for predicting bamboo growth.

  • PDF

Development of a Compound Classification Process for Improving the Correctness of Land Information Analysis in Satellite Imagery - Using Principal Component Analysis, Canonical Correlation Classification Algorithm and Multitemporal Imagery - (위성영상의 토지정보 분석정확도 향상을 위한 응용체계의 개발 - 다중시기 영상과 주성분분석 및 정준상관분류 알고리즘을 이용하여 -)

  • Park, Min-Ho
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.28 no.4D
    • /
    • pp.569-577
    • /
    • 2008
  • The purpose of this study is focused on the development of compound classification process by mixing multitemporal data and annexing a specific image enhancement technique with a specific image classification algorithm, to gain more accurate land information from satellite imagery. That is, this study suggests the classification process using canonical correlation classification technique after principal component analysis for the mixed multitemporal data. The result of this proposed classification process is compared with the canonical correlation classification result of one date images, multitemporal imagery and a mixed image after principal component analysis for one date images. The satellite images which are used are the Landsat 5 TM images acquired on July 26, 1994 and September 1, 1996. Ground truth data for accuracy assessment is obtained from topographic map and aerial photograph, and all of the study area is used for accuracy assessment. The proposed compound classification process showed superior efficiency to appling canonical correlation classification technique for only one date image in classification accuracy by 8.2%. Especially, it was valid in classifying mixed urban area correctly. Conclusively, to improve the classification accuracy when extracting land cover information using Landsat TM image, appling canonical correlation classification technique after principal component analysis for multitemporal imagery is very useful.

Underwater Target Analysis Using Canonical Correlation Analysis (정준상관분석을 이용한 수중표적 분석)

  • Seok, Jong-Won;Kim, Tae-Hwan;Bae, Keun-Sung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.16 no.9
    • /
    • pp.1878-1883
    • /
    • 2012
  • Generally, in the underwater target recognition, feature vectors are extracted from the target signal utilizing spatial information according to target shape/material characteristics. And, various signal processing techniques have been studied to extract feature vectors which is less sensitive to the location of the receiver. In this paper, we analyzed the characteristics of synthesized underwater objects using canonical correlation analysis method which is relatively less sensitive to the location of receiver. Canonical correlation analysis is applied to two consecutive backscattered sonar returns at different aspect angles to analyze the correlation characteristics in multi-aspect environment.

Equivalence study of canonical correspondence analysis by weighted principal component analysis and canonical correspondence analysis by Gaussian response model (가중주성분분석을 활용한 정준대응분석과 가우시안 반응 모형에 의한 정준대응분석의 동일성 연구)

  • Jeong, Hyeong Chul
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.6
    • /
    • pp.945-956
    • /
    • 2021
  • In this study, we considered the algorithm of Legendre and Legendre (2012), which derives canonical correspondence analysis from weighted principal component analysis. And, it was proved that the canonical correspondence analysis based on the weighted principal component analysis is exactly the same as Ter Braak's (1986) canonical correspondence analysis based on the Gaussian response model. Ter Braak (1986)'s canonical correspondence analysis derived from a Gaussian response curve that can explain the abundance of species in ecology well uses the basic assumption of the species packing model and then conducts generalized linear model and canonical correlation analysis. It is derived by way of binding. However, the algorithm of Legendre and Legendre (2012) is calculated in a method quite similar to Benzecri's correspondence analysis without such assumptions. Therefore, if canonical correspondence analysis based on weighted principal component analysis is used, it is possible to have some flexibility in using the results. In conclusion, this study shows that the two methods starting from different models have the same site scores, species scores, and species-environment correlations.

A Study on Shape Variability in Canonical Correlation Biplot with Missing Values (결측값이 있는 정준상관 행렬도의 형상변동 연구)

  • Hong, Hyun-Uk;Choi, Yong-Seok;Shin, Sang-Min;Ka, Chang-Wan
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.5
    • /
    • pp.955-966
    • /
    • 2010
  • Canonical correlation biplot is a useful biplot for giving a graphical description of the data matrix which consists of the association between two sets of variables, for detecting patterns and displaying results found by more formal methods of analysis. Nevertheless, when some values are missing in data, most biplots are not directly applicable. To solve this problem, we estimate the missing data using the median, mean, EM algorithm and MCMC imputation methods according to missing rates. Even though we estimate the missing values of biplot of incomplete data, we have different shapes of biplots according to the imputation methods and missing rates. Therefore we use a RMS(root mean square) which was proposed by Shin et al. (2007) and PS(procrustes statistic) for measuring and comparing the shape variability between the original biplots and the estimated biplots.

An Analysis of Economic Interdependency between Regions using the Canonical Correlation (for the working trip in Seoul metropolitan area) (정준상관분석기법을 이용한 지역간 경제적 의존성 분석 (수도권 출근목적통행량을 기준으로))

  • 노정현;변미정;김태균;차경준
    • Journal of Korean Society of Transportation
    • /
    • v.20 no.7
    • /
    • pp.5-13
    • /
    • 2002
  • 고도의 산업화성장에 따라 도시의 기능이 지역간의 완전분리가 아닌 경제적 사회적 상호 연관성이 점점 증가하고 그 내용도 복잡하고 다양해짐에 따라 이에 대한 체계적인 분석이 필요하게 되었으며, 그에 대한 연구 또한 활발히 진행되고 있다. 그러나 기존에 연구에서 제시하고 있는 각 지역단위별로 취업자수에 대한 고용자수 비율 또는 출근통행의 유출량에 대한 유입량 비율 등의 단순지표를 이용한 지역간의 경제적 의존성을 설명하는 데는 한계가 있다고 할 수 있다. 따라서 본 연구는 지역간의 경제적 연관관계 즉 경제적 의존성을 복합적이고 표준화 할 수 있는 계량치로 추정하기 위해 두 변수 집합간의 연관성을 추정하는데 매우 유용한 분석기법인 정준상관분석 기법을 이용하여 추정하고자 하였다. 이에 수도권 72개존의 출근통행자료을 이용하여 지역간의 경제적 의존성을 측정하였으며, 각 존들로 구성된 지역간의 정준상관계수 및 각 존들의 정준가 중계수를 통해 통계적으로 정산되어진 표준화된 계수를 산출하였다. 그 결과 대존의 경우 경기도와 인천시는 각각 0.9753. 0.2968 정도의 서울에 대한 경제적 의존정도를 보이는 것으로 나타나 서울에 대한 경기도의 경제적 의존성이 높은 것으로 나타났으며, 산출된 정준가중계수를 살펴보면 분당구와 서울시의 중구는 서울에 대한 경기도의 경제적 의존성에 가장 높은 영향을 미친 것으로 나타났으며, 또한 중존에 해당되는 인천의 3개 권역, 경기의 16개 권역의 서울 5개 권역에 대한 경제적 의존성도 분석되었다.

Canonical correlation between body information and lipid-profile: A study on the National Health Insurance Big Data in Korea

  • Jo, Han-Gue;Kang, Young-Heung
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.1
    • /
    • pp.201-208
    • /
    • 2021
  • This study aims to provide the relevant basis upon which prediction of dyslipidemia should be made based on body information. Using the National Health Insurance big data (3,312,971 people) canonical correlation analysis was performed between body information and lipid-profile. Body information included age, height, weight and waist circumference, while the lipid-profile included total cholesterol, triglycerides, HDL cholesterol and LDL cholesterol. As a result, when the waist circumference and the weight are large, triglycerides increase and HDL cholesterol level decreases. In terms of age, weight, waist circumference, and HDL cholesterol, the canonical variates (the degree of influence) were significantly different according to sex. In particular, the canonical variate was dramatically changed around the forties and fifties in women in terms of weight, waist circumference, and HDL cholesterol. The canonical correlation results of the health care big data presented in this study will help construct a predictive model that can evaluate an individual's health status based on body information that can be easily measured in a non-invasive manner.

Quantification Method of Tunnel Face Classification Using Canonical Correlation Analysis (정준상관분석을 이용한 막장등급평가 수량화기법 연구)

  • Seo Yong-Seok;Kim Chang-Yong;Kim Kwang-Yeom;Lee Hyun-Woo
    • The Journal of Engineering Geology
    • /
    • v.15 no.4 s.42
    • /
    • pp.463-473
    • /
    • 2005
  • Because of using the same rating ranges for every rock types the RMR or the Q-system could not usually consider local geological characteristics They also could not present sufficiently the engineering anisotropy of rocks. The canonical correlation analysis was carried out with 3 kinds of face mapping data obtained from granite, sedimentary rock and phyllite in order to clarify a discrepancy between rock types. According to analysis results, as a type of rocks changes, RM factors have different influences on the total rating of RMR.

Usefulness of Canonical Correlation Classification Technique in Hyper-spectral Image Classification (하이퍼스펙트럴영상 분류에서 정준상관분류기법의 유용성)

  • Park, Min-Ho
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.26 no.5D
    • /
    • pp.885-894
    • /
    • 2006
  • The purpose of this study is focused on the development of the effective classification technique using ultra multiband of hyperspectral image. This study suggests the classification technique using canonical correlation analysis, one of multivariate statistical analysis in hyperspectral image classification. High accuracy of classification result is expected for this classification technique as the number of bands increase. This technique is compared with Maximum Likelihood Classification(MLC). The hyperspectral image is the EO1-hyperion image acquired on September 2, 2001, and the number of bands for the experiment were chosen at 30, considering the band scope except the thermal band of Landsat TM. We chose the comparing base map as Ground Truth Data. We evaluate the accuracy by comparing this base map with the classification result image and performing overlay analysis visually. The result showed us that in MLC's case, it can't classify except water, and in case of water, it only classifies big lakes. But Canonical Correlation Classification (CCC) classifies the golf lawn exactly, and it classifies the highway line in the urban area well. In case of water, the ponds that are in golf ground area, the ponds in university, and pools are also classified well. As a result, although the training areas are selected without any trial and error, it was possible to get the exact classification result. Also, the ability to distinguish golf lawn from other vegetations in classification classes, and the ability to classify water was better than MLC technique. Conclusively, this CCC technique for hyperspectral image will be very useful for estimating harvest and detecting surface water. In advance, it will do an important role in the construction of GIS database using the spectral high resolution image, hyperspectral data.