• Title/Summary/Keyword: 주성분분석

Search Result 1,982, Processing Time 0.031 seconds

Molecular Profiling of Clinical Features in Breast Cancer Using Principal Component Analysis (주성분 분석 방법을 이용한 유방암의 임상적 특징과 관련된 유전자 분석)

  • Han, Mi-Ryung;Lee, Seok-Ho;Han, Won-Shik;Kim, Mi-Hyeon;Noh, Dong-Young;Kim, Ju-Han
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2004.11a
    • /
    • pp.29-35
    • /
    • 2004
  • 유방암 환자의 임상정보(clinical features)와 cDNA microarray 기술을 이용하여 얻은 유전자 발현 프로파일은 유방암 예후 인자를 찾는 데에 매우 중요하다. 본 논문에서는 임상정보와 유전자 발현 정보를 접목해서 분석하는 방법으로써 주성분 분석(Principal Component Analysis)을 이용하였다. 이 방법은 다변량 자료의 차원을 줄이는 방법으로써, 대용량 실험 데이터로 인해 발생하는 문제점을 해결하기 위하여 많이 쓰이고 있다. 본 연구에서는 주성분 분석을 이용하여 먼저 한국인 유방암 환자 73명의 cDNA microarray 데이터 차원을 줄이고, 이를 통해 얻어진 주성분(Principal Components)과 임상정보 데이터와의 상관관계를 보았다. One-way ANOVA를 이용한 상관관계 분석 결과의 P-value는 permutation test를 통해 검증하였다. 동일한 방법을 estrogen receptor(ER)(+) 환자 20명과 ER(-) 환자 31명에 적용해본 결과, ER(-) 환자 중에서 재발과 관련된 유전자를 찾을 수 있었다. 주성분 분석을 molecular phenotypic profiles of clinical features에 이용한 결과 발견된 유전자는 유방암의 재발과 관련된 예후 인자로서 의미가 있다.

  • PDF

Application of Numerical Methods in the Zonation and Correlation of Four Late Quaternary Pollen Data from lows (수치분석의 도식화를 통한 제사기 화분자료의 분대 및 대비)

  • Hyung Keun Kim
    • The Korean Journal of Quaternary Research
    • /
    • v.3 no.1
    • /
    • pp.55-68
    • /
    • 1989
  • This paper presents examples of the computer-aided zonation and correlation of pollen data from the Late-glacial to Holocene stratigraphic sequences at four sites in central Iowa, U.S.A. Spearman's rank correlation coefficient matrix and first four components of Principal components analysis plotted in a stratigraphic order are combined to provide an excellent zonation of the pollen data at each site. Correlation of the four pollen sequences are conducted by Principal components analysis of the data sets combined in one. The first and second principal components successfully provide correlation lines that match fairly closely the zone boundaries of each pollen sequence. The third and fourth components, in contrast, are greatly different from site to site, representing the unique pollen assemblages at each site.

  • PDF

A Multi-Resolution Distance Measure for Two Dimensional Images Using Principal Component Analysis and Independent Component Analysis (주성분분석 및 독립성분분석을 이용한 이차원 영상에서의 다중해상도 거리 측정)

  • 홍준식
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2002.04a
    • /
    • pp.247-249
    • /
    • 2002
  • 본 논문에서는 주성분 분석(principal component analysis; 이하 PCA) 및 독립성분분석(independent component analysis; 이하 ICA)을 이용, 이차원 영상을 분류하여 다중해상도에서 영상간의 거리를 측정하여 PCA 와 ICA 중에서 어느 것이 영상간의 상대적 식별을 용이하게 하는지 모의 실험을 통하여 확인하고자 한다. 모의 실험 결과로부터, ICA가 PCA에 비하여 영상간의 상대적 식별이 용이하여 빨리 수렴이 되는 것을 모의 실험을 통하여 확인하였다.

  • PDF

Principal Component Analysis on Marine Casualties Occurred at Korean Littoral Sea in Recent 5 Years (최근 5년간 국내 연근해에서 발생한 해양사고에 대한 주성분분석)

  • KIM, Yeong-Sik
    • Journal of Fisheries and Marine Sciences Education
    • /
    • v.28 no.2
    • /
    • pp.465-472
    • /
    • 2016
  • Principal Component Analysis (PCA) is useful statistical technique for finding patterns in data, and expressing the data in such a way as to highlight their similarities and differences. In this paper, 1417 marine casualties occurred in Korean littoral sea in recent 5 years, were examined by the PCA. The main results obtained were as follows : 1. Most of marine casualties resulted from the human factors such as careless operation and insufficient engine maintenance. 2. Collision and standing mainly resulted from steering room-related human factors such as careless guard, inadequate ship-handling, however engine damage and fire explosion mainly resulted from engine room-related human factor such as bad handling of engine system. 3. No. 1 principal component represents accident frequency, No. 2 principal component represents the cause and No. 3 principal component represents the pattern of marine casualties, respectively.

Development of Monitoring System for the LNG plant fractionation process based on Multi-mode Principal Component Analysis (다중모드 주성분분석에 기반한 천연가스 액화플랜트의 성분 분리공정 감시 시스템 개발)

  • Pyun, Hahyung;Lee, Chul-Jin;Lee, Won Bo
    • Journal of the Korean Institute of Gas
    • /
    • v.23 no.4
    • /
    • pp.19-27
    • /
    • 2019
  • The consumption of liquefied natural gas (LNG) has increased annually due to the strengthening of international environmental regulations. In order to produce stable and efficient LNG, it is essential to divide the global (overall) operating condition and construct a quick and accurate monitoring system for each operation condition. In this study, multi-mode monitoring system is proposed to the LNG plant fractionation process. First, global normal operation data is divided to local (subdivide) normal operation data using global principal component analysis (PCA) and k-means clustering method. And then, the data to be analyzed were matched with the local normal mode. Finally, it is determined the state of process abnormality through the local PCA. The proposed method is applied to 45 fault case and it proved to be more than 5~10% efficient compared to the global PCA and univariate monitoring.

Detecting Influential Observations in Multivariate Statistical Analysis of Incomplete Data by PCA (주성분분석에 의한 결손 자료의 영향값 검출에 대한 연구)

  • 김현정;문승호;신재경
    • The Korean Journal of Applied Statistics
    • /
    • v.13 no.2
    • /
    • pp.383-392
    • /
    • 2000
  • Since late 1970, methods of influence or sensitivity analysis for detecting influential observations have been studied not only in regression and related methods but also in various multivariate methods. If results of multivariate analyses sometimes depend heavily on a small number of observations, we should be very careful to draw a conclusion. Similar phenomena may also occur in the case of incomplete data. In this research we try to study such influential observations in multivariate statistical analysis of incomplete data. Case of principal component analysis is studied with a numerical example.

  • PDF

Text Summarization using PCA and SVD (주성분 분석과 비정칙치 분해를 이용한 문서 요약)

  • Lee, Chang-Beom;Kim, Min-Soo;Baek, Jang-Sun;Park, Hyuk-Ro
    • The KIPS Transactions:PartB
    • /
    • v.10B no.7
    • /
    • pp.725-734
    • /
    • 2003
  • In this paper, we propose the text summarization method using PCA (Principal Component Analysis) and SVD (Singular Value Decomposition). The proposed method presents a summary by extracting significant sentences based on the distances between thematic words and sentences. To extract thematic words, we use both word frequency and co-occurence information that result from performing PCA. To extract significant sentences, we exploit Euclidean distances between thematic word vectors and sentence vectors that result from carrying out SVD. Experimental results using newspaper articles show that the proposed method is superior to the method using either word frequency or only PCA.

PCA­based Waveform Classification of Rabbit Retinal Ganglion Cell Activity (주성분분석을 이용한 토끼 망막 신경절세포의 활동전위 파형 분류)

  • 진계환;조현숙;이태수;구용숙
    • Progress in Medical Physics
    • /
    • v.14 no.4
    • /
    • pp.211-217
    • /
    • 2003
  • The Principal component analysis (PCA) is a well-known data analysis method that is useful in linear feature extraction and data compression. The PCA is a linear transformation that applies an orthogonal rotation to the original data, so as to maximize the retained variance. PCA is a classical technique for obtaining an optimal overall mapping of linearly dependent patterns of correlation between variables (e.g. neurons). PCA provides, in the mean-squared error sense, an optimal linear mapping of the signals which are spread across a group of variables. These signals are concentrated into the first few components, while the noise, i.e. variance which is uncorrelated across variables, is sequestered in the remaining components. PCA has been used extensively to resolve temporal patterns in neurophysiological recordings. Because the retinal signal is stochastic process, PCA can be used to identify the retinal spikes. With excised rabbit eye, retina was isolated. A piece of retina was attached with the ganglion cell side to the surface of the microelectrode array (MEA). The MEA consisted of glass plate with 60 substrate integrated and insulated golden connection lanes terminating in an 8${\times}$8 array (spacing 200 $\mu$m, electrode diameter 30 $\mu$m) in the center of the plate. The MEA 60 system was used for the recording of retinal ganglion cell activity. The action potentials of each channel were sorted by off­line analysis tool. Spikes were detected with a threshold criterion and sorted according to their principal component composition. The first (PC1) and second principal component values (PC2) were calculated using all the waveforms of the each channel and all n time points in the waveform, where several clusters could be separated clearly in two dimension. We verified that PCA-based waveform detection was effective as an initial approach for spike sorting method.

  • PDF

FPCA for volatility from high-frequency time series via R-function (FPCA를 통한 고빈도 시계열 변동성 분석: R함수 소개와 응용)

  • Yoon, Jae Eun;Kim, Jong-Min;Hwang, Sun Young
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.6
    • /
    • pp.805-812
    • /
    • 2020
  • High-frequency data are now prevalent in financial time series. As a functional data arising from high-frequency financial time series, we are concerned with the intraday volatility to which functional principal component analysis (FPCA) is applied in order to achieve a dimension reduction. A review on FPCA and R function is made and high-frequency KOSPI volatility is analysed as an application.

The Provenance and Characteristic Classification of the White Porcelain in the Gyeongsangnam-do by Neutron Activation Analysis (중성자방사화분석을 활용한 경상남도 백자의 산지 및 특성 분류)

  • Kim, Na-Young;Kim, Gyu-Ho
    • Journal of Conservation Science
    • /
    • v.21
    • /
    • pp.89-100
    • /
    • 2007
  • This study analyze concentration of minor and trace elements on 47 white porcelains excavated from Dudong-ri, Baekryeon-ri, Sachon-ri kilns in Gyeonsangnam-do by NAA(neutron activation analysis) and try to classify the provenance and characteristics according to the analytical result. Each kilns are divided into the group by PCA(principal component analysis) and LDA(linear discrimination analysis) using 17 elements; Ba Ce, Co, Cr, Cs, Dy, Eu, Hf, La Lu, Rb, Sc, Sm, Ta, Th, V, Yb. The contribution elements are Dy, Sm, La, Ce, Lu, Sc. And soft and hard white porcelains are similar with the chemical composition of the use materials therefore the difference of the chemical composition not confirmed a cause. The analytical results of the fine(I) and poor(II) quality white porcelains presume the difference of the povenance of clay materials or the poduction process such as difference purify and additive materials.

  • PDF