• Title/Summary/Keyword: Principal component analysis(PCA)

Search Result 1,231, Processing Time 0.029 seconds

Document Clustering Technique by K-means Algorithm and PCA (주성분 분석과 k 평균 알고리즘을 이용한 문서군집 방법)

  • Kim, Woosaeng;Kim, Sooyoung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.18 no.3
    • /
    • pp.625-630
    • /
    • 2014
  • The amount of information is increasing rapidly with the development of the internet and the computer. Since these enormous information is managed by the document forms, it is necessary to search and process them efficiently. The document clustering technique which clusters the related documents through the similarity between the documents help to classify, search, and process the large amount of documents automatically. This paper proposes a method to find the initial seed points through principal component analysis when the documents represented by vectors in the feature vector space are clustered by K-means algorithm in order to increase clustering performance. The experiment shows that our method has a better performance than the traditional K-means algorithm.

Noise Reduction for the MEG and MCG using the PCA (주 성분 분석법을 이용한 심자도 및 유발자게 신호에서 펄스 잡음 및 뇌자도 잡음 제거)

  • Lee, D.H.;Chang, K.S.;Kim, I.G.;Chung, D.H.;Choi, J.P.;Lee, H.K.;Huh, Y.;Ahn, C.B.
    • Proceedings of the KIEE Conference
    • /
    • 2003.07d
    • /
    • pp.2786-2788
    • /
    • 2003
  • 본 논문에서는 생체자기신호의 잡음제거 기법 중 PCA(Principal Component Analysis) 알고리즘을 사용하여 효과적으로 노이즈를 제거하기 위한 방법을 제안하였다. 61 채널 SQUID 시스템을 이용하여 심자도 신호를 측정하였고, 40 채널 SQUID 시스템을 이용하여 뇌자도 신호를 측정하였다. 그리고, 측정한 신호 성분들을 제안한 방법을 이용하여 주성분들을 분리하였고, 이들 중에서 노이즈 성분을 추정하여 측정한 신호에서 제거하였다. 이러한 방법을 이용한 결과, 심자도 신호에 존재하는 펄스 노이즈로 인하여 왜곡된 생체 자기 신호의 노이즈를 감소 시킬 수 있었으며, 뇌자도 신호에 존재하는 외부 노이즈 성분을 제거하여 임상 진단에 유용한 데이터를 얻을 수 있었다.

  • PDF

Re-classifying Method for Face Recognition (얼굴 인식 성능 향상을 위한 재분류 방법)

  • Bae Kyoung-Yul
    • Journal of Intelligence and Information Systems
    • /
    • v.10 no.3
    • /
    • pp.105-114
    • /
    • 2004
  • In the past year, the increasing concern about the biometric recognition makes the great activities on the security fields, such as the entrance control or user authentication. In particular, although the features of face recognition, such as user friendly and non-contact made it to be used widely, unhappily it has some disadvantages of low accuracy or low Re-attempts Rates. For this reason, I suggest the new approach to re-classify the classified data of recognition result data to solve the problems. For this study, I will use the typical appearance-based, PCA(Principal Component Analysis) algorithm and verify the performance improvement by adopting the re-classification approach using 200 peoples (10 pictures per one person).

  • PDF

Effects of Continuous Application of Green Manures on Microbial Community in Paddy Soil

  • Kim, Sook-Jin;Kim, Kwang Seop;Choi, Jong-Seo;Kim, Min-Tae;Lee, Yong Bok;Park, Ki-Do;Hur, Seonggi
    • Korean Journal of Soil Science and Fertilizer
    • /
    • v.48 no.5
    • /
    • pp.528-534
    • /
    • 2015
  • Green manure crops have been well recognized as the alternative for chemical fertilizer, especially N fertilizer, because of its positive effect on soil and the environment. Hairy vetch and green barley are the most popular crops for cultivation of rice in paddy field. This study was conducted to evaluate effects of hairy vetch and green barley on soil microbial community and chemical properties during short-term application (three years). For this study, treatments were composed of hairy vetch (Hv), green barley (Gb), hairy vetch + green barley (Hv+Gb), and chemical fertilizer without green manure crops (Con.). Hv+Gb treatment showed the highest microbial biomass among treatments. Principal component analysis (PCA) showed that PC1 (73.0 %) was affected by microbial biomass and PC2 (21.5 %) was affected by fungi, cy19:0/18:$1{\omega}7c$ (stress indicator). Combined treatment with hairy vetch and green barley could be more efficient than green manure crop treatment as well as chemical fertilizer treatment for improvement of soil microorganisms.

Antioxidant Activities and Quality Characteristics of Rice Cookie with Added Butterbur (Petasites japonicus) Powder (머위 분말 첨가 쌀쿠키의 항산화 활성 및 품질 특성)

  • Choi, Hee Won;Sim, Ki Hyeon
    • The Korean Journal of Food And Nutrition
    • /
    • v.34 no.1
    • /
    • pp.1-14
    • /
    • 2021
  • This study evaluated the antioxidant activity and quality characteristics of rice cookie with added butterbur powder in a ratio of 0, 5, 10, 15, and 20% to confirm the possibility of butterbur as a functional food. The moisture content, spread factor, leavening rate, and hardness of rice cookies increased with an increase in the amount of butterbur powder; whereas a decrease in the pH and baking loss rate was observed. The L and b values decreased as the amount of butterbur powder increased, but the value was the lowest when 5% of butterbur powder was added. The sensory liking score showed the highest preference for 10% butterbur powder regarding appearance, flavor, taste, texture, and overall preference. In the principal component analysis (PCA), the addition of 10% butterbur powder positively affected the measure of food acceptance in terms of organoleptic properties of butterbur. Besides, as the amount of added butterbur powder increased, the antioxidant activity of rice cookies increased. Based on these results, it appears that the addition of butterbur powder to rice cookies in a 10% ratio can produce rice cookies with excellent antioxidant activity, overall quality, and high preference.

Histogram Equalized Eigen Co-occurrence Features for Color Image Classification (컬러이미지 검색을 위한 히스토그램 평활화 기반 고유 병발 특징에 관한 연구)

  • Yoon, TaeBok;Choi, YoungMee;Choo, MoonWon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2010.11a
    • /
    • pp.705-708
    • /
    • 2010
  • An eigen color co-occurrence approach is proposed that exploits the correlation between color channels to identify the degree of image similarity. This method is based on traditional co-occurrence matrix method and histogram equalization. On the purpose of feature extraction, eigen color co-occurrence matrices are computed for extracting the statistical relationships embedded in color images by applying Principal Component Analysis (PCA) on a set of color co-occurrence matrices, which are computed on the histogram equalized images. That eigen space is created with a set of orthogonal axes to gain the essential structures of color co-occurrence matrices, which is used to identify the degree of similarity to classify an input image to be tested for various purposes. In this paper RGB, Gaussian color space are compared with grayscale image in terms of PCA eigen features embedded in histogram equalized co-occurrence features. The experimental results are presented.

Development of an SNP set for marker-assisted breeding based on the genotyping-by-sequencing of elite inbred lines in watermelon (수박 엘리트 계통의 GBS를 통한 마커이용 육종용 SNP 마커 개발)

  • Lee, Junewoo;Son, Beunggu;Choi, Youngwhan;Kang, Jumsoon;Lee, Youngjae;Je, Byoung Il;Park, Younghoon
    • Journal of Plant Biotechnology
    • /
    • v.45 no.3
    • /
    • pp.242-249
    • /
    • 2018
  • This study was conducted to develop an SNP set that can be useful for marker-assisted breeding (MAB) in watermelon (Citrullus. lanatus L) using Genotyping-by-sequencing (GBS) analysis of 20 commercial elite watermelon inbreds. The result of GBS showed that 77% of approximately 1.1 billion raw reads were mapped on the watermelon genome with an average mapping region of about 4,000 Kb, which indicated genome coverage of 2.3%. After the filtering process, a total of 2,670 SNPs with an average depth of 31.57 and the PIC (Polymorphic Information Content) value of 0.1~0.38 for 20 elite inbreds were obtained. Among those SNPs, 55 SNPs (5 SNPs per chromosome that are equally distributed on each chromosome) were selected. For the understanding genetic relationship of 20 elite inbreds, PCA (Principal Component Analysis) was carried out with 55 SNPs, which resulted in the classification of inbreds into 4 groups based on PC1 (52%) and PC2 (11%), thus causing differentiation between the inbreds. A similar classification pattern for PCA was observed from hierarchical clustering analysis. The SNP set developed in this study has the potential for application to cultivar identification, F1 seed purity test, and marker-assisted backcross (MABC) not only for 20 elite inbreds but also for diverse resources for watermelon breeding.

Pattern Recognition of the Herbal Drug, Magnoliae Flos According to their Essential Oil Components

  • Jeong, Eun-Sook;Choi, Kyu-Yeol;Kim, Sun-Chun;Son, In-Seop;Cho, Hwang-Eui;Ahn, Su-Youn;Woo, Mi-Hee;Hong, Jin-Tae;Moon, Dong-Cheul
    • Bulletin of the Korean Chemical Society
    • /
    • v.30 no.5
    • /
    • pp.1121-1126
    • /
    • 2009
  • This paper describes a pattern recognition method of Magnoliae flos based on a gas chromatographic/mass spectrometric (GC/MS) analysis of the essential oil components. The botanical drug is mainly comprised of the four magnolia species (M. denudata, M. biondii, M. kobus, and M. liliflora) in Korea, although some other species are also being dealt with the drug. The GC/MS separation of the volatile components, which was extracted by the simultaneous distillation and extraction (SDE), was performed on a carbowax column (supelcowax 10; 30 m{\time}0.25 mm{\time}0.25{\mu}m$) using temperature programming. Variance in the retention times for all peaks of interests was within RSD 2% for repeated analyses (n = 9). Of the 74 essential oil components identified from the magnolia species, approximately 10 major components, which is $\alpha$-pinene, $\beta$-pinene, sabinene, myrcene, d-limonene, eucarlyptol (1,8-cineol), $\gamma$-terpinene, p-cymene, linalool, $\alpha$-terpineol, were commonly present in the four species. For statistical analysis, the original dataset was reduced to the 13 variables by Fisher criterion and factor analysis (FA). The essential oil patterns were processed by means of the multivariate statistical analysis including hierarchical cluster analysis (HCA), principal component analysis (PCA) and discriminant analysis (DA). All samples were divided into four groups with three principal components by PCA and according to the plant origins by HCA. Thirty-three samples (23 training sets and 10 test samples to be assessed) were correctly classified into the four groups predicted by PCA. This method would provide a practical strategy for assessing the authenticity or quality of the well-known herbal drug, Magnoliae flos.

Detection of Forest Fire Damage from Sentinel-1 SAR Data through the Synergistic Use of Principal Component Analysis and K-means Clustering (Sentinel-1 SAR 영상을 이용한 주성분분석 및 K-means Clustering 기반 산불 탐지)

  • Lee, Jaese;Kim, Woohyeok;Im, Jungho;Kwon, Chunguen;Kim, Sungyong
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.5_3
    • /
    • pp.1373-1387
    • /
    • 2021
  • Forest fire poses a significant threat to the environment and society, affecting carbon cycle and surface energy balance, and resulting in socioeconomic losses. Widely used multi-spectral satellite image-based approaches for burned area detection have a problem in that they do not work under cloudy conditions. Therefore, in this study, Sentinel-1 Synthetic Aperture Radar (SAR) data from Europe Space Agency, which can be collected in all weather conditions, were used to identify forest fire damaged area based on a series of processes including Principal Component Analysis (PCA) and K-means clustering. Four forest fire cases, which occurred in Gangneung·Donghae and Goseong·Sokcho in Gangwon-do of South Korea and two areas in North Korea on April 4, 2019, were examined. The estimated burned areas were evaluated using fire reference data provided by the National Institute of Forest Science (NIFOS) for two forest fire cases in South Korea, and differenced normalized burn ratio (dNBR) for all four cases. The average accuracy using the NIFOS reference data was 86% for the Gangneung·Donghae and Goseong·Sokcho fires. Evaluation using dNBR showed an average accuracy of 84% for all four forest fire cases. It was also confirmed that the stronger the burned intensity, the higher detection the accuracy, and vice versa. Given the advantage of SAR remote sensing, the proposed statistical processing and K-means clustering-based approach can be used to quickly identify forest fire damaged area across the Korean Peninsula, where a cloud cover rate is high and small-scale forest fires frequently occur.

Straight Line Detection Using PCA and Hough Transform (주성분 분석과 허프 변환을 이용한 직선 검출)

  • Oh, Jeong-su
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.22 no.2
    • /
    • pp.227-232
    • /
    • 2018
  • In a Hough transform that is a representative algorithm for the straight line detection, a great number of edge pixels generated from noisy or complex images cause enormous amount of computation and pseudo straight lines. This paper proposes a two step straight line detection algorithm to improve the conventional Hough transform. In the first step, the proposed algorithm divides an image into non-overlapping blocks and detects the information related to the straight line of the edge pixels in the block using a principal component analysis (PCA). In the second step, it detects the straight lines by performing the Hough transform limited slope area to the pixels associated with the straight line. Simulation results show that the proposed algorithm reduces average of ${\rho}$ computation by 94.6% and prevents the pseudo straight lines although some additional computation is needed.