• Title/Summary/Keyword: PCA (Principal Component Analysis)

The Effect of Meteorological Factors on PM10 Depletion in the Atmosphere and Evaluation of Rainwater Quality (기상인자에 따른 대기 중 미세먼지 감소 및 빗물 특성 연구)

  • Park, Hyemin;Kim, Taeyong;Yang, Minjune
    • Korean Journal of Remote Sensing / v.36 no.6_3 / pp.1733-1741 / 2020
  • This study used multivariate statistical analysis to examine the effect of meteorological factors on the atmospheric concentration of PM10 (particulate matter 10) and on the variation of rainwater quality. The atmospheric PM10 concentration was measured continuously during eleven precipitation events with a custom-built PM sensor node. A total of 183 rainwater samples were analyzed for pH, EC (electrical conductivity), water-soluble cations (Na⁺, Mg²⁺, K⁺, Ca²⁺, NH₄⁺), and anions (Cl⁻, NO₃⁻, SO₄²⁻). The data were analyzed using two multivariate statistical techniques, principal component analysis (PCA) and Pearson correlation analysis, to identify relationships among atmospheric PM10 concentrations, meteorological factors, and rainwater quality factors. When the rainfall intensity was relatively strong (> 5 mm/h, rainfall type 1), the atmospheric PM10 concentration was negatively correlated with cumulative rainfall (r = -0.55, p < 0.05). The PM10 concentration was positively correlated with the concentration of water-soluble ions (r = 0.25) and EC (r = 0.4) and negatively correlated with the pH (r = -0.7) of the rainwater samples. However, for rainfall type 2 (< 5 mm/h), there was no negative correlation between the atmospheric PM10 concentration and cumulative rainfall, and no statistically significant correlation between the atmospheric PM10 concentration and rainwater quality.
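The r values quoted in this abstract are Pearson correlation coefficients. As a minimal sketch of how such a coefficient is computed, the following uses only the Python standard library; the function name `pearson_r` and the sample numbers are illustrative assumptions, not the study's code or measurements.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of products of deviations (proportional to the covariance).
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    # Square roots of the summed squared deviations (proportional to the standard deviations).
    dev_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    dev_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (dev_x * dev_y)

# Hypothetical values: cumulative rainfall (mm) and PM10 concentration.
cumulative_rainfall = [1.0, 2.0, 3.0, 4.0]
pm10 = [8.0, 6.0, 4.0, 2.0]
print(pearson_r(cumulative_rainfall, pm10))
```

A value near -1 indicates that PM10 falls as cumulative rainfall rises; the abstract's r = -0.55 would correspond to a weaker, though still negative, linear relationship.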

Optimization of Extraction of Cycloalliin from Garlic (Allium sativum L.) by Using Principal Components Analysis

  • Lee, Hyun Jung;Suh, Hyung Joo;Han, Sung Hee;Hong, Jungil;Choi, Hyeon-Son
    • Preventive Nutrition and Food Science / v.21 no.2 / pp.138-146 / 2016
  • In this study, we report the optimal extraction conditions for obtaining organosulfur compounds, such as cycloalliin, from garlic by using principal component analysis (PCA). Extraction variables including temperature (40~80°C), time (0.5~12 h), and pH (4~12) were investigated to find the highest cycloalliin yields. The cycloalliin yield at pH 10 (5.5 mmol/mL) was ~40% higher than the yields at pH 4 and pH 6 (~3.9 mmol/mL). Among the tested temperatures, extraction at 80°C gave the highest cycloalliin yield (5.05 mmol/mL). Prolonged extraction times also increased cycloalliin yield; the yield after 12 h was enhanced ~2-fold (4 mmol/mL) compared to the control. Isoalliin and cycloalliin levels were inversely correlated, whereas a direct correlation between polyphenol and cycloalliin levels was observed. During storage for 30 days, garlic stored at 60°C (11 mmol/mL) showed higher levels of cycloalliin and polyphenols than garlic stored at 40°C, with the maximum cycloalliin level (13 mmol/mL) on day 15. Based on the PCA, the isoalliin level depended on the extraction time, while cycloalliin amounts were influenced not only by extraction time but also by pH and temperature. Taken together, extraction of garlic at 80°C, with an incubation time of 12 h, at pH 10 afforded the maximum yield of cycloalliin.

An Arabic Script Recognition System

  • Alginahi, Yasser M.;Mudassar, Mohammed;Nomani Kabir, Muhammad
    • KSII Transactions on Internet and Information Systems (TIIS) / v.9 no.9 / pp.3701-3720 / 2015
  • A system for the recognition of machine-printed Arabic script is proposed. The Arabic script is shared by three languages, i.e., Arabic, Urdu, and Farsi. The three languages have a decent amount of vocabulary in common, which compounds the identification problem. Therefore, in an ideal scenario, not only must the script be differentiated from other scripts, but the language of the script must also be recognized. The recognition process involves the segregation of Arabic-script documents from Latin, Han, and other script documents using horizontal and vertical projection profiles, followed by the identification of the language. Identification mainly involves extracting connected components, which are subjected to a Principal Component Analysis (PCA) transformation to extract uncorrelated features. The traditional K-Nearest Neighbours (KNN) algorithm is then used for recognition. Experiments were carried out by varying the number of principal components and the number of connected components extracted per document to find the combination that gives the optimal accuracy. An accuracy of 100% is achieved with at least 18 connected components and 15 principal components. The proposed system could play a vital role in the automatic archiving of multilingual documents and in the selection of the appropriate Arabic script in multilingual Optical Character Recognition (OCR) systems.
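The final recognition step described above, K-Nearest Neighbours voting over PCA-derived features, can be sketched as follows. This is a minimal illustration under stated assumptions: the function `knn_predict`, the two-dimensional feature vectors, and the labels are hypothetical stand-ins for PCA-projected connected components, not the authors' implementation or data.

```python
import math
from collections import Counter

def knn_predict(train, query, k=3):
    """Return the majority label among the k training vectors nearest to query.

    train is a list of (feature_vector, label) pairs; distance is Euclidean.
    """
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

# Hypothetical 2-D features standing in for PCA-projected connected components.
train = [
    ((0.1, 0.2), "Arabic"), ((0.0, 0.3), "Arabic"), ((0.2, 0.1), "Arabic"),
    ((1.0, 1.1), "Urdu"), ((0.9, 1.2), "Urdu"), ((1.1, 0.9), "Urdu"),
]
print(knn_predict(train, (0.15, 0.15)))
```

In the setting the abstract describes, the feature vectors would have as many dimensions as retained principal components (15 in the best reported configuration), and one document would contribute several connected components to the vote.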

EXTRACTION OF WATERMARKS BASED ON INDEPENDENT COMPONENT ANALYSIS

  • Thai, Hien-Duy;Zensho Nakao;Yen-Wei Chen
    • Proceedings of the Korean Institute of Intelligent Systems Conference / 2003.09a / pp.407-410 / 2003
  • We propose a new logo watermark scheme for digital images that embeds a watermark by modifying the middle-frequency sub-bands of the wavelet transform. Independent component analysis (ICA) is introduced to authenticate and copyright-protect multimedia products by extracting the watermark. To exploit the human visual system (HVS) and improve robustness, a perceptual model is applied with a stochastic approach based on the noise visibility function (NVF) for the adaptive watermarking algorithm. Experimental results demonstrated that the watermark is perfectly extracted by the ICA technique with excellent invisibility, and that it is robust against various image and digital processing operators and almost all compression algorithms, such as JPEG, JPEG 2000, SPIHT, EZW, and principal components analysis (PCA) based compression.

Dimensionality reduction for pattern recognition based on difference of distribution among classes

  • Nishimura, Masaomi;Hiraoka, Kazuyuki;Mishima, Taketoshi
    • Proceedings of the IEEK Conference / 2002.07c / pp.1670-1673 / 2002
  • For pattern recognition on high-dimensional data, such as images, dimensionality reduction is an effective preprocessing step. By reducing dimensionality, we can (1) reduce storage requirements and the amount of calculation, and (2) avoid "the curse of dimensionality" and improve classification performance. Popular tools for dimensionality reduction are Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), and, more recently, Independent Component Analysis (ICA). Among them, only LDA takes the class labels into consideration. Nevertheless, it has been reported that the classification performance with ICA is better than that with LDA because LDA restricts the number of dimensions after reduction. To overcome this dilemma, we propose a new dimensionality reduction technique based on an information-theoretic measure of the difference between distributions. It takes the class labels into consideration, yet it places no restriction on the number of dimensions after reduction. Improvement of classification performance has been confirmed experimentally.
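The restriction on LDA mentioned above can be made concrete with the standard Fisher formulation. The notation below is an assumption (the abstract defines no symbols): C classes, class c with n_c samples and mean \mu_c, overall mean \mu, and between-class scatter matrix S_B.

```latex
S_B = \sum_{c=1}^{C} n_c \,(\mu_c - \mu)(\mu_c - \mu)^{\top},
\qquad
\sum_{c=1}^{C} n_c \,(\mu_c - \mu) = 0
\;\Longrightarrow\;
\operatorname{rank}(S_B) \le C - 1 .
```

Because the C weighted mean deviations are linearly dependent, S_B has at most C - 1 nonzero eigenvalues, so standard LDA yields at most C - 1 discriminant directions regardless of the input dimension. For a two-class problem this means a single projection axis.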

Remote Face Recognition System through Internet (인터넷을 통한 원격 얼굴인식 시스템)

  • Song, Jee-Hwan;Park, Jong-Jin;Bae, Kyoung-Yul
    • Annual Conference of KIPS / 2003.11c / pp.2005-2008 / 2003
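  • This paper proposes a remote identification and authentication system that applies face recognition over the Internet. Among biometric technologies, which verify or authenticate identity using physical characteristics, face recognition differs from fingerprint, palm-print, vein, and iris recognition, all of which require contact with a device: it provokes little user resistance and needs no specialized equipment, so it is easily accessible to the general public. Published face recognition algorithms include PCA (Principal Component Analysis), ICA (Independent Component Analysis), and FDA (Fisher Discriminant Analysis), which differ in how they analyze facial features. Because a system is needed that can quickly and accurately transmit the results of facial feature analysis by these algorithms to and from a remote site, we present a comparative evaluation of biometric systems and propose the configuration of a remote face recognition system over the Internet.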

Design and Implementation of a Real-Time Lipreading System Using PCA & HMM (PCA와 HMM을 이용한 실시간 립리딩 시스템의 설계 및 구현)

  • Lee, Chi-Geun;Lee, Eun-Suk;Jung, Sung-Tae;Lee, Sang-Seol
    • Journal of Korea Multimedia Society / v.7 no.11 / pp.1597-1609 / 2004
  • Many lipreading systems have been proposed to compensate for the drop in speech recognition rate in noisy environments. Previous lipreading systems work only under specific conditions, such as artificial lighting and a predefined background color. In this paper, we propose a real-time lipreading system that allows the speaker to move and relaxes the restrictions on color and lighting conditions. The proposed system extracts the face and lip regions, along with the essential visual information, in real time from an input video sequence captured with a common PC camera, and it recognizes uttered words from that visual information in real time. It uses a hue histogram model to extract the face and lip regions, the mean shift algorithm to track the face of a moving speaker, PCA (Principal Component Analysis) to extract the visual information for learning and testing, and an HMM (Hidden Markov Model) as the recognition algorithm. The experimental results show that our system achieves a recognition rate of 90% for speaker-dependent lipreading and, when combined with audio speech recognition, increases the speech recognition rate up to 40~85% depending on the noise level.

Network-based regularization for analysis of high-dimensional genomic data with group structure (그룹 구조를 갖는 고차원 유전체 자료 분석을 위한 네트워크 기반의 규제화 방법)

  • Kim, Kipoong;Choi, Jiyun;Sun, Hokeun
    • The Korean Journal of Applied Statistics / v.29 no.6 / pp.1117-1128 / 2016
  • In genetic association studies with high-dimensional genomic data, regularization procedures based on penalized likelihood are often applied to identify genes or genetic regions associated with diseases or traits. A network-based regularization procedure can utilize biological network information (such as genetic pathways and signaling pathways in genetic association studies) and shows outstanding selection performance compared with other regularization procedures such as lasso and elastic-net. However, network-based regularization is limited because it cannot be applied to high-dimensional genomic data with a group structure. In this article, we propose combining data dimension reduction techniques, such as principal component analysis and partial least squares, with network-based regularization for the analysis of high-dimensional genomic data with a group structure. The selection performance of the proposed method was evaluated by extensive simulation studies. The proposed method was also applied to real DNA methylation data generated from the Illumina Infinium HumanMethylation27K BeadChip, where methylation beta values of around 20,000 CpG sites over 12,770 genes were compared between 123 ovarian cancer patients and 152 healthy controls. This analysis was also able to indicate a few cancer-related genes.

Time Series Data Analysis and Prediction System Using PCA (주성분 분석 기법을 활용한 시계열 데이터 분석 및 예측 시스템)

  • Jin, Young-Hoon;Ji, Se-Hyun;Han, Kun-Hee
    • Journal of the Korea Convergence Society / v.12 no.11 / pp.99-107 / 2021
  • We live amid a myriad of data. Data of many kinds are created in every situation in which we work, and we discover their meaning through big data technology. Many efforts are underway to find meaningful data. This paper introduces an analysis technique, based on principal component analysis, that helps people make better choices by analyzing the trend of time series data and predicting it. Principal component analysis constructs a covariance matrix from the input data and yields eigenvectors and eigenvalues from which the direction of the data can be inferred. The proposed method computes a reference axis for a set of time series with similar directionality. It then predicts the directionality of the data in the next interval from the angle between the reference axis and the direction of each time series in the set. In this paper, we compare the accuracy of the proposed algorithm with that of LSTM (Long Short-Term Memory) on cryptocurrency trends. In this comparison, the proposed method recorded relatively few transactions and high returns (112%) compared to LSTM on data with high volatility. This suggests that the signal was analyzed and predicted relatively accurately, and better results are expected with a more accurate threshold setting.
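The covariance, eigenvector, and angle steps named in this abstract can be sketched for the simplest case. This is a hedged illustration, not the paper's algorithm: it assumes each time series is treated as a set of 2-D points (time index, value), and the helper names `principal_direction` and `angle_between` and the sample series are hypothetical. For a 2x2 covariance matrix the leading eigenvector has a closed form, so only the standard library is needed.

```python
import math

def principal_direction(points):
    """Unit leading eigenvector of the 2x2 sample covariance matrix of (x, y) points."""
    n = len(points)
    mean_x = sum(p[0] for p in points) / n
    mean_y = sum(p[1] for p in points) / n
    sxx = sum((p[0] - mean_x) ** 2 for p in points) / (n - 1)
    syy = sum((p[1] - mean_y) ** 2 for p in points) / (n - 1)
    sxy = sum((p[0] - mean_x) * (p[1] - mean_y) for p in points) / (n - 1)
    # Orientation of the major axis of a symmetric 2x2 matrix [[sxx, sxy], [sxy, syy]].
    theta = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    return (math.cos(theta), math.sin(theta))

def angle_between(u, v):
    """Angle in degrees between two axes; the sign of an eigenvector is ignored."""
    dot = abs(u[0] * v[0] + u[1] * v[1])
    return math.degrees(math.acos(min(1.0, dot)))

# Hypothetical series: a flat reference series and a steadily rising one.
reference = [(0, 1.0), (1, 1.0), (2, 1.0)]
rising = [(0, 0.0), (1, 1.0), (2, 2.0)]
ref_axis = principal_direction(reference)
print(angle_between(principal_direction(rising), ref_axis))
```

How the reference axis is built from several series, and which angle threshold triggers a prediction, are not specified in the abstract and are left out of this sketch.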

Differentiation between Normal and White Striped Turkey Breasts by Visible/Near Infrared Spectroscopy and Multivariate Data Analysis

  • Zaid, Amal;Abu-Khalaf, Nawaf;Mudalal, Samer;Petracci, Massimiliano
    • Food Science of Animal Resources / v.40 no.1 / pp.96-105 / 2020
  • The appearance of white striations over breast meat is an emerging and growing problem. The main purpose of this study was to use visible/near-infrared (VIS/NIR) reflectance spectroscopy to differentiate between normal and white striped turkey breasts. Accordingly, 34 turkey breast fillets were selected to represent different levels of white striping (WS) defects (normal, moderate, and severe). The VIS/NIR spectra were analyzed by principal component analysis (PCA). The first principal component (PC1) for the VIS, NIR, and VIS/NIR regions explained 98%, 97%, and 96% of the total variation, respectively. PCA showed high performance in differentiating normal meat from abnormal meat (moderate and severe WS). In conclusion, the results of this research showed that VIS/NIR spectroscopy was satisfactory for differentiating normal from severe WS turkey fillets by using several quality traits.
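The "explained 98% of the total variation" figures are explained-variance ratios: each covariance eigenvalue divided by the sum of all eigenvalues. A minimal sketch follows; the function name and the eigenvalues are made-up illustrations, not values from the study.

```python
def explained_variance_ratio(eigenvalues):
    """Fraction of total variance carried by each principal component."""
    total = sum(eigenvalues)
    return [value / total for value in eigenvalues]

# Hypothetical covariance eigenvalues, sorted from largest to smallest.
ratios = explained_variance_ratio([9.8, 0.15, 0.05])
print(f"PC1 explains {ratios[0]:.0%} of the total variance")
```

When PC1 alone carries this much of the variance, plotting samples along that single axis is often enough to see whether the groups separate.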