• Title/Summary/Keyword: PCA-LDA

Search Result 142, Processing Time 0.028 seconds

Pattern Recognition for Typification of Whiskies and Brandies in the Volatile Components using Gas Chromatographic Data

  • Myoung, Sungmin;Oh, Chang-Hwan
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.5
    • /
    • pp.167-175
    • /
    • 2016
  • The volatile component analysis of 82 commercialized liquors(44 samples of single malt whisky, 20 samples of blended whisky and 18 samples of brandy) was carried out by gas chromatography after liquid-liquid extraction with dichloromethane. Pattern recognition techniques such as principle component analysis(PCA), cluster analysis(CA), linear discriminant analysis(LDA) and partial least square discriminant analysis(PLSDA) were applied for the discrimination of different liquor categories. Classification rules were validated by considering sensitivity and specificity of each class. Both techniques, LDA and PLSDA, gave 100% sensitivity and specificity for all of the categories. These results suggested that the common characteristics and identities as typification of whiskies and brandys was founded by using multivariate data analysis method.

Development of Learning Algorithm using Brain Modeling of Hippocampus for Face Recognition (얼굴인식을 위한 해마의 뇌모델링 학습 알고리즘 개발)

  • Oh, Sun-Moon;Kang, Dae-Seong
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.42 no.5 s.305
    • /
    • pp.55-62
    • /
    • 2005
  • In this paper, we propose the face recognition system using HNMA(Hippocampal Neuron Modeling Algorithm) which can remodel the cerebral cortex and hippocampal neuron as a principle of a man's brain in engineering, then it can learn the feature-vector of the face images very fast and construct the optimized feature each image. The system is composed of two parts. One is feature-extraction and the other is teaming and recognition. In the feature extraction part, it can construct good-classified features applying PCA(Principal Component Analysis) and LDA(Linear Discriminants Analysis) in order. In the learning part, it cm table the features of the image data which are inputted according to the order of hippocampal neuron structure to reaction-pattern according to the adjustment of a good impression in the dentate gyrus region and remove the noise through the associate memory in the CA3 region. In the CA1 region receiving the information of the CA3, it can make long-term memory learned by neuron. Experiments confirm the each recognition rate, that are face changes, pose changes and low quality image. The experimental results show that we can compare a feature extraction and learning method proposed in this paper of any other methods, and we can confirm that the proposed method is superior to existing methods.

A Study on Clutter Rejection using PCA and Stochastic features of Edge Image (주성분 분석법 및 외곽선 영상의 통계적 특성을 이용한 클러터 제거기법 연구)

  • Kang, Suk-Jong;Kim, Do-Jong;Bae, Hyeon-Deok
    • Journal of the Institute of Electronics Engineers of Korea SC
    • /
    • v.47 no.6
    • /
    • pp.12-18
    • /
    • 2010
  • Automatic Target Detection (ATD) systems that use forward-looking infrared (FLIR) consists of three stages. preprocessing, detection, and clutter rejection. All potential targets are extracted in preprocessing and detection stages. But, this results in a high false alarm rates. To reduce false alarm rates of ATD system, true targets are extracted in the clutter rejection stage. This paper focuses on clutter rejection stage. This paper presents a new clutter rejection technique using PCA features and stochastic features of clutters and targets. PCA features are obtained from Euclidian distances using which potential targets are projected to reduced eigenspace selected from target eigenvectors. CV is used for calculating stochastic features of edges in targets and clutters images. To distinguish between target and clutter, LDA (Linear Discriminant Analysis) is applied. The experimental results show that the proposed algorithm accurately classify clutters with a low false rate compared to PCA method or CV method

A study on the classification of research topics based on COVID-19 academic research using Topic modeling (토픽모델링을 활용한 COVID-19 학술 연구 기반 연구 주제 분류에 관한 연구)

  • Yoo, So-yeon;Lim, Gyoo-gun
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.1
    • /
    • pp.155-174
    • /
    • 2022
  • From January 2020 to October 2021, more than 500,000 academic studies related to COVID-19 (Coronavirus-2, a fatal respiratory syndrome) have been published. The rapid increase in the number of papers related to COVID-19 is putting time and technical constraints on healthcare professionals and policy makers to quickly find important research. Therefore, in this study, we propose a method of extracting useful information from text data of extensive literature using LDA and Word2vec algorithm. Papers related to keywords to be searched were extracted from papers related to COVID-19, and detailed topics were identified. The data used the CORD-19 data set on Kaggle, a free academic resource prepared by major research groups and the White House to respond to the COVID-19 pandemic, updated weekly. The research methods are divided into two main categories. First, 41,062 articles were collected through data filtering and pre-processing of the abstracts of 47,110 academic papers including full text. For this purpose, the number of publications related to COVID-19 by year was analyzed through exploratory data analysis using a Python program, and the top 10 journals under active research were identified. LDA and Word2vec algorithm were used to derive research topics related to COVID-19, and after analyzing related words, similarity was measured. Second, papers containing 'vaccine' and 'treatment' were extracted from among the topics derived from all papers, and a total of 4,555 papers related to 'vaccine' and 5,971 papers related to 'treatment' were extracted. did For each collected paper, detailed topics were analyzed using LDA and Word2vec algorithms, and a clustering method through PCA dimension reduction was applied to visualize groups of papers with similar themes using the t-SNE algorithm. A noteworthy point from the results of this study is that the topics that were not derived from the topics derived for all papers being researched in relation to COVID-19 (

    ) were the topic modeling results for each research topic (
    ) was found to be derived from For example, as a result of topic modeling for papers related to 'vaccine', a new topic titled Topic 05 'neutralizing antibodies' was extracted. A neutralizing antibody is an antibody that protects cells from infection when a virus enters the body, and is said to play an important role in the production of therapeutic agents and vaccine development. In addition, as a result of extracting topics from papers related to 'treatment', a new topic called Topic 05 'cytokine' was discovered. A cytokine storm is when the immune cells of our body do not defend against attacks, but attack normal cells. Hidden topics that could not be found for the entire thesis were classified according to keywords, and topic modeling was performed to find detailed topics. In this study, we proposed a method of extracting topics from a large amount of literature using the LDA algorithm and extracting similar words using the Skip-gram method that predicts the similar words as the central word among the Word2vec models. The combination of the LDA model and the Word2vec model tried to show better performance by identifying the relationship between the document and the LDA subject and the relationship between the Word2vec document. In addition, as a clustering method through PCA dimension reduction, a method for intuitively classifying documents by using the t-SNE technique to classify documents with similar themes and forming groups into a structured organization of documents was presented. In a situation where the efforts of many researchers to overcome COVID-19 cannot keep up with the rapid publication of academic papers related to COVID-19, it will reduce the precious time and effort of healthcare professionals and policy makers, and rapidly gain new insights. We hope to help you get It is also expected to be used as basic data for researchers to explore new research directions.

  • Fault Classification of Induction Motors by k-NN and SVM (k-NN과 SVM을 이용한 유도전동기 고장 분류)

    • Park, Seong-Mu;Lee, Dae-Jong;Gwon, Seok-Yeong;Kim, Yong-Sam;Jun, Myeong-Geun
      • Proceedings of the Korean Institute of Intelligent Systems Conference
      • /
      • 2006.11a
      • /
      • pp.109-112
      • /
      • 2006
    • 본 논문에서는 PCA에 의한 특징추출과 k-NN과 SVM에 기반을 계층구조의 분류기에 의한 유도전동기의 고장진단 알고리즘을 제안한다. 제안된 방법은 k-NN에 의해 선형적으로 분류 가능한 고장패턴을 분류한 후, 분류가 되지 않는 부분을 커널 함수에 의해 고차원 공간으로 입력패턴을 매핑한 후 SVM에 의해 고장을 진단하는 계층구조를 갖는다. 실험장치를 구축한 후, 다양한 부하에 대하여 몇몇의 전기적 고장과 기계적 고장 하에서 획득한 데이터를 이용하여 제안된 방법의 타당성을 검증한다.

    • PDF

    Analysis of Research Trends Related to drug Repositioning Based on Machine Learning (머신러닝 기반의 신약 재창출 관련 연구 동향 분석)

    • So Yeon Yoo;Gyoo Gun Lim
      • Information Systems Review
      • /
      • v.24 no.1
      • /
      • pp.21-37
      • /
      • 2022
    • Drug repositioning, one of the methods of developing new drugs, is a useful way to discover new indications by allowing drugs that have already been approved for use in people to be used for other purposes. Recently, with the development of machine learning technology, the case of analyzing vast amounts of biological information and using it to develop new drugs is increasing. The use of machine learning technology to drug repositioning will help quickly find effective treatments. Currently, the world is having a difficult time due to a new disease caused by coronavirus (COVID-19), a severe acute respiratory syndrome. Drug repositioning that repurposes drugsthat have already been clinically approved could be an alternative to therapeutics to treat COVID-19 patients. This study intends to examine research trends in the field of drug repositioning using machine learning techniques. In Pub Med, a total of 4,821 papers were collected with the keyword 'Drug Repositioning'using the web scraping technique. After data preprocessing, frequency analysis, LDA-based topic modeling, random forest classification analysis, and prediction performance evaluation were performed on 4,419 papers. Associated words were analyzed based on the Word2vec model, and after reducing the PCA dimension, K-Means clustered to generate labels, and then the structured organization of the literature was visualized using the t-SNE algorithm. Hierarchical clustering was applied to the LDA results and visualized as a heat map. This study identified the research topics related to drug repositioning, and presented a method to derive and visualize meaningful topics from a large amount of literature using a machine learning algorithm. It is expected that it will help to be used as basic data for establishing research or development strategies in the field of drug repositioning in the future.

    Performance Comparison of 2DPCA based Face Recognition algorithm under Robotic Environments (로봇 환경에서의 2DPCA 기반 알고리즘의 비교 연구)

    • Park, Beom-Chul;Kwak, Keun-Chang;Yoon, Ho-Seop
      • Proceedings of the IEEK Conference
      • /
      • 2007.07a
      • /
      • pp.217-218
      • /
      • 2007
    • Face recognition, recognizing the human faces, is one of the most important techniques for making intelligent robot that provide commendable services to human. In this paper, we make a comparative study of Original PCA, 2DPCA, 2DPCA based algorithms and LDA in robot environment. Database is obtained through the robot's camera in a laboratory what is made like home environment for experiment.. We consider distance state what can be generated in home environment for database.

    • PDF

    Design of Pattern Classification for Face Image Using Feature Space and SVM (얼굴영상의 특징공간 추출과 SVM을 이용한 패턴분류 설계)

    • 김진숙;강진숙;차의영
      • Proceedings of the Korea Multimedia Society Conference
      • /
      • 2003.05b
      • /
      • pp.146-149
      • /
      • 2003
    • 개인의 신분을 확인하는 생체인식기술은 정보기술이 고도화된 사회 속에서 정보보안의 관점에서 더욱 중요한 문제로 인식되기 시작했다. 이러한 생체인식 영역 중에서 비교적 거부감을 덜 주면서 어느 정도의 인식율을 제공하는 얼굴인식 분야의 연구는 지난 수년간 활발하게 진행되었다. 보통 전통적으로 얼굴인식에는 우선 PCA가 적용되어 데이터를 축소하고 LDA가 얼굴 인식을 위한 특징벡터를 추출하는 역할을 수행한다 본 논문에서는 이러한 이원적인 과정을 동시 대각화를 통해 하나의 과정으로 통합하고 기존의 유클리디언 디스턴스 대신에 SVM(Support Vector Machine) 패턴 분류기를 사용하여 얼굴인식을 수행하는 알고리즘을 제안한다.

    • PDF

    Face Recognition using Fisherface Method with Fuzzy Membership Degree (퍼지 소속도를 갖는 Fisherface 방법을 이용한 얼굴인식)

    • 곽근창;고현주;전명근
      • Journal of KIISE:Software and Applications
      • /
      • v.31 no.6
      • /
      • pp.784-791
      • /
      • 2004
    • In this study, we deal with face recognition using fuzzy-based Fisherface method. The well-known Fisherface method is more insensitive to large variation in light direction, face pose, and facial expression than Principal Component Analysis method. Usually, the various methods of face recognition including Fisherface method give equal importance in determining the face to be recognized, regardless of typicalness. The main point here is that the proposed method assigns a feature vector transformed by PCA to fuzzy membership rather than assigning the vector to particular class. In this method, fuzzy membership degrees are obtained from FKNN(Fuzzy K-Nearest Neighbor) initialization. Experimental results show better recognition performance than other methods for ORL and Yale face databases.

    Analysis of PD Distribution Characteristics and Comparison of Classification Methods according to Electrical Tree Source in Power Cable (전력용 케이블 시편에서 전기트리 발생원에 따른 부분방전 분포 특성 및 발생원 분류기법 비교)

    • Park, Seong-Hee;Jeong, Hae-Eun;Lim, Kee-Joe;Kang, Seong-Hwa
      • Journal of the Korean Institute of Electrical and Electronic Material Engineers
      • /
      • v.20 no.1
      • /
      • pp.57-64
      • /
      • 2007
    • One of the cause of insulation failure in power cable is well known by electrical treeing discharge. This is occurred for imposed continuous stress at cable. And this event is related to safety, reliability and maintenance. In this paper, throughout analysis of partial discharge(PD) distribution when occurring the electrical tree, is studied for the purpose of knowing of electrical treeing discharge characteristics according to defects. Own characteristic of tree will be differently processed in each defect and this reason is the first purpose of this paper. To acquire PD data, three defective tree models were made. And their own data is shown by the phase-resolved partial discharge method (PRPD). As a result of PRPD, tree discharge sources have their own characteristics. And if other defects (void, metal particle) exist internal power cable then their characteristics are shown very different. This result Is related to the time of breakdown and this is importance of cable diagnosis. And classification method of PD sources was studied in this paper. It needs select the most useful method to apply PD data classification one of the proposed method. To meet the requirement, we select methods of different type. That is, neural network(NN-BP), adaptive neuro-fuzzy inference system and PCA-LDA were applied to result. As a result of, ANFIS shows the highest rate which value is 98 %. Generally, PCA-LDA and ANFIS are better than BP. Finally, we performed classification of tree progress using ANFIS and that result is 92 %.


    (34141) Korea Institute of Science and Technology Information, 245, Daehak-ro, Yuseong-gu, Daejeon
    Copyright (C) KISTI. All Rights Reserved.