• 제목/요약/키워드: Partial least squares- discriminant analysis

검색결과 64건 처리시간 0.029초

Partial Least Squares-discriminant Analysis for the Prediction of Hemodynamic Changes Using Near Infrared Spectroscopy

  • Seo, Youngwook;Lee, Seungduk;Koh, Dalkwon;Kim, Beop-Min
    • Journal of the Optical Society of Korea
    • /
    • 제16권1호
    • /
    • pp.57-62
    • /
    • 2012
  • Using continuous wave near-infrared spectroscopy, we measured time-resolved concentration changes of oxy-hemoglobin and deoxy-hemoglobin from the primary motor cortex following finger tapping tasks. These data were processed using partial least squares-discriminant analysis (PLS-DA) to develop a prediction model for a brain-computer interface. The tasks were composed of a series of finger tapping for 15 sec and relaxation for 45 sec. The location of the motor cortex was confirmed by the anti-phasic behavior of the oxy- and deoxy-hemoglobin changes. The results were compared with those obtained using the hidden Markov model (HMM) which has been known to produce the best prediction model. Our data imply that PLS-DA makes better judgments in determining the onset of the events than HMM.

GC-MS 기반 대사체학 기법을 이용한 택사의 산지판별모델 (Discrimination Model of Cultivation Area of Alismatis Rhizoma using a GC-MS-Based Metabolomics Approach)

  • 임재윤
    • 약학회지
    • /
    • 제60권1호
    • /
    • pp.29-35
    • /
    • 2016
  • Traditional Korean medicines may be managed more scientifically, through the development of logical criterion to verify their cultivation region. It contributes to advance the industry of traditional herbal medicines. Volatile compounds were obtained from 14 samples of domestic Taeksa and 30 samples of Chinese Taeksa by steam distillation. The metabolites were identified by NIST mass spectral library in the obtained gas chromatography/mass spectrometer (GC/MS) data of 35 training samples. The multivariate statistical analysis, such as Principal Component Analysis (PCA), Partial Least Squares Discriminant Analysis (PLS-DA), and Orthogonal Partial Least Squares Discriminant Analysis (OPLS-DA), were performed based on the qualitative and quantitative data. Finally trans-(2,3-diphenylcyclopropyl)methyl phenyl sulfoxide (47.265 min), 1,2,3,4-tetrahydro-1-phenyl-naphthalene (47.781 min), spiro[4-oxatricyclo[5.3.0.0.(2,6)]decan-3-one-5,2'-cyclohexane] (54.62 min), 6-[7-nitrobenzofurazan-4-yl]amino-morphinan-4,5-epoxy (54.86 min), p-hydroxynorephedrine (55.14 min) were determined as marker metabolites to verify candidates for the origin of Taeksa. The statistical model was well established to determine the origin of Taeksa. The cultivation areas of test samples, each 3 domestic and 6 Chinese Taeksa were predicted by the established OPLS-DA model and it was confirmed that all 9 samples were precisely classified.

벌점 부분최소자승법을 이용한 분류방법 (A new classification method using penalized partial least squares)

  • 김윤대;전치혁;이혜선
    • Journal of the Korean Data and Information Science Society
    • /
    • 제22권5호
    • /
    • pp.931-940
    • /
    • 2011
  • 분류분석은 학습표본으로부터 분류규칙을 도출한 후 새로운 표본에 적용하여 특정 범주로 분류하는 방법이다. 데이터의 복잡성에 따라 다양한 분류분석 방법이 개발되어 왔지만, 데이터 차원이 높고 변수간 상관성이 높은 경우 정확하게 분류하는 것은 쉽지 않다. 본 연구에서는 데이터차원이 상대적으로 높고 변수간 상관성이 높을 때 강건한 분류방법을 제안하고자 한다. 부분최소자승법은 연속형데이터에 사용되는 기법으로서 고차원이면서 독립변수간 상관성이 높을 때 예측력이 높은 통계기법으로 알려져 있는 다변량 분석기법이다. 벌점 부분최소자승법을 이용한 분류방법을 실제데이터와 시뮬레이션을 적용하여 성능을 비교하고자 한다.

GC-MS 기반 대사체학 기술을 응용한 참당귀의 산지비교분석 (Comparative Analysis of Cultivation Region of Angelica gigas Using a GC-MS-Based Metabolomics Approach)

  • 강귀보;임재윤
    • 한국약용작물학회지
    • /
    • 제24권2호
    • /
    • pp.93-100
    • /
    • 2016
  • Background: A set of logical criteria that can accurately identify and verify the cultivation region of raw materials is a critical tool for the scientific management of traditional herbal medicine. Methods and Results: Volatile compounds were obtained from 19 and 32 samples of Angelica gigas Nakai cultivated in Korea and China, respectively, by using steam distillation extraction. The metabolites were identified using GC/MS by querying against the NIST reference library. Data binning was performed to normalize the number of variables used in statistical analysis. Multivariate statistical analyses, such as Principal Component Analysis (PCA), Partial Least Squares-Discriminant Analysis (PLS-DA), and Orthogonal Partial Least Squares-Discriminant Analysis (OPLS-DA) were performed using the SIMCA-P software. Significant variables with a Variable Importance in the Projection (VIP) score higher than 1.0 as obtained through OPLS-DA and those that resulted in p-values less than 0.05 through one-way ANOVA were selected to verify the marker compounds. Among the 19 variables extracted, styrene, ${\alpha}$-pinene, and ${\beta}$-terpinene were selected as markers to indicate the origin of A. gigas. Conclusions: The statistical model developed was suitable for determination of the geographical origin of A. gigas. The cultivation regions of six Korean and eight Chinese A. gigas. samples were predicted using the established OPLS-DA model and it was confirmed that 13 of the 14 samples were accurately classified.

Comparison of 12 Isoflavone Profiles of Soybean (Glycine max (L.) Merrill) Seed Sprouts from Three Different Countries

  • Park, Soo-Yun;Kim, Jae Kwang;Kim, Eun-Hye;Kim, Seung-Hyun;Prabakaran, Mayakrishnan;Chung, Ill-Min
    • 한국작물학회지
    • /
    • 제63권4호
    • /
    • pp.360-377
    • /
    • 2018
  • The levels of 12 isoflavones were measured in soybean (Glycine max (L.) Merrill) sprouts of 68 genetic varieties from three countries (China, Japan, and Korea). The isoflavone profile differences were analyzed using data mining methods. A principal component analysis (PCA) revealed that the CSRV021 variety was separated from the others by the first two principal components. This variety appears to be most suited for functional food production due to its high isoflavone levels. Partial least squares discriminant analysis (PLS-DA) and orthogonal projections to latent structures discriminant analysis (OPLS-DA) showed that there are meaningful isoflavone compositional differences in samples that have different countries of origin. Hierarchical clustering analysis (HCA) of these phytochemicals resulted in clusters derived from closely related biochemical pathways. These results indicate the usefulness of metabolite profiling combined with chemometrics as a tool for assessing the quality of foods and identifying metabolic links in biological systems.

Differentiation of Roots of Glycyrrhiza Species by 1H Nuclear Magnetic Resonance Spectroscopy and Multivariate Statistical Analysis

  • Yang, Seung-Ok;Hyun, Sun-Hee;Kim, So-Hyun;Kim, Hee-Su;Lee, Jae-Hwi;Whang, Wan-Kyun;Lee, Min-Won;Choi, Hyung-Kyoon
    • Bulletin of the Korean Chemical Society
    • /
    • 제31권4호
    • /
    • pp.825-828
    • /
    • 2010
  • To classify Glycyrrhiza species, samples of different species were analyzed by $^1H$ NMR-based metabolomics technique. Partial least squares discriminant analysis (PLS-DA) was used as the multivariate statistical analysis of the 1H NMR data sets. There was a clear separation between various Glycyrrhiza species in the PLS-DA derived score plots. The PLS-DA model was validated, and the key metabolites contributing to the separation in the score plots of various Glycyrrhiza species were lactic acid, alanine, arginine, proline, malic acid, asparagine, choline, glycine, glucose, sucrose, 4-hydroxy-phenylacetic acid, and formic acid. The compounds present at relatively high levels were glucose, and 4-hydroxyphenylacetic acid in G. glabra; lactic acid, alanine, and proline in G. inflata; and arginine, malic acid, and sucrose in G. uralensis. This is the first study to perform the global metabolomic profiling and differentiation of Glycyrrhiza species using $^1H$ NMR and multivariate statistical analysis.

Unraveling dynamic metabolomes underlying different maturation stages of berries harvested from Panax ginseng

  • Lee, Mee Youn;Seo, Han Sol;Singh, Digar;Lee, Sang Jun;Lee, Choong Hwan
    • Journal of Ginseng Research
    • /
    • 제44권3호
    • /
    • pp.413-423
    • /
    • 2020
  • Background: Ginseng berries (GBs) show temporal metabolic variations among different maturation stages, determining their organoleptic and functional properties. Methods: We analyzed metabolic variations concomitant to five different maturation stages of GBs including immature green (IG), mature green (MG), partially red (PR), fully red (FR), and overmature red (OR) using mass spectrometry (MS)-based metabolomic profiling and multivariate analyses. Results: The partial least squares discriminant analysis score plot based on gas chromatography-MS datasets highlighted metabolic disparity between preharvest (IG and MG) and harvest/postharvest (PR, FR, and OR) GB extracts along PLS1 (34.9%) with MG distinctly segregated across PLS2 (18.2%). Forty-three significantly discriminant primary metabolites were identified encompassing five developmental stages (variable importance in projection > 1.0, p < 0.05). Among them, most amino acids, organic acids, 5-C sugars, ethanolamines, purines, and palmitic acid were detected in preharvest GB extracts, whereas 6-C sugars, phenolic acid, and oleamide levels were distinctly higher during later maturation stages. Similarly, the partial least squares discriminant analysis based on liquid chromatography-MS datasets displayed preharvest and harvest/postharvest stages clustered across PLS1 (11.1 %); however, MG and PR were separated from IG, FR, and OR along PLS2 (5.6 %). Overall, 24 secondary metabolites were observed significantly discriminant (variable importance in projection > 1.0, p < 0.05), with most displaying higher relative abundance during preharvest stages excluding ginsenosides Rg1 and Re. Furthermore, we observed strong positive correlations between total flavonoid and phenolic metabolite contents in GB extracts and antioxidant activity. Conclusion: Comprehending the dynamic metabolic variations associated with GB maturation stages rationalize their optimal harvest time per se the related agroeconomic traits.

GC-MS 기반 대사체학 기법을 이용한 산수유의 산지판별모델 (Discrimination model of cultivation area of Corni Fructus using a GC-MS-Based metabolomics approach)

  • 임재윤
    • 분석과학
    • /
    • 제29권1호
    • /
    • pp.1-9
    • /
    • 2016
  • 생약의 원산지를 판별하는 논리적인 일련의 기준을 개발한다면, 현재 유통되는 한약을 좀 더 과학적으로 관리 할 수 있을 것이다. 이러한 노력은 전통적인 한약 산업 발전에 기여할 것이라고 사료된다. 산수유의 원산지 판별법을 개발하기 위해, 본 연구에서는 우선 국산 산수유와 중국산 산수유를 각각 수증기 증류하고 이 때 얻은 휘발성분을 GC/MS를 이용하여 분석하였다. NIST mass spectral library의 데이터베이스로부터 정성분석한 결과를 바탕으로 데이터를 범주화(binning)하여 변수를 얻고, 이에 대하여 PCA, OPLS-DA 등 다변량 통계 분석을 수행함으로써 신속, 정확하게 국산 산수유와 중국산 산수유의 산지를 판별할 수 있는 산지 판별모델을 확립하였다. 산지 판별모델 개발을 위해서 학습집합(n=53)을 분석하여 산지 판별모델을 수립한 후, 검증집합(n=12)을 산지 판별모델에 적용함으로써 그 타당성을 확인하였다. 더불어 1-ethylbutyl-hydroperoxide, nonadecane, butylated hydroxytoluene, 5β,7βH,10α-Eudesm-11-en-1α-ol, 7,9-bis (2-methyl-2-propanyl)-1-oxaspiro[4.5]deca-6,9-diene-2,8-dione, 그리고 2-decyldodecyl-benzene 등 6개의 마커성분을 선정할 수 있었다. 최근에 NMR을 활용한 산수유 원산지 판별에 대한 보고는 있었으나, GC/MS를 기반으로 한 대사체학 연구기법을 이용하여 산지판별 모델을 제시하는 것은 최초의 보고로서 그 의미가 크다. 본 연구결과를 활용하여 한약의 원산지 판별모델 확립과 산수유 원산지의 과학적인 관리에 적용할 수 있으리라 사료된다.

Impurity profiling and chemometric analysis of methamphetamine seizures in Korea

  • Shin, Dong Won;Ko, Beom Jun;Cheong, Jae Chul;Lee, Wonho;Kim, Suhkmann;Kim, Jin Young
    • 분석과학
    • /
    • 제33권2호
    • /
    • pp.98-107
    • /
    • 2020
  • Methamphetamine (MA) is currently the most abused illicit drug in Korea. MA is produced by chemical synthesis, and the final target drug that is produced contains small amounts of the precursor chemicals, intermediates, and by-products. To identify and quantify these trace compounds in MA seizures, a practical and feasible approach for conducting chromatographic fingerprinting with a suite of traditional chemometric methods and recently introduced machine learning approaches was examined. This was achieved using gas chromatography (GC) coupled with a flame ionization detector (FID) and mass spectrometry (MS). Following appropriate examination of all the peaks in 71 samples, 166 impurities were selected as the characteristic components. Unsupervised (principal component analysis (PCA), hierarchical cluster analysis (HCA), and K-means clustering) and supervised (partial least squares-discriminant analysis (PLS-DA), orthogonal partial least squares-discriminant analysis (OPLS-DA), support vector machines (SVM), and deep neural network (DNN) with Keras) chemometric techniques were employed for classifying the 71 MA seizures. The results of the PCA, HCA, K-means clustering, PLS-DA, OPLS-DA, SVM, and DNN methods for quality evaluation were in good agreement. However, the tested MA seizures possessed distinct features, such as chirality, cutting agents, and boiling points. The study indicated that the established qualitative and semi-quantitative methods will be practical and useful analytical tools for characterizing trace compounds in illicit MA seizures. Moreover, they will provide a statistical basis for identifying the synthesis route, sources of supply, trafficking routes, and connections between seizures, which will support drug law enforcement agencies in their effort to eliminate organized MA crime.

열수 탄화 공정을 거친 리그닌 하이드로차(hydrochar)의 탄화 거동 분석과 근적외선 분광법을 이용한 예측 모델 개발 (Analysis of Carbonization Behavior of Hydrochar Produced by Hydrothermal Carbonization of Lignin and Development of a Prediction Model for Carbonization Degree Using Near-Infrared Spectroscopy)

  • HWANG, Un Taek;BAE, Junsoo;LEE, Taekyeong;HWANG, Sung-Yun;KIM, Jong-Chan;PARK, Jinseok;CHOI, In-Gyu;KWAK, Hyo Won;HWANG, Sung-Wook;YEO, Hwanmyeong
    • Journal of the Korean Wood Science and Technology
    • /
    • 제49권3호
    • /
    • pp.213-225
    • /
    • 2021
  • 본 논문에서는 열수 탄화(hydrothermal carbonization)에 의해 제조된 리그닌 하이드로차의 탄화 특성을 조사하였고, 근적외선 분광법과 부분 최소 제곱(partial least squares) 회귀를 이용하여 탄화 거동을 예측하기 위한 모델을 수립하였다. 온도 200℃에서 열수 탄화된 리그닌의 탄소 함량은 무처리 시료 보다 약 3 wt% 높았으며 가열 시간이 증가할수록 탄소 함량도 서서히 증가하는 경향이 나타났다. 열수 탄화는 리그닌을 더욱 탄소 집약적으로 변화시키고 마이크로 파티클을 제거하여 더욱 균질한 특성을 부여하였다. 근적외선 분광법과 부분 최소 제곱 회귀를 이용한 판별 및 예측 모델은 수열 탄화의 적용 여부를 완벽히 구분했으며 높은 정확도로 열수 탄화 리그닌의 탄소 함량을 예측하였다. 본 연구로부터 근적외선 분광법과 결합된 부분 최소 제곱 회귀 모델을 이용하여 열수 탄화에 의해 제조된 리그닌 하이드로차의 탄화 특성을 빠르고 비파괴적으로 예측할 수 있다는 것이 확인되었다.