• Title/Summary/Keyword: partial least squares

Search Result 619, Processing Time 0.024 seconds

A Study on the Several Robust Regression Estimators

  • Kim, Jee-Yun;Roh, Kyung-Mi;Hwang, Jin-Soo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.15 no.2
    • /
    • pp.307-316
    • /
    • 2004
  • Principal Component Regression(PCR) and Partial Least Squares Regression(PLSR) are the two most popular regression techniques in chemometrics. In the field of chemometrics usually the number of regressor variables greatly exceeds the number of observation. So we have to reduce the number of regressors to avoid the identifiability problem. In this paper we compare PCR and PLSR techniques combined with various robust regression methods including regression depth estimation. We compare the efficiency, goodness-of-fit and robustness of each estimators under several contamination schemes.

  • PDF

Investigation of Partial Least Squares (PLS) Calibration Performance based on Different Resolutions of Near Infrared Spectra

  • Chung, Hoe-Il;Choi, Seung-Yeol;Choo, Jae-Bum;Lee, Young-Il
    • Bulletin of the Korean Chemical Society
    • /
    • v.25 no.5
    • /
    • pp.647-651
    • /
    • 2004
  • Partial Least Squares (PLS) calibration performance has been systematically investigated by changing spectral resolutions of near-infrared (NIR) spectra. For this purpose, synthetic samples simulating naphtha were prepared to examine the calibration performance in complex chemical matrix. These samples were composed of $C_6-C_9$ normal paraffin, iso-paraffin, naphthene, and aromatic hydrocarbons. NIR spectra with four different resolutions of 4, 8, 16, and 32$cm^{-1}$ were collected and then PLS regression was performed. For PLS calibration, five different group compositions (such as total paraffin content) and six different pure components (such as benzene concentration) were selected. The overall results showed that at least 8$cm^{-1}$ resolution was required to resolve the complex chemical matrix such as naphtha. It was found that the influence of resolution on the PLS calibration was varied by the spectral features of a component.

Partial Least Squares Analysis on Near-Infrared Absorbance Spectra by Air-dried Specific Gravity of Major Domestic Softwood Species

  • Yang, Sang-Yun;Park, Yonggun;Chung, Hyunwoo;Kim, Hyunbin;Park, Se-Yeong;Choi, In-Gyu;Kwon, Ohkyung;Cho, Kyu-Chae;Yeo, Hwanmyeong
    • Journal of the Korean Wood Science and Technology
    • /
    • v.45 no.4
    • /
    • pp.399-408
    • /
    • 2017
  • Research on the rapid and accurate prediction of physical properties of wood using near-infrared (NIR) spectroscopy has attracted recent attention. In this study, partial least squares analysis was performed between NIR spectra and air-dried specific gravity of five domestic conifer species including larch (Larix kaempferi), Korean pine (Pinus koraiensis), red pine (Pinus densiflora), cedar (Cryptomeria japonica), and cypress (Chamaecyparis obtusa). Fifty different lumbers per species were purchased from the five National Forestry Cooperative Federations of Korea. The air-dried specific gravity of 100 knot- and defect-free specimens of each species was determined by NIR spectroscopy in the range of 680-2500 nm. Spectral data preprocessing including standard normal variate, detrend and forward first derivative (gap size = 8, smoothing = 8) were applied to all the NIR spectra of the specimens. Partial least squares analysis including cross-validation (five groups) was performed with the air-dried specific gravity and NIR spectra. When the performance of the regression model was expressed as $R^2$ (coefficient of determination) and root mean square error of calibration (RMSEC), $R^2$ and RMSEC were 0.63 and 0.027 for larch, 0.68 and 0.033 for Korean pine, 0.62 and 0.033 for red pine, 0.76 and 0.022 for cedar, and 0.79 and 0.027 for cypress, respectively. For the calibration model, which contained all species in this study, the $R^2$ was 0.75 and the RMSEC was 0.37.

Non-linear Data Classification Using Partial Least Square and Residual Compensator (부분 최소 자승법과 잔차 보상기를 이용한 비선형 데이터 분류)

  • 김경훈;김태영;최원호
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.10 no.2
    • /
    • pp.185-191
    • /
    • 2004
  • Partial least squares(PLS) is one of multiplicate statistical process methods and has been developed in various algorithms with the characteristics of principal component analysis, dimensionality reduction, and analysis of the relationship between input variables and output variables. But it has been limited somewhat by their dependency on linear mathematics. The algorithm is proposed to classify for the non-linear data using PLS and the residual compensator(RC) based on radial basis function network (RBFN). It compensates for the error of the non-linear data using the RC based on RBFN. The experimental result is given to verify its efficiency compared with those of previous works.

Robust Velocity Estimation of an Omnidirectional Mobile Robot Using a Polygonal Array of Optical Mice

  • Kim, Sung-Bok;Lee, Sang-Hyup
    • International Journal of Control, Automation, and Systems
    • /
    • v.6 no.5
    • /
    • pp.713-721
    • /
    • 2008
  • This paper presents the robust velocity estimation of an omnidirectional mobile robot using a polygonal array of optical mice that are installed at the bottom of the mobile robot. First, the velocity kinematics from a mobile robot to an array of optical mice is derived as an overdetermined linear system. The least squares velocity estimate of a mobile robot is then obtained, which becomes the same as the simple average for a regular polygonal arrangement of optical mice. Next, several practical issues that need be addressed for the use of the least squares mobile robot velocity estimation using optical mice are investigated, which include measurement noises, partial malfunctions, and imperfect installation. Finally, experimental results with different number of optical mice and under different floor surface conditions are given to demonstrate the validity and performance of the proposed least squares mobile robot velocity estimation method.

Milling tool wear forecast based on the partial least-squares regression analysis

  • Xu, Chuangwen;Chen, Hualing
    • Structural Engineering and Mechanics
    • /
    • v.31 no.1
    • /
    • pp.57-74
    • /
    • 2009
  • Power signals resulting from spindle and feed motor, present a rich content of physical information, the appropriate analysis of which can lead to the clear identification of the nature of the tool wear. The partial least-squares regression (PLSR) method has been established as the tool wear analysis method for this purpose. Firstly, the results of the application of widely used techniques are given and their limitations of prior methods are delineated. Secondly, the application of PLSR is proposed. The singular value theory is used to noise reduction. According to grey relational degree analysis, sample variable is filtered as part sample variable and all sample variables as independent variables for modelling, and the tool wear is taken as dependent variable, thus PLSR model is built up through adapting to several experimental data of tool wear in different milling process. Finally, the prediction value of tool wear is compare with actual value, in order to test whether the model of the tool wear can adopt to new measuring data on the independent variable. In the new different cutting process, milling tool wear was predicted by the methods of PLSR and MLR (Multivariate Linear Regression) as well as BPNN (BP Neural Network) at the same time. Experimental results show that the methods can meet the needs of the engineering and PLSR is more suitable for monitoring tool wear.

Discrimination Model of Cultivation Area of Alismatis Rhizoma using a GC-MS-Based Metabolomics Approach (GC-MS 기반 대사체학 기법을 이용한 택사의 산지판별모델)

  • Leem, Jae-Yoon
    • YAKHAK HOEJI
    • /
    • v.60 no.1
    • /
    • pp.29-35
    • /
    • 2016
  • Traditional Korean medicines may be managed more scientifically, through the development of logical criterion to verify their cultivation region. It contributes to advance the industry of traditional herbal medicines. Volatile compounds were obtained from 14 samples of domestic Taeksa and 30 samples of Chinese Taeksa by steam distillation. The metabolites were identified by NIST mass spectral library in the obtained gas chromatography/mass spectrometer (GC/MS) data of 35 training samples. The multivariate statistical analysis, such as Principal Component Analysis (PCA), Partial Least Squares Discriminant Analysis (PLS-DA), and Orthogonal Partial Least Squares Discriminant Analysis (OPLS-DA), were performed based on the qualitative and quantitative data. Finally trans-(2,3-diphenylcyclopropyl)methyl phenyl sulfoxide (47.265 min), 1,2,3,4-tetrahydro-1-phenyl-naphthalene (47.781 min), spiro[4-oxatricyclo[5.3.0.0.(2,6)]decan-3-one-5,2'-cyclohexane] (54.62 min), 6-[7-nitrobenzofurazan-4-yl]amino-morphinan-4,5-epoxy (54.86 min), p-hydroxynorephedrine (55.14 min) were determined as marker metabolites to verify candidates for the origin of Taeksa. The statistical model was well established to determine the origin of Taeksa. The cultivation areas of test samples, each 3 domestic and 6 Chinese Taeksa were predicted by the established OPLS-DA model and it was confirmed that all 9 samples were precisely classified.

Analysis of Carbonization Behavior of Hydrochar Produced by Hydrothermal Carbonization of Lignin and Development of a Prediction Model for Carbonization Degree Using Near-Infrared Spectroscopy (열수 탄화 공정을 거친 리그닌 하이드로차(hydrochar)의 탄화 거동 분석과 근적외선 분광법을 이용한 예측 모델 개발)

  • HWANG, Un Taek;BAE, Junsoo;LEE, Taekyeong;HWANG, Sung-Yun;KIM, Jong-Chan;PARK, Jinseok;CHOI, In-Gyu;KWAK, Hyo Won;HWANG, Sung-Wook;YEO, Hwanmyeong
    • Journal of the Korean Wood Science and Technology
    • /
    • v.49 no.3
    • /
    • pp.213-225
    • /
    • 2021
  • In this paper, we investigated the carbonization characteristics of lignin hydrochar prepared by hydrothermal carbonization and established a model for predicting the carbonization degree using near-infrared spectroscopy and partial least squares regression. The carbon content of the hydrothermally carbonized lignin at the temperature of 200 ℃ was higher by approximately 3 wt% than that of the untreated sample, and the carbon content tended to gradually increase as the heating time increased. Hydrothermal carbonization made lignin more carbon-intensive and more homogeneous by eliminating the microparticles. The discriminant and predictive models using near-infrared spectroscopy and partial least squares regression approppriately determined whether hydrothermal carbonization has been applied and predicted the carbon content of hydrothermal carbonized lignin with high accuracy. In this study, we confirmed that we can quickly and nondestructively predict the carbonization characteristics of lignin hydrochar manufactured by hydrothermal carbonization using a partial least squares regression model combined with near-infrared spectroscopy.

Gene Selection and Classification by Partial Least Squares and Principal component analysis (부분최소자승법과 주성분분석을 이용한 유전자 선택과 분류)

  • Park, Hoseok;Kim, Hey-Jin;Park, Seugj in;Bang, Sung-Yang
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2001.10a
    • /
    • pp.598-600
    • /
    • 2001
  • DNA chip technology enables us to monitor thousands of gene expressions per sample simultaneously. Typically, DNA microarray data has at least several thousands of variables (genes) wish relatively smal1 number of samples. Thus feature (gene) selection by dimensionality reduction is necessary for efficient data analysis. In this paper we employ the partial least squares (PLS) method for gene selection and the principal component analysis (PCA) method for classification. The useful behavior of the PLS is verified by computer simulations.

  • PDF