• 제목/요약/키워드: Least Squares Support Vector Machine

검색결과 67건 처리시간 0.021초

A comparison of ATR-FTIR and Raman spectroscopy for the non-destructive examination of terpenoids in medicinal plants essential oils

  • Rahul Joshi;Sushma Kholiya;Himanshu Pandey;Ritu Joshi;Omia Emmanuel;Ameeta Tewari;Taehyun Kim;Byoung-Kwan Cho
    • 농업과학연구
    • /
    • 제50권4호
    • /
    • pp.675-696
    • /
    • 2023
  • Terpenoids, also referred to as terpenes, are a large family of naturally occurring chemical compounds present in the essential oils extracted from medicinal plants. In this study, a nondestructive methodology was created by combining ATR-FT-IR (attenuated total reflectance-Fourier transform infrared), and Raman spectroscopy for the terpenoids assessment in medicinal plants essential oils from ten different geographical locations. Partial least squares regression (PLSR) and support vector regression (SVR) were used as machine learning methodologies. However, a deep learning based model called as one-dimensional convolutional neural network (1D CNN) were also developed for models comparison. With a correlation coefficient (R2) of 0.999 and a lowest RMSEP (root mean squared error of prediction) of 0.006% for the prediction datasets, the SVR model created for FT-IR spectral data outperformed both the PLSR and 1 D CNN models. On the other hand, for the classification of essential oils derived from plants collected from various geographical regions, the created SVM (support vector machine) classification model for Raman spectroscopic data obtained an overall classification accuracy of 0.997% which was superior than the FT-IR (0.986%) data. Based on the results we propose that FT-IR spectroscopy, when coupled with the SVR model, has a significant potential for the non-destructive identification of terpenoids in essential oils compared with destructive chemical analysis methods.

Evaluation of soil-concrete interface shear strength based on LS-SVM

  • Zhang, Chunshun;Ji, Jian;Gui, Yilin;Kodikara, Jayantha;Yang, Sheng-Qi;He, Lei
    • Geomechanics and Engineering
    • /
    • 제11권3호
    • /
    • pp.361-372
    • /
    • 2016
  • The soil-concrete interface shear strength, although has been extensively studied, is still difficult to predict as a result of the dependence on many factors such as normal stresses, surface roughness, particle sizes, moisture contents, dilation angles of soils, etc. In this study, a well-known rigorous statistical learning approach, namely the least squares support vector machine (LS-SVM) realized in a ubiquitous spreadsheet platform is firstly used in estimating the soil-structure interface shear strength. Instead of studying the complicated mechanism, LS-SVM enables to explore the possible link between the fundamental factors and the interface shear strengths, via a sophisticated statistic approach. As a preliminary investigation, the authors study the expansive soils that are found extensively in most countries. To reduce the complexity, three major influential factors, e.g., initial moisture contents, initial dry densities and normal stresses of soils are taken into account in developing the LS-SVM models for the soil-concrete interface shear strengths. The predicted results by LS-SVM show reasonably good agreement with experimental data from direct shear tests.

An Optimization Algorithm with Novel Flexible Grid: Applications to Parameter Decision in LS-SVM

  • Gao, Weishang;Shao, Cheng;Gao, Qin
    • Journal of Computing Science and Engineering
    • /
    • 제9권2호
    • /
    • pp.39-50
    • /
    • 2015
  • Genetic algorithm (GA) and particle swarm optimization (PSO) are two excellent approaches to multimodal optimization problems. However, slow convergence or premature convergence readily occurs because of inappropriate and inflexible evolution. In this paper, a novel optimization algorithm with a flexible grid optimization (FGO) is suggested to provide adaptive trade-off between exploration and exploitation according to the specific objective function. Meanwhile, a uniform agents array with adaptive scale is distributed on the gird to speed up the calculation. In addition, a dominance centroid and a fitness center are proposed to efficiently determine the potential guides when the population size varies dynamically. Two types of subregion division strategies are designed to enhance evolutionary diversity and convergence, respectively. By examining the performance on four benchmark functions, FGO is found to be competitive with or even superior to several other popular algorithms in terms of both effectiveness and efficiency, tending to reach the global optimum earlier. Moreover, FGO is evaluated by applying it to a parameter decision in a least squares support vector machine (LS-SVM) to verify its practical competence.

준지도 커널능형회귀모형에 관한 연구 (A study on semi-supervised kernel ridge regression estimation)

  • 석경하
    • Journal of the Korean Data and Information Science Society
    • /
    • 제24권2호
    • /
    • pp.341-353
    • /
    • 2013
  • 데이터마이닝과 기계학습의 응용분야에서는 라벨 없는 자료를 이용하는 연구가 많이 진행되고 있다. 이러한 연구는 분류문제에 집중되었다가 최근에 회귀분석문제로 관심이 모아지고 있다. 본 연구에서는 커널능형회귀모형 형태의 준지도 회귀분석 방법을 제시한다. 제안된 방법은 기존의 전환적 방법과는 달리 라벨 없는 자료의 라벨을 추정하는 과정을 필요로 하지 않기 때문에 선택해야 할 모수의 수도 적고, 계산과정도 단순할 뿐 아니라 일반화에 강점이 있다. 모의실험과 실제 자료 분석을 통해 제안된 방법이 라벨 없는 자료를 잘 활용하여 라벨 있는 자료만 이용하는 방법보다 더 우수한 추정을 하는 것을 볼 수 있었다.

서브 밴드 CSP기반 FLD 및 PCA를 이용한 동작 상상 EEG 특징 추출 방법 연구 (A Method of Feature Extraction on Motor Imagery EEG Using FLD and PCA Based on Sub-Band CSP)

  • 박상훈;이상국
    • 정보과학회 논문지
    • /
    • 제42권12호
    • /
    • pp.1535-1543
    • /
    • 2015
  • 뇌-컴퓨터 인터페이스는 사용자의 뇌전도(Electroencephalogram: EEG)를 획득하여 생각만으로 기계를 제어하거나 신체장애를 가진 사람에게 손 또는 발과 같은 신체를 대신하여 의사 전달 수단으로 사용될 수 있다. 본 논문에서는 동작 상상 EEG를 분류하기 위해 Sub-Band Common Spatial Pattern(SBCSP)를 기반으로 필터 선택을 하지 않는 특징 추출 방법에 대해 연구한다. 4~40Hz의 동작 상상 신호를 4Hz 대역마다 나눈 9개의 서브 밴드에 각각 CSP를 적용한다. 이후 Fisher's Linear Discriminant(FLD)를 사용하여 도출된 값들을 결합한 FLD 점수 벡터에 차원 축소를 위한 Principal Component Analysis(PCA)를 적용하여 클래스 구분을 위한 최적의 평면에 특징을 투영한다. 데이터베이스는 BCI CompetitionIII dataset IVa(2 클래스: 오른손 다리)를 이용하며, 추출된 특징은 Least Squares Support Vector Machine(LS-SVM)의 입력으로 사용된다. 제안된 방법의 성능은 $10{\times}10$ fold cross-validation을 이용하여 분류 정확도로 나타낸다. 본 논문에서 제안하는 방법은 피험자 'aa', 'al', 'av', 'aw', 'ay'에 대하여 각각 $85.29{\pm}0.93%$, $95.43{\pm}0.57%$, $72.57{\pm}2.37%$, $91.82{\pm}1.38%$, $93.50{\pm}0.69%$의 분류 정확도를 보였다.

정확히 재가중되는 온라인 전체 에러율 최소화 기반의 객체 추적 (Object Tracking Based on Exactly Reweighted Online Total-Error-Rate Minimization)

  • 장세인;박충식
    • 지능정보연구
    • /
    • 제25권4호
    • /
    • pp.53-65
    • /
    • 2019
  • 영상 기반의 보안 시스템의 증가함에 따라 각 용도마다 다른 다양한 객체들에 대한 처리들이 중요해지고 있다. 객체 추적은 객체 인식, 검출과 같은 작업들과 함께 필수적인 작업으로 다뤄진다. 이 객체 추적을 달성하기 위해서 다양한 머신러닝이 적용될 수 있다. 성공적인 분류기로써 전체 에러율 최소화(total-error-rate minimization) 기반의 방법론이 사용될 수 있다. 이 전체 에러율 최소화 기반의 방법론은 오프라인 학습을 기반으로 하고 있다. 객체 추적은 실시간으로 처리하며 갱신해야하는 것이 필수적이므로 온라인 학습(online learning)을 기반으로 하는 것이 적합하다. 온라인 전체 에러율 최소화 방법론이 개발되었지만 점근적으로 재가중되는(approximately reweighted) 작업이 포함되어 에러를 누적시킬 수 있다는 단점이 있다. 본 논문에서는 정확하게 재가중되는(exactly reweighted) 방법론을 제안하면서 온라인 전체 에러율 최소화가 달성되었다. 이 제안된 온라인 학습 방법론을 객체 추적에 적용하여 총 8개의 데이터베이스에서 다른 추적 방법론들 보다 좋은 성능이 달성되었다.

Application of Terahertz Spectroscopy and Imaging in the Diagnosis of Prostate Cancer

  • Zhang, Ping;Zhong, Shuncong;Zhang, Junxi;Ding, Jian;Liu, Zhenxiang;Huang, Yi;Zhou, Ning;Nsengiyumva, Walter;Zhang, Tianfu
    • Current Optics and Photonics
    • /
    • 제4권1호
    • /
    • pp.31-43
    • /
    • 2020
  • The feasibility of the application of terahertz electromagnetic waves in the diagnosis of prostate cancer was examined. Four samples of incomplete cancerous prostatic paraffin-embedded tissues were examined using terahertz spectral imaging (TPI) system and the results obtained by comparing the absorption coefficient and refractive index of prostate tumor, normal prostate tissue and smooth muscle from one of the paraffin tissue masses examined were reported. Three hundred and sixty cases of absorption coefficients from one of the paraffin tissues examined were used as raw data to classify these three tissues using the Principal Component Analysis (PCA) and Least Squares Support Vector Machine (LS-SVM). An excellent classification with an accuracy of 92.22% in the prediction set was achieved. Using the distribution information of THz reflection signal intensity from sample surface and absorption coefficient of the sample, an attempt was made to use the TPI system to identify the boundaries of the different tissues involved (prostate tumors, normal and smooth muscles). The location of three identified regions in the terahertz images (frequency domain slice absorption coefficient imaging, 1.2 THz) were compared with those obtained from the histopathologic examination. The tissue tumor region had a distinctively visible color and could well be distinguished from other tissue regions in terahertz images. Results indicate that a THz spectroscopy imaging system can be efficiently used in conjunction with the proposed advanced computer-based mathematical analysis method to identify tumor regions in the paraffin tissue mass of prostate cancer.