• 제목/요약/키워드: SVM Model

검색결과 698건 처리시간 0.03초

다양한 기계학습 기법의 암상예측 적용성 비교 분석 (Comparative Application of Various Machine Learning Techniques for Lithology Predictions)

  • 정진아;박은규
    • 한국지하수토양환경학회지:지하수토양환경
    • /
    • 제21권3호
    • /
    • pp.21-34
    • /
    • 2016
  • In the present study, we applied various machine learning techniques comparatively for prediction of subsurface structures based on multiple secondary information (i.e., well-logging data). The machine learning techniques employed in this study are Naive Bayes classification (NB), artificial neural network (ANN), support vector machine (SVM) and logistic regression classification (LR). As an alternative model, conventional hidden Markov model (HMM) and modified hidden Markov model (mHMM) are used where additional information of transition probability between primary properties is incorporated in the predictions. In the comparisons, 16 boreholes consisted with four different materials are synthesized, which show directional non-stationarity in upward and downward directions. Futhermore, two types of the secondary information that is statistically related to each material are generated. From the comparative analysis with various case studies, the accuracies of the techniques become degenerated with inclusion of additive errors and small amount of the training data. For HMM predictions, the conventional HMM shows the similar accuracies with the models that does not relies on transition probability. However, the mHMM consistently shows the highest prediction accuracy among the test cases, which can be attributed to the consideration of geological nature in the training of the model.

Model Predictive Power Control of a PWM Rectifier for Electromagnetic Transmitters

  • Zhang, Jialin;Zhang, Yiming;Guo, Bing;Gao, Junxia
    • Journal of Power Electronics
    • /
    • 제18권3호
    • /
    • pp.789-801
    • /
    • 2018
  • Model predictive direct power control (MPDPC) is a widely recognized high-performance control strategy for a three-phase grid-connected pulse width modulation (PWM) rectifier. Unlike those of conventional grid-connected PWM rectifiers, the active and reactive powers of permanent magnet synchronous generator (PMSG)-connected PWM rectifiers, which are used in electromagnetic transmitters, cannot be calculated as the product of voltage and current because the back electromotive force (EMF) of the generator cannot be measured directly. In this study, the predictive power model of the rectifier is obtained by analyzing the relationship among flux, back EMF, active/reactive power, converter voltage, and stator current of the generator. The concept of duty cycle control in the proposed MPDPC is introduced by allocating a fraction of the control period for a nonzero vector and rest time for a zero vector. When nonzero vectors and their duration in the predefined cost function are simultaneously evaluated, the global power ripple minimization is obtained. Simulation and experimental results prove that the proposed MPDPC strategy with duty cycle control for the PMSG-connected PWM rectifier can achieve better control performance than the conventional MPDPC-SVM with grid-connected PWM rectifier.

An Efficient Machine Learning-based Text Summarization in the Malayalam Language

  • P Haroon, Rosna;Gafur M, Abdul;Nisha U, Barakkath
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제16권6호
    • /
    • pp.1778-1799
    • /
    • 2022
  • Automatic text summarization is a procedure that packs enormous content into a more limited book that incorporates significant data. Malayalam is one of the toughest languages utilized in certain areas of India, most normally in Kerala and in Lakshadweep. Natural language processing in the Malayalam language is relatively low due to the complexity of the language as well as the scarcity of available resources. In this paper, a way is proposed to deal with the text summarization process in Malayalam documents by training a model based on the Support Vector Machine classification algorithm. Different features of the text are taken into account for training the machine so that the system can output the most important data from the input text. The classifier can classify the most important, important, average, and least significant sentences into separate classes and based on this, the machine will be able to create a summary of the input document. The user can select a compression ratio so that the system will output that much fraction of the summary. The model performance is measured by using different genres of Malayalam documents as well as documents from the same domain. The model is evaluated by considering content evaluation measures precision, recall, F score, and relative utility. Obtained precision and recall value shows that the model is trustable and found to be more relevant compared to the other summarizers.

Development of Big Data-based Cardiovascular Disease Prediction Analysis Algorithm

  • Kyung-A KIM;Dong-Hun HAN;Myung-Ae CHUNG
    • 한국인공지능학회지
    • /
    • 제11권3호
    • /
    • pp.29-34
    • /
    • 2023
  • Recently, the rapid development of artificial intelligence technology, many studies are being conducted to predict the risk of heart disease in order to lower the mortality rate of cardiovascular diseases worldwide. This study presents exercise or dietary improvement contents in the form of a software app or web to patients with cardiovascular disease, and cardiovascular disease through digital devices such as mobile phones and PCs. LR, LDA, SVM, XGBoost for the purpose of developing "Life style Improvement Contents (Digital Therapy)" for cardiovascular disease care to help with management or treatment We compared and analyzed cardiovascular disease prediction models using machine learning algorithms. Research Results XGBoost. The algorithm model showed the best predictive model performance with overall accuracy of 80% before and after. Overall, accuracy was 80.0%, F1 Score was 0.77~0.79, and ROC-AUC was 80%~84%, resulting in predictive model performance. Therefore, it was found that the algorithm used in this study can be used as a reference model necessary to verify the validity and accuracy of cardiovascular disease prediction. A cardiovascular disease prediction analysis algorithm that can enter accurate biometric data collected in future clinical trials, add lifestyle management (exercise, eating habits, etc.) elements, and verify the effect and efficacy on cardiovascular-related bio-signals and disease risk. development, ultimately suggesting that it is possible to develop lifestyle improvement contents (Digital Therapy).

Development of an integrated machine learning model for rheological behaviours and compressive strength prediction of self-compacting concrete incorporating environmental-friendly materials

  • Pouryan Hadi;KhodaBandehLou Ashkan;Hamidi Peyman;Ashrafzadeh Fedra
    • Structural Engineering and Mechanics
    • /
    • 제86권2호
    • /
    • pp.181-195
    • /
    • 2023
  • To predict the rheological behaviours along with the compressive strength of self-compacting concrete that incorporates environmentally friendly ingredients as cement substitutes, a comparative evaluation of machine learning methods is conducted. To model four parameters, slump flow diameter, L-box ratio, V-funnel time, as well as compressive strength at 28 days-a complete mix design dataset from available pieces of literature is gathered and used to construct the suggested machine learning standards, SVM, MARS, and Mp5-MT. Six input variables-the amount of binder, the percentage of SCMs, the proportion of water to the binder, the amount of fine and coarse aggregates, and the amount of superplasticizer are grouped in a particular pattern. For optimizing the hyper-parameters of the MARS model with the lowest possible prediction error, a gravitational search algorithm (GSA) is required. In terms of the correlation coefficient for modelling slump flow diameter, L-box ratio, V-funnel duration, and compressive strength, the prediction results showed that MARS combined with GSA could improve the accuracy of the solo MARS model with 1.35%, 11.1%, 2.3%, as well as 1.07%. By contrast, Mp5-MT often demonstrates greater identification capability and more accurate prediction in comparison to MARS-GSA, and it may be regarded as an efficient approach to forecasting the rheological behaviors and compressive strength of SCC in infrastructure practice.

근육 활성화 모델 기반의 데이터 증강을 활용한 동시 동작 인식 프레임워크 (Simultaneous Motion Recognition Framework using Data Augmentation based on Muscle Activation Model)

  • 김세진;정완균
    • 로봇학회논문지
    • /
    • 제19권2호
    • /
    • pp.203-212
    • /
    • 2024
  • Simultaneous motion is essential in the activities of daily living (ADL). For motion intention recognition, surface electromyogram (sEMG) and corresponding motion label is necessary. However, this process is time-consuming and it may increase the burden of the user. Therefore, we propose a simultaneous motion recognition framework using data augmentation based on muscle activation model. The model consists of multiple point sources to be optimized while the number of point sources and their initial parameters are automatically determined. From the experimental results, it is shown that the framework has generated the data which are similar to the real one. This aspect is quantified with the following two metrics: structural similarity index measure (SSIM) and mean squared error (MSE). Furthermore, with k-nearest neighbor (k-NN) or support vector machine (SVM), the classification accuracy is also enhanced with the proposed framework. From these results, it can be concluded that the generalization property of the training data is enhanced and the classification accuracy is increased accordingly. We expect that this framework reduces the burden of the user from the excessive and time-consuming data acquisition.

기상 데이터를 이용한 데이터 마이닝 기반의 산불 예측 모델 (Data Mining based Forest Fires Prediction Models using Meteorological Data)

  • 김삼근;안재근
    • 한국산학기술학회논문지
    • /
    • 제21권8호
    • /
    • pp.521-529
    • /
    • 2020
  • 산불은 경제, 자연환경, 건강과 같은 삶의 여러 측면에서 몇 가지 악영향을 주는 가장 핵심적인 환경위험 중의 하나이다. 산불의 조기발견, 빠른 예측, 신속한 대응은 산불 위험으로부터 재산과 생명을 구하는데 본질적인 역할을 할 수 있다. 산불의 빠른 발견을 위해 기상청에서 각 지역에 설치한 로컬 센서를 통해 획득한 기상 데이터를 이용하는 방법이 있다. 기상 조건(예: 온도, 바람)은 산불 발생에 영향을 미친다고 알려져 있다. 본 논문에서는 산불의 피해 면적을 예측하기 위해 데이터 마이닝(DM) 기법을 적용한다. 다섯 종류의 DM 모델, 예를 들어 Stochastic Gradient Descent(SGD), Support Vector Machines(SVM), Decision Tree(DT), Random Forests(RF), Deep Neural Network(DNN)과 네 가지 입력 특성 그룹(공간, 시간, 기상 데이터 이용)을 최근 5년간의 경기도 지역에서 수집한 실제 산불 발생 데이터에 적용하였다. 실험결과는 기상 데이터만을 이용한 DNN 모델이 가장 우수한 성능을 보였다. 제안한 모델은 빈도수가 높은 작은 규모의 산불 예측에 더 효과적이었다. 제안한 예측 모델을 통해 도출된 이러한 지식은 소방 자원 관리를 개선하는데 특히 유용하다.

A Classification Model for Illegal Debt Collection Using Rule and Machine Learning Based Methods

  • Kim, Tae-Ho;Lim, Jong-In
    • 한국컴퓨터정보학회논문지
    • /
    • 제26권4호
    • /
    • pp.93-103
    • /
    • 2021
  • 금융당국의 채권추심 가이드라인, 추심업자에 대한 직접적인 관리 감독 수행 등의 노력에도 불구하고 채무자에 대한 불법, 부당한 채권 추심은 지속되고 있다. 이러한 불법, 부당한 채권추심행위를 효과적으로 예방하기 위해서는 비정형데이터 기계학습 등 기술을 활용하여 적은 인력으로도 불법 추심행위에 대한 점검 등에 대한 모니터링을 강화 할 수 있는 방법이 필요하다. 본 연구에서는 대부업체의 추심 녹취 파일을 입수하여 이를 텍스트 데이터로 변환하고 위법, 위규 행위를 판별하는 규칙기반 검출과 SVM(Support Vector Machine) 등 기계학습을 결합한 불법채권추심 분류 모델을 제안하고 기계학습 알고리즘에 따라 얼마나 정확한 식별을 하였는지를 비교해 보았다. 본 연구는 규칙기반 불법 검출과 기계학습을 결합하여 분류에 활용할 경우 기존에 연구된 기계학습만을 적용한 분류모델 보다 정확도가 우수하다는 것을 보여 주었다. 본 연구는 규칙기반 불법검출과 기계학습을 결합하여 불법여부를 분류한 최초의 시도이며 후행연구를 진행하여 모델의 완성도를 높인다면 불법채권 추심행위에 대한 소비자 피해 예방에 크게 기여할 수 있을 것이다.

머신러닝 기법을 활용한 토압식 쉴드TBM 막장압 예측에 관한 연구 (A study on EPB shield TBM face pressure prediction using machine learning algorithms)

  • 권기범;최항석;오주영;김동구
    • 한국터널지하공간학회 논문집
    • /
    • 제24권2호
    • /
    • pp.217-230
    • /
    • 2022
  • 쉴드TBM (Tunnel Boring Machine) 터널 시공에 있어 막장압 관리는 막장면 붕괴, 지반침하 등을 방지하여 막장 안정성을 유지하는 데 중요한 역할을 담당한다. 특히, 챔버 내부의 굴착토로 막장압을 조절하는 토압식 쉴드TBM의 경우, 이수식 쉴드TBM에 비해 막장압의 관리가 어렵다. 본 연구에서는 국내 토압식 쉴드TBM 터널 시공 현장의 지반조건 및 굴진특성 데이터를 분석하여, 토압식 쉴드TBM 터널의 세그먼트 링별 막장압 예측모델을 제시하였다. 예측모델의 입력특성으로 7가지를 선정하였으며, 912개의 학습 데이터 세트(Training data set)와 228개의 시험 데이터 세트(Test data set)를 확보하였다. 최적의 토압식 쉴드TBM 막장압 예측모델 선정을 위하여 KNN (K-Nearest Neighbors), SVM (Support Vector Machine), RF (Random Forest), XGB (eXtreme Gradient Boosting) 모델의 하이퍼파라미터(Hyperparameter)를 최적화하여 예측성능을 비교한 결과, RF 모델이 7.35 kPa의 평균 제곱근 오차(Root Mean Square Error, RMSE)로 가장 우수한 성능을 나타냈다. 추가적으로, RF 모델의 특성 중요도(Feature importance) 분석을 수행한 결과, 입력특성 중 수압의 영향도가 0.38로 가장 높았으며, 전반적으로 지반조건이 굴진특성보다 높은 중요도를 보여주었다.

Stress Identification and Analysis using Observed Heart Beat Data from Smart HRM Sensor Device

  • Pramanta, SPL Aditya;Kim, Myonghee;Park, Man-Gon
    • 한국멀티미디어학회논문지
    • /
    • 제20권8호
    • /
    • pp.1395-1405
    • /
    • 2017
  • In this paper, we analyses heart beat data to identify subjects stress state (binary) using heart rate variability (HRV) features extracted from heart beat data of the subjects and implement supervised machine learning techniques to create the mental stress classifier. There are four steps need to be done: data acquisition, data processing (HRV analysis), features selection, and machine learning, before doing performance measurement. There are 56 features generated from the HRV Analysis module with several of them are selected (using own algorithm) after computing the Pearson Correlation Matrix (p-values). The results of the list of selected features compared with all features data are compared by its model error after training using several machine learning techniques: support vector machine, decision tree, and discriminant analysis. SVM model and decision tree model with using selected features shows close results compared to using all recording by only 1% difference. Meanwhile, the discriminant analysis differs about 5%. All the machine learning method used in this works have 90% maximum average accuracy.