• Title/Summary/Keyword: Hyper parameters

Deep Learning Based Rumor Detection for Arabic Micro-Text

  • Alharbi, Shada;Alyoubi, Khaled;Alotaibi, Fahd
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.11
    • /
    • pp.73-80
    • /
    • 2021
  • Microblogs have become the most popular platforms for obtaining and spreading information, and Twitter is one of the most widely used platforms for sharing everyday life events. However, rumors and misinformation have become pervasive on Arabic social media platforms and can cause inestimable harm to society. It is therefore imperative to study this issue and to distinguish verified information from unverified information. Interest in rumor detection on microblogs has grown recently, but most work targets English, while work on Arabic remains an ongoing research topic that needs more effort. In this paper, we propose a combined Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) model to detect rumors in a Twitter dataset. Various experiments were conducted to tune the hyper-parameters for the best results. Moreover, different neural network models are used to evaluate performance and compare results. Experiments show that the CNN-LSTM model achieved the best accuracy of 0.95 and an F1-score of 0.94, outperforming the state-of-the-art methods.

LSTM algorithm to determine the state of minimum horizontal stress during well logging operation

  • Arsalan Mahmoodzadeh;Seyed Mehdi Seyed Alizadeh;Adil Hussein Mohammed;Ahmed Babeker Elhag;Hawkar Hashim Ibrahim;Shima Rashidi
    • Geomechanics and Engineering
    • /
    • v.34 no.1
    • /
    • pp.43-49
    • /
    • 2023
  • Knowledge of minimum horizontal stress (Shmin) is a significant step in determining the full stress tensor. It provides crucial information for sand production, hydraulic fracturing, determination of the safe mud weight window, reservoir production behavior, and wellbore stability. Calculating Shmin with indirect methods has proved awkward because all of these models require a lot of data. Direct techniques such as hydraulic fracturing are also costly and time-consuming. To address these problems, this work applies the long short-term memory (LSTM) algorithm to Shmin time-series prediction. A total of 13956 datasets obtained from an oil well logging operation were applied in the models. 80% of the data were used for training, and 20% of the data were used for testing. In order to achieve the maximum accuracy of the LSTM model, its hyper-parameters were optimized significantly. Through different statistical indices, the LSTM model's performance was compared with other machine learning methods. Finally, the optimized LSTM model was recommended for Shmin prediction in the well logging operation.
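
The 80/20 split described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' pipeline: it assumes that "13956 datasets" means 13956 time-ordered samples and that the split is chronological (no shuffling), neither of which the abstract states. The helper names `chronological_split` and `make_windows` and the `lookback` parameter are hypothetical.

```python
def chronological_split(series, train_fraction=0.8):
    """Split a time-ordered sequence into train/test parts without shuffling."""
    if not 0.0 < train_fraction < 1.0:
        raise ValueError("train_fraction must be strictly between 0 and 1")
    cut = int(len(series) * train_fraction)  # index of the first test sample
    return series[:cut], series[cut:]


def make_windows(series, lookback):
    """Build (input window, next value) pairs, the usual input shape for an LSTM."""
    return [
        (series[i:i + lookback], series[i + lookback])
        for i in range(len(series) - lookback)
    ]


# Placeholder data with the same length as the dataset in the abstract.
samples = list(range(13956))
train, test = chronological_split(samples)
print(len(train), len(test))
```

Keeping the test block strictly later in time than the training block avoids leaking future values into training, which is the usual reason to prefer this over a random split for time-series prediction.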

Optimize rainfall prediction utilize multivariate time series, seasonal adjustment and Stacked Long short term memory

  • Nguyen, Thi Huong;Kwon, Yoon Jeong;Yoo, Je-Ho;Kwon, Hyun-Han
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2021.06a
    • /
    • pp.373-373
    • /
    • 2021
  • Rainfall forecasting is an important issue that is applied in many areas, such as agriculture, flood warning, and water resources management. In this context, this study proposed a statistical and machine learning-based forecasting model for monthly rainfall. The Bayesian Gaussian process was chosen to optimize the hyperparameters of the Stacked Long Short-term memory (SLSTM) model. The proposed SLSTM model was applied for predicting monthly precipitation at Seoul station, South Korea. Data were retrieved from the Korea Meteorological Administration (KMA) for the period between 1960 and 2019. Four schemes were examined in this study: (i) prediction with only rainfall; (ii) with deseasonalized rainfall; (iii) with rainfall and minimum temperature; (iv) with deseasonalized rainfall and minimum temperature. The root mean squared error (RMSE) of the predicted rainfall, 16-17 mm, is relatively small compared with the average monthly rainfall at Seoul station of 117 mm. The results showed that scheme (iv) gives the best prediction result. Therefore, this approach is more straightforward than the hydrological and hydraulic models, which require much more input data. The result indicated that a deep learning network could be applied successfully in the hydrology field. Overall, the proposed method is promising, offering a good solution for rainfall prediction.
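
The seasonal adjustment and the RMSE figure in this abstract can be illustrated with a short sketch. The abstract does not say how the rainfall was deseasonalized; subtracting the long-term mean of each calendar month is assumed here as one common choice, and all function names and toy numbers are hypothetical. For scale, an RMSE of 16-17 mm against a 117 mm monthly mean is roughly 14% of the mean.

```python
import math


def monthly_climatology(values, months):
    """Mean of `values` for each calendar month (1-12) present in `months`."""
    totals, counts = {}, {}
    for value, month in zip(values, months):
        totals[month] = totals.get(month, 0.0) + value
        counts[month] = counts.get(month, 0) + 1
    return {month: totals[month] / counts[month] for month in totals}


def deseasonalize(values, months):
    """Subtract each month's climatological mean (assumed adjustment method)."""
    clim = monthly_climatology(values, months)
    anomalies = [value - clim[month] for value, month in zip(values, months)]
    return anomalies, clim


def rmse(observed, predicted):
    """Root mean squared error between two equal-length sequences."""
    squared = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared) / len(squared))


# Toy example: two Januaries and two Julys (values in mm are made up).
rain = [20.0, 350.0, 30.0, 410.0]
month = [1, 7, 1, 7]
anomalies, clim = deseasonalize(rain, month)
print(anomalies, clim)
```

A model trained on the anomalies would add the stored climatology back to its predictions before errors are computed in millimetres.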

Analyzing performance of time series classification using STFT and time series imaging algorithms

  • Sung-Kyu Hong;Sang-Chul Kim
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.4
    • /
    • pp.1-11
    • /
    • 2023
  • In this paper, instead of using a recurrent neural network, we compare the classification performance of time series imaging algorithms using a convolutional neural network. The TSC (Time Series Classification) community has traditional algorithms that turn time series data into images, e.g. GAF (Gramian Angular Field), MTF (Markov Transition Field), and RP (Recurrence Plot). Furthermore, we compare the STFT (Short-Time Fourier Transform) algorithm, which produces a spectrogram that visualizes features of voice data. We test the CNN's performance while adjusting the hyper parameters of the imaging algorithms. When evaluated on the GunPoint dataset in the UCR archive, STFT has higher accuracy than the other algorithms. GAF also reaches 98~99% accuracy, but has the disadvantage that the image size is massive.
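
To make the image-size remark concrete, here is a minimal sketch of one GAF variant, the Gramian Angular Summation Field. It is not the authors' code: it assumes the standard construction (rescale the series to [-1, 1], take the arccosine, and fill the matrix with cos(phi_i + phi_j)), and the function name is hypothetical.

```python
import math


def gramian_angular_summation_field(series):
    """Encode a 1-D series of length n as an n x n matrix of cos(phi_i + phi_j)."""
    lo, hi = min(series), max(series)
    if hi == lo:
        raise ValueError("series must not be constant")
    # Rescale to [-1, 1] so that arccos is defined for every sample.
    scaled = [2.0 * (x - lo) / (hi - lo) - 1.0 for x in series]
    # Clamp to guard against tiny floating-point overshoot outside [-1, 1].
    phi = [math.acos(max(-1.0, min(1.0, s))) for s in scaled]
    return [[math.cos(a + b) for b in phi] for a in phi]


image = gramian_angular_summation_field([0.0, 1.0, 2.0, 1.0])
print(len(image), len(image[0]))
```

Because the output has one pixel per pair of time steps, the image grows quadratically with the series length, which is consistent with the size disadvantage noted in the abstract.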

Future inflow projection based on Bayesian optimization for hyper-parameters (하이퍼매개변수 베이지안 최적화 기법을 적용한 미래 유입량 예측)

  • Tran, Trung Duc;Kim, Jongho
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2022.05a
    • /
    • pp.347-347
    • /
    • 2022
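  • With the recent rapid progress of data science, various deep learning algorithms have been developed and applied to the water resources field. This study proposes a deep learning model, BO-LSTM, that combines a Long Short-Term Memory (LSTM) network with Bayesian optimization (BO) to project daily ensemble future dam inflow. The BO-LSTM hyperparameters and loss function are trained and optimized through Bayesian optimization, and the BO approach was able to optimize the model's hyperparameters and loss function quickly and with high accuracy (R=0.92 and NSE=0.85). In addition, the LSTM structure for predicting future dam inflow was divided into a Forecasting model and a Projection model, and the advantages and disadvantages of the two models were analyzed. The results confirm that the data processing step is effective in raising training efficiency and reducing noise, and that the LSTM structure affects future predictions. The study was applied to future projections for the Soyang River basin over the period 2020-2100. Overall, according to CMIP6 data, future inflow was found to increase by 10% to 50%, which is similar to the magnitude of the increase in future precipitation. Reliable inflow estimates will help policymakers and operators in reservoir operation, planning, and management.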
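
The two scores quoted in this abstract can be computed as sketched below, assuming that R denotes the Pearson correlation coefficient and NSE the Nash-Sutcliffe efficiency, as is common in hydrology; the abstract itself does not define them. The function names and toy inflow values are hypothetical.

```python
import math


def nash_sutcliffe_efficiency(observed, simulated):
    """NSE = 1 - sum((obs - sim)^2) / sum((obs - mean(obs))^2)."""
    mean_obs = sum(observed) / len(observed)
    residual = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    variance = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - residual / variance


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (std_x * std_y)


# Toy daily inflows (made-up numbers, arbitrary units).
observed = [10.0, 20.0, 30.0, 40.0]
simulated = [12.0, 18.0, 33.0, 39.0]
print(nash_sutcliffe_efficiency(observed, simulated), pearson_r(observed, simulated))
```

An NSE of 1 means a perfect match, while 0 means the model is no better than always predicting the observed mean, so NSE penalizes bias that R alone would not reveal.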

Hyperparameter optimization for Lightweight and Resource-Efficient Deep Learning Model in Human Activity Recognition using Short-range mmWave Radar (mmWave 레이더 기반 사람 행동 인식 딥러닝 모델의 경량화와 자원 효율성을 위한 하이퍼파라미터 최적화 기법)

  • Jiheon Kang
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.18 no.6
    • /
    • pp.319-325
    • /
    • 2023
  • In this study, we proposed a method for hyperparameter optimization in the building and training of a deep learning model designed to process point cloud data collected by a millimeter-wave radar system. The primary aim of this study is to facilitate the deployment of a baseline model in resource-constrained IoT devices. We evaluated a RadHAR baseline deep learning model trained on a public dataset composed of point clouds representing five distinct human activities. Additionally, we introduced a coarse-to-fine hyperparameter optimization procedure, showing substantial potential to enhance model efficiency without compromising predictive performance. Experimental results show the feasibility of significantly reducing model size without adversely impacting performance. Specifically, the optimized model demonstrated a 3.3% improvement in classification accuracy despite a 16.8% reduction in the number of parameters compared to the baseline model. In conclusion, this research offers valuable insights for the development of deep learning models for resource-constrained IoT devices, underscoring the potential of hyperparameter optimization and model size reduction strategies. This work contributes to enhancing the practicality and usability of deep learning models in real-world environments, where high levels of accuracy and efficiency in data processing and classification tasks are required.
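
A coarse-to-fine search of the kind named in this abstract might look like the sketch below. The abstract gives no details of the actual procedure, so this is a generic grid-refinement scheme over a single hypothetical hyperparameter (the base-10 logarithm of a learning rate), with a made-up quadratic standing in for validation loss.

```python
def grid(lo, hi, steps):
    """Evenly spaced values from lo to hi, both ends included."""
    return [lo + (hi - lo) * i / (steps - 1) for i in range(steps)]


def coarse_to_fine_search(objective, lo, hi, steps=5, rounds=3):
    """Minimise a 1-D objective by repeatedly refining a grid around the best point."""
    best_x, best_val = None, float("inf")
    for _ in range(rounds):
        for x in grid(lo, hi, steps):
            val = objective(x)
            if val < best_val:
                best_x, best_val = x, val
        # Shrink the window to one grid spacing either side of the best point.
        spacing = (hi - lo) / (steps - 1)
        lo, hi = best_x - spacing, best_x + spacing
    return best_x, best_val


def toy_validation_loss(log10_lr):
    """Stand-in for a real training run; its minimum sits at log10_lr = -3.4."""
    return (log10_lr + 3.4) ** 2


best, loss = coarse_to_fine_search(toy_validation_loss, lo=-6.0, hi=0.0)
print(best, loss)
```

With 5 grid points and 3 rounds this evaluates the objective 15 times, whereas a single grid at the final resolution over the whole range would need 17 points, and the gap widens quickly with more hyperparameters.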

Standardization and Development of Pharmacopoeial Standard Operating Procedures (SOPs) of Classical Unani Formulation

  • Mannan, Mohd Nazir;Kazmi, Munawwar Husain;Zakir, Mohammad;Naikodi, Mohammed Abdul Rasheed;Zahid, Uzma;Siddiqui, Javed Inam
    • CELLMED
    • /
    • v.10 no.2
    • /
    • pp.16.1-16.8
    • /
    • 2020
  • Standardization of drugs deals with confirmation of drug identity and determination of drug quality and purity. Unani herbal formulations are used in traditional medicine for the treatment of various diseases. Cancer is a disease which causes abnormal growth of body tissue or cells, which tend to proliferate in an uncontrolled way. Spread of cancer from the site of origin to other organs of the body is called metastasis. It is a hyper proliferative disorder involving transformation, dysregulation of apoptosis, invasion and angiogenesis. The present study aimed to standardize a classical Unani formulation (CUF) described as having anticancer properties. The CUF has been used for anti-cancerous activity (Dāfi'-i-saraṭān) in the human population by Unani physicians for centuries. The standardization parameters carried out for the classical Unani formulation are pharmacognostical studies, physicochemical parameters, high-performance thin layer chromatography (HPTLC), microbial load, aflatoxins, and heavy metals, revealing specific identities and evaluating Pharmacopoeial standards. The experiments and the data obtained established the Pharmacopoeial standards for this formulation for identification and quality control purposes. The CUF has been successfully standardized, and standard operating procedures (SOPs) for its preparation have been laid down, which may serve as a standard reference in future. The standardization data of this formulation may be used as a standard guideline for preparation of the formulation in future.

QoS-Aware Optimal SNN Model Parameter Generation Method in Neuromorphic Environment (뉴로모픽 환경에서 QoS를 고려한 최적의 SNN 모델 파라미터 생성 기법)

  • Seoyeon Kim;Bongjae Kim;Jinman Jung
    • Smart Media Journal
    • /
    • v.12 no.4
    • /
    • pp.19-26
    • /
    • 2023
  • IoT edge services utilizing neuromorphic hardware architectures are suitable for autonomous IoT applications as they perform intelligent processing on the device itself. However, spiking neural networks applied to neuromorphic hardware are difficult for IoT developers to comprehend due to their complex structures and various hyper-parameters. In this paper, we propose a method for generating spiking neural network (SNN) models that satisfy user performance requirements while considering the constraints of neuromorphic hardware. Our proposed method utilizes previously trained models from pre-processed data to find optimal SNN model parameters from profiling data. Comparing our method to a naive search method, both methods satisfy user requirements, but our proposed method shows better performance in terms of runtime. Additionally, even if the constraints of new hardware are not clearly known, the proposed method can provide high scalability by utilizing the profiled data of the hardware.

DNN based Binary Classification Model by Particular Matter Concentration (DNN 기반의 미세먼지 농도별 이진 분류 모델)

  • Lee, Jong-sung;Jung, Yong-jin;Oh, Chang-heon
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference
    • /
    • 2021.05a
    • /
    • pp.277-279
    • /
    • 2021
  • Depending on the characteristics of each particulate matter concentration range, a prediction model may not be trained well. To solve this problem, it is necessary to design separate prediction models for low and high concentrations. Therefore, a classification model is needed to classify the concentration of particulate matter into low and high concentrations. This paper proposes a classification model that separates low and high concentrations based on the concentration of particulate matter. DNN was used as the classification model algorithm, and the classification model was designed by applying the optimal parameters found by searching the hyper parameters. In the performance evaluation of the model, 97.54% was measured for low-concentration classification and 85.51% for high-concentration classification.

Application and Comparison of Data Mining Technique to Prevent Metal-Bush Omission (메탈부쉬 누락예방을 위한 데이터마이닝 기법의 적용 및 비교)

  • Sang-Hyun Ko;Dongju Lee
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.46 no.3
    • /
    • pp.139-147
    • /
    • 2023
  • The metal bush assembly process inserts and compresses a metal bush, which serves to reduce noise and ensure stable compression in the rotating section. In this process, head diameter defects and placement defects of the metal bush occur due to metal bush omission, non-pressing, and poor press-fitting. Among these causes of defects, this study aims to prevent defects due to omission of the metal bush by using signals from sensors attached to the facility. In particular, metal bush omission is predicted through various data mining techniques using left load cell value, right load cell value, current, and voltage as independent variables. In the case of the metal bush omission defect, it is difficult to obtain defect data, resulting in data imbalance. Data imbalance refers to a case where there is a large difference in the number of data belonging to each class, which can be a problem when performing classification prediction. In order to solve the problem caused by data imbalance, oversampling and composite sampling techniques were applied in this study. In addition, simulated annealing (SA) was applied to optimize the parameters related to sampling and the hyper-parameters of the data mining techniques used for bush omission prediction. In this study, metal bush omission was predicted using the actual data of M manufacturing company, and the classification performance was examined. All applied techniques showed excellent results, and in particular, the proposed methods, the method of mixing Random Forest and SA, and the method of mixing MLP and SA, showed better results.
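
The use of simulated annealing for tuning, mentioned in this abstract, can be sketched generically as follows. The abstract does not describe the authors' SA configuration, so the geometric cooling schedule, the neighbour move, the two hyperparameters (`n_trees`, `sampling_ratio`) and the toy cost function are all assumptions made for illustration.

```python
import math
import random


def simulated_annealing(objective, initial, neighbour, steps=2000,
                        start_temp=1.0, cooling=0.995, seed=0):
    """Minimise `objective`, tracking the best state seen during the walk."""
    rng = random.Random(seed)
    current, current_cost = initial, objective(initial)
    best, best_cost = current, current_cost
    temp = start_temp
    for _ in range(steps):
        candidate = neighbour(current, rng)
        cost = objective(candidate)
        delta = cost - current_cost
        # Always accept improvements; accept worse moves with probability exp(-delta/T).
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            current, current_cost = candidate, cost
            if current_cost < best_cost:
                best, best_cost = current, current_cost
        temp *= cooling  # geometric cooling (assumed schedule)
    return best, best_cost


def toy_cost(params):
    """Stand-in for a cross-validated error; lowest at 120 trees and ratio 0.6."""
    n_trees, sampling_ratio = params
    return ((n_trees - 120) / 100.0) ** 2 + (sampling_ratio - 0.6) ** 2


def perturb(params, rng):
    """Propose a nearby setting, clamped to the allowed ranges."""
    n_trees, sampling_ratio = params
    n_trees = min(500, max(10, n_trees + rng.randint(-10, 10)))
    sampling_ratio = min(1.0, max(0.1, sampling_ratio + rng.uniform(-0.05, 0.05)))
    return n_trees, sampling_ratio


start = (300, 1.0)
best_params, best_cost = simulated_annealing(toy_cost, start, perturb)
print(best_params, best_cost)
```

In a real setting the objective would train and validate the classifier on resampled data for each candidate, so the number of steps would be chosen with that cost in mind.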