• Title/Summary/Keyword: Hyperbolic tangent

Search Result 52, Processing Time 0.026 seconds

Approximation of Polynomials and Step function for cosine modulated Gaussian Function in Neural Network Architecture (뉴로 네트워크에서 코사인 모듈화 된 가우스함수의 다항식과 계단함수의 근사)

  • Lee, Sang-Wha
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.49 no.2
    • /
    • pp.115-122
    • /
    • 2012
  • We present here a new class of activation functions for neural networks, which herein will be called CosGauss function. This function is a cosine-modulated gaussian function. In contrast to the sigmoidal-, hyperbolic tangent- and gaussian activation functions, more ridges can be obtained by the CosGauss function. It will be proven that this function can be used to aproximate polynomials and step functions. The CosGauss function was tested with a Cascade-Correlation-Network of the multilayer structure on the Tic-Tac-Toe game and iris plants problems, and results are compared with those obtained with other activation functions.

Evaluation of existing bridges using neural networks

  • Molina, Augusto V.;Chou, Karen C.
    • Structural Engineering and Mechanics
    • /
    • v.13 no.2
    • /
    • pp.187-209
    • /
    • 2002
  • The infrastructure system in the United States has been aging faster than the resource available to restore them. Therefore decision for allocating the resources is based in part on the condition of the structural system. This paper proposes to use neural network to predict the overall rating of the structural system because of the successful applications of neural network to other fields which require a "symptom-diagnostic" type relationship. The goal of this paper is to illustrate the potential of using neural network in civil engineering applications and, particularly, in bridge evaluations. Data collected by the Tennessee Department of Transportation were used as "test bed" for the study. Multi-layer feed forward networks were developed using the Levenberg-Marquardt training algorithm. All the neural networks consisted of at least one hidden layer of neurons. Hyperbolic tangent transfer functions were used in the first hidden layer and log-sigmoid transfer functions were used in the subsequent hidden and output layers. The best performing neural network consisted of three hidden layers. This network contained three neurons in the first hidden layer, two neurons in the second hidden layer and one neuron in the third hidden layer. The neural network performed well based on a target error of 10%. The results of this study indicate that the potential for using neural networks for the evaluation of infrastructure systems is very good.

Fast Convergence GRU Model for Sign Language Recognition

  • Subramanian, Barathi;Olimov, Bekhzod;Kim, Jeonghong
    • Journal of Korea Multimedia Society
    • /
    • v.25 no.9
    • /
    • pp.1257-1265
    • /
    • 2022
  • Recognition of sign language is challenging due to the occlusion of hands, accuracy of hand gestures, and high computational costs. In recent years, deep learning techniques have made significant advances in this field. Although these methods are larger and more complex, they cannot manage long-term sequential data and lack the ability to capture useful information through efficient information processing with faster convergence. In order to overcome these challenges, we propose a word-level sign language recognition (SLR) system that combines a real-time human pose detection library with the minimized version of the gated recurrent unit (GRU) model. Each gate unit is optimized by discarding the depth-weighted reset gate in GRU cells and considering only current input. Furthermore, we use sigmoid rather than hyperbolic tangent activation in standard GRUs due to performance loss associated with the former in deeper networks. Experimental results demonstrate that our pose-based optimized GRU (Pose-OGRU) outperforms the standard GRU model in terms of prediction accuracy, convergency, and information processing capability.

Precision and Safety Comparison for SM, CRM and ATD in Phase I Clinical Trials (제 1상 임상시험의 SM, CRM, ATD에서 결정된 MTD의 정확성과 안전성 비교)

  • Kim, Dong-Uk;Kil, Sun-Kyoung
    • Communications for Statistical Applications and Methods
    • /
    • v.16 no.1
    • /
    • pp.51-65
    • /
    • 2009
  • The purpose of a phase I clinical trial is to determine the maximum tolerated dose(MTD) of a new drug. This paper investigates the performance of standard method, continual reassessment method and accelerated titration designs in phase I clinical trials. Especially we study the precision and safety at the MTD of these methods. We utilize hyperbolic tangent function and power function to define dose-toxicity model. For each method, expected toxicity rate at MTD is computed and compared with target toxicity probability. We also suggest some modifications of these methods and show some improvements in performance.

Modelling of starch industry wastewater microfiltration parameters by neural network

  • Jokic, Aleksandar I.;Seres, Laslo L.;Milovic, Nemanja R.;Seres, Zita I.;Maravic, Nikola R.;Saranovic, Zana;Dokic, Ljubica P.
    • Membrane and Water Treatment
    • /
    • v.9 no.2
    • /
    • pp.115-121
    • /
    • 2018
  • Artificial neural network (ANN) simulation is used to predict the dynamic change of permeate flux during wheat starch industry wastewater microfiltration with and without static turbulence promoter. The experimental program spans range of a sedimentation times from 2 to 4 h, for feed flow rates 50 to 150 L/h, at transmembrane pressures covering the range of $1{\times}10^5$ to $3{\times}10^5Pa$. ANN predictions of the wastewater microfiltration are compared with experimental results obtained using two different set of microfiltration experiments, with and without static turbulence promoter. The effects of the training algorithm, neural network architectures on the ANN performance are discussed. For the most of the cases considered, the ANN proved to be an adequate interpolation tool, where an excellent prediction was obtained using automated Bayesian regularization as training algorithm. The optimal ANN architecture was determined as 4-10-1 with hyperbolic tangent sigmoid transfer function transfer function for hidden and output layers. The error distributions of data revealed that experimental results are in very good agreement with computed ones with only 2% data points had absolute relative error greater than 20% for the microfiltration without static turbulence promoter whereas for the microfiltration with static turbulence promoter it was 1%. The contribution of filtration time variable to flux values provided by ANNs was determined in an important level at the range of 52-66% due to increased membrane fouling by the time. In the case of microfiltration with static turbulence promoter, relative importance of transmembrane pressure and feed flow rate increased for about 30%.

Development of hybrid activation function to improve accuracy of water elevation prediction algorithm (수위예측 알고리즘 정확도 향상을 위한 Hybrid 활성화 함수 개발)

  • Yoo, Hyung Ju;Lee, Seung Oh
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2019.05a
    • /
    • pp.363-363
    • /
    • 2019
  • 활성화 함수(activation function)는 기계학습(machine learning)의 학습과정에 비선형성을 도입하여 심층적인 학습을 용이하게 하고 예측의 정확도를 높이는 중요한 요소 중 하나이다(Roy et al., 2019). 일반적으로 기계학습에서 사용되고 있는 활성화 함수의 종류에는 계단 함수(step function), 시그모이드 함수(sigmoid 함수), 쌍곡 탄젠트 함수(hyperbolic tangent function), ReLU 함수(Rectified Linear Unit function) 등이 있으며, 예측의 정확도 향상을 위하여 다양한 형태의 활성화 함수가 제시되고 있다. 본 연구에서는 기계학습을 통하여 수위예측 시 정확도 향상을 위하여 Hybrid 활성화 함수를 제안하였다. 연구대상지는 조수간만의 영향을 받는 한강을 대상으로 선정하였으며, 2009년 ~ 2018년까지 10년간의 수문자료를 활용하였다. 수위예측 알고리즘은 Python 내 Tensorflow의 RNN (Recurrent Neural Networks) 모델을 이용하였으며, 강수량, 수위, 조위, 댐 방류량, 하천 유량의 수문자료를 학습시켜 3시간 및 6시간 후의 수위를 예측하였다. 예측정확도 향상을 위하여 입력 데이터는 정규화(Normalization)를 시켰으며, 민감도 분석을 통하여 신경망모델의 은닉층 개수, 학습률의 최적 값을 도출하였다. Hybrid 활성화 함수는 쌍곡 탄젠트 함수와 ReLU 함수를 혼합한 형태로 각각의 가중치($w_1,w_2,w_1+w_2=1$)를 변경하여 정확도를 평가하였다. 그 결과 가중치의 비($w_1/w_2$)에 따라서 예측 결과의 RMSE(Roote Mean Square Error)가 최소가 되고 NSE (Nash-Sutcliffe model Efficiency coefficient)가 최대가 되는 지점과 Peak 수위의 예측정확도가 최대가 되는 지점을 확인할 수 있었다. 본 연구는 현재 Data modeling을 통한 수위예측의 정확도 향상을 위해 기초가 되는 연구이나, 향후 다양한 형태의 활성화 함수를 제안하여 정확도를 향상시킨다면 예측 결과를 통하여 침수예보에 대한 의사결정이 가능할 것으로 기대된다.

  • PDF

Early adjusting damping force for sloped rolling-type seismic isolators based on earthquake early warning information

  • Hsu, Ting-Yu;Huang, Chih-Hua;Wang, Shiang-Jung
    • Earthquakes and Structures
    • /
    • v.20 no.1
    • /
    • pp.39-53
    • /
    • 2021
  • By means of installing sloped rolling-type seismic isolators (SRI), the horizontal acceleration transmitted to the to-be-protected object above can be effectively and significantly reduced under external disturbance. To prevent the maximum horizontal displacement response of SRI from reaching a threshold, designing large and conservative damping force for SRI might be required, which will also enlarge the transmitted acceleration response. In a word, when adopting seismic isolation, minimizing acceleration or displacement responses is always a trade-off. Therefore, this paper proposes that by exploiting the possible information provided by an earthquake early warning system, the damping force applied to SRI which can better control both acceleration and displacement responses might be determined in advance and accordingly adjusted in a semi-active control manner. By using a large number of ground motion records with peak ground acceleration not less than 80 gal, the numerical results present that the maximum horizontal displacement response of SRI is highly correlated with and proportional to some important parameters of input excitations, the velocity pulse energy rate and peak velocity in particular. A control law employing the basic form of hyperbolic tangent function and two objective functions are considered in this study for conceptually developing suitable control algorithms. Compared with the numerical results of simply designing a constant, large damping factor to prevent SRI from pounding, adopting the recommended control algorithms can have more than 60% reduction of acceleration responses in average under the excitations. More importantly, it is effective in reducing acceleration responses under approximately 98% of the excitations.

A study on activation functions of Artificial Neural Network model suitable for prediction of the groundwater level in the mid-mountainous area of eastern Jeju island (제주도 동부 중산간지역 지하수위 예측에 적합한 인공신경망 모델의 활성화함수 연구)

  • Mun-Ju Shin;Jeong-Hun Kim;Su-Yeon Kang;Jeong-Han Lee;Kyung Goo Kang
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2023.05a
    • /
    • pp.520-520
    • /
    • 2023
  • 제주도 동부 중산간 지역은 화산암으로 구성된 지하지질로 인해 지하수위의 변동폭이 크고 변동양상이 복잡하여 인공신경망(Artificial Neural Network, ANN) 모델 등을 활용한 지하수위의 예측이 어렵다. ANN에 적용되는 활성화함수에 따라 지하수의 예측성능은 달라질 수 있으므로 활성화함수의 비교분석 후 적절한 활성화함수의 사용이 반드시 필요하다. 본 연구에서는 5개 활성화함수(sigmoid, hyperbolic tangent(tanh), Rectified Linear Unit(ReLU), Leaky Rectified Linear Unit(Leaky ReLU), Exponential Linear Unit(ELU))를 제주도 동부 중산간지역에 위치한 2개 지하수 관정에 대해 비교분석하여 최적 활성화함수 도출을 목표로 한다. 또한 최적 활성화함수를 활용한 ANN의 적용성을 평가하기 위해 최근 널리 사용되고 있는 순환신경망 모델인 Long Short-Term Memory(LSTM) 모델과 비교분석 하였다. 그 결과, 2개 관정 중 지하수위 변동폭이 상대적으로 큰 관정은 ELU 함수, 상대적으로 작은 관정은 Leaky ReLU 함수가 지하수위 예측에 적절하였다. 예측성능이 가장 낮은 활성화함수는 sigmoid 함수로 나타나 첨두 및 최저 지하수위 예측 시 사용을 지양해야 할 것으로 판단된다. 도출된 최적 활성화함수를 사용한 ANN-ELU 모델 및 ANN-Leaky ReLU 모델을 LSTM 모델과 비교분석한 결과 대등한 지하수위 예측성능을 나타내었다. 이것은 feed-forward 방식인 ANN 모델을 사용하더라도 적절한 활성화함수를 사용하면 최신 순환신경망과 대등한 결과를 도출하여 활용 가능성이 충분히 있다는 것을 의미한다. 마지막으로 LSTM 모델은 가장 적절한 예측성능을 나타내어 다양한 인공지능 모델의 예측성능 비교를 위한 기준이 되는 참고모델로 활용 가능하다. 본 연구에서 제시한 방법은 지하수위 예측과 더불어 하천수위 예측 등 다양한 시계열예측 및 분석연구에 유용하게 사용될 수 있다.

  • PDF

Analysis of Activation Energy of Thermal Aging Embrittlement in Cast Austenite Stainless Steels (주조 오스테나이트 스테인리스강의 열취화 활성화에너지 분석)

  • Gyeong-Geun Lee;Suk-Min Hong;Ji-Su Kim;Dong-Hyun Ahn;Jong-Min Kim
    • Transactions of the Korean Society of Pressure Vessels and Piping
    • /
    • v.20 no.1
    • /
    • pp.56-65
    • /
    • 2024
  • Cast austenitic stainless steels (CASS) and austenitic stainless steel weldments with a ferrite-austenite duplex structure are widely used in nuclear power plants, incorporating ferrite phase to enhance strength, stress relief, and corrosion resistance. Thermal aging at 290-325℃ can induce embrittlement, primarily due to spinodal decomposition and G-phase precipitation in the ferrite phase. This study evaluates the effects of thermal aging by collecting and analyzing various mechanical properties, such as Charpy impact energy, ferrite microhardness, and tensile strength, from various literature sources. Different model expressions, including hyperbolic tangent and phase transformation equations, are applied to calculate activation energy (Q) of room-temperature impact energies, and the results are compared. Additionally, predictive models for Q based on material composition are evaluated, and the potential of machine learning techniques for improving prediction accuracy is explored. The study also examines the use of ferrite microhardness and tensile strength in calculating Q and assessing thermal embrittlement. The findings provide insights for developing advanced prediction models for the thermal embrittlement behavior of CASS and the weldments of austenitic steels, contributing to the safety and reliability of nuclear power plant components.

A Study on the Data Driven Neural Network Model for the Prediction of Time Series Data: Application of Water Surface Elevation Forecasting in Hangang River Bridge (시계열 자료의 예측을 위한 자료 기반 신경망 모델에 관한 연구: 한강대교 수위예측 적용)

  • Yoo, Hyungju;Lee, Seung Oh;Choi, Seohye;Park, Moonhyung
    • Journal of Korean Society of Disaster and Security
    • /
    • v.12 no.2
    • /
    • pp.73-82
    • /
    • 2019
  • Recently, as the occurrence frequency of sudden floods due to climate change increased, the flood damage on riverside social infrastructures was extended so that there has been a threat of overflow. Therefore, a rapid prediction of potential flooding in riverside social infrastructure is necessary for administrators. However, most current flood forecasting models including hydraulic model have limitations which are the high accuracy of numerical results but longer simulation time. To alleviate such limitation, data driven models using artificial neural network have been widely used. However, there is a limitation that the existing models can not consider the time-series parameters. In this study the water surface elevation of the Hangang River bridge was predicted using the NARX model considering the time-series parameter. And the results of the ANN and RNN models are compared with the NARX model to determine the suitability of NARX model. Using the 10-year hydrological data from 2009 to 2018, 70% of the hydrological data were used for learning and 15% was used for testing and evaluation respectively. As a result of predicting the water surface elevation after 3 hours from the Hangang River bridge in 2018, the ANN, RNN and NARX models for RMSE were 0.20 m, 0.11 m, and 0.09 m, respectively, and 0.12 m, 0.06 m, and 0.05 m for MAE, and 1.56 m, 0.55 m and 0.10 m for peak errors respectively. By analyzing the error of the prediction results considering the time-series parameters, the NARX model is most suitable for predicting water surface elevation. This is because the NARX model can learn the trend of the time series data and also can derive the accurate prediction value even in the high water surface elevation prediction by using the hyperbolic tangent and Rectified Linear Unit function as an activation function. However, the NARX model has a limit to generate a vanishing gradient as the sequence length becomes longer. In the future, the accuracy of the water surface elevation prediction will be examined by using the LSTM model.